// topic

Generative AI

17 trending in last 90 days · 17 all-time

// ecosystem

AI 17

// recent newcomers

#1 Toonflow-app: One-stop AI Short Drama Creation Workbench · 🆕 3mo ago · ↗ 312.31/d · ★ 7,460
#2 ArcReel: Open-source AI-powered video generation workbench · 🆕 3mo ago · ↗ 91.86/d · ★ 2,045
#3 ERNIE-Image: High-Performance Open-Source Text-to-Image Diffusion Model · 🆕 18d ago · ↗ 53.67/d · ★ 412
#4 Agent Sprite Forge: AI-Powered 2D Game Asset Generation · 🆕 9d ago · ↗ 15.49/d · ★ 70
#5 OpenMontage: The First Agentic Video Production System · 🆕 1mo ago · ↗ 5.78/d · ★ 68

// this week's top 9

HBAI-Ltd / Toonflow-app

Toonflow-app is an AI workbench designed for short drama production, achieving full-process automation from script to video through an infinite canvas and a three-layer Agent collaboration system. The platform supports chapter event graph-driven adaptation and provides a programmable provider system to flexibly integrate various AI models. Users can leverage its persistent memory system and modular skill configuration to significantly improve the efficiency and consistency of short drama creation.

baidu / ERNIE-Image

ERNIE-Image is an open-source text-to-image model developed by Baidu based on the Diffusion Transformer (DiT) architecture. The model is equipped with a lightweight prompt enhancer that transforms short inputs into structure-rich descriptions, achieving industry-leading generation results at an 8B parameter scale. It excels at handling complex text rendering, multi-object layout, and instruction-following tasks, while supporting efficient deployment on consumer-grade GPUs.

bilibili / Index-anisora

Index-AniSora is a powerful open-source framework designed specifically for high-quality anime video generation and animation production. The system features a comprehensive data processing pipeline, a controllable generation model with spatiotemporal masking, and a specialized evaluation benchmark. It supports diverse creative tasks including character 3D generation, video style transfer, and multimodal guidance for precise motion control.

ArcReel / ArcReel

ArcReel is an open-source AI video generation workbench that implements an automated pipeline from novel scripts to finished videos via a multi-agent architecture. The platform supports integration with various providers including Gemini, Volcengine Ark, Grok, and OpenAI, offering character consistency maintenance and narrative tracking features. Users can manage projects, track costs, and export Jianying drafts through a visual interface to achieve efficient AI-assisted video creation.

0x0funky / agent-sprite-forge

Agent Sprite Forge is a tool designed to convert natural-language prompts into game-ready 2D sprites and layered maps using Codex. It automates the asset pipeline by combining AI image generation with deterministic local post-processing for cleanup and export. The system supports various outputs, including animation sheets, transparent GIFs, collision data, and complex scene layouts.

microsoft / VibeVoice

VibeVoice is a family of open-source voice AI models that utilizes continuous speech tokenizers and next-token diffusion to achieve high-fidelity audio processing. The framework includes advanced tools for long-form speech recognition and real-time streaming text-to-speech generation. These models are designed for research purposes to advance collaboration and innovation within the speech synthesis community.

Anil-matcha / Open-Generative-AI

Open Generative AI is a free, open-source platform providing an unrestricted alternative to commercial AI media tools. It supports over 200 state-of-the-art models for image, video, and lip-sync generation without content filters or subscription fees. Users can access these capabilities through a web-based interface or a desktop application that supports both local and remote inference.

hugohe3 / ppt-master

PPT Master is an open-source tool that converts documents like PDFs, DOCX files, and URLs into fully editable PowerPoint presentations. Unlike image-based AI tools, it generates native DrawingML shapes, text boxes, and charts that users can modify directly in PowerPoint. The workflow integrates with AI IDEs to provide a local, privacy-focused solution for creating professional decks.

fspecii / ace-step-ui

ACE-Step UI provides a professional, Spotify-inspired interface for the open-source ACE-Step 1.5 AI music generation model. It allows users to generate high-quality songs, instrumentals, and lyrics entirely locally without subscription fees or cloud restrictions. The platform includes advanced tools for audio editing, stem extraction, and batch processing to give creators full control over their music production.

// all-time featured (17)

HBAI-Ltd / Toonflow-app

Toonflow-app is an AI workbench designed for short drama production, achieving full-process automation from script to video through an infinite canvas and a three-layer Agent collaboration system. The platform supports chapter event graph-driven adaptation and provides a programmable provider system to flexibly integrate various AI models. Users can leverage its persistent memory system and modular skill configuration to significantly improve the efficiency and consistency of short drama creation.

baidu / ERNIE-Image

ERNIE-Image is an open-source text-to-image model developed by Baidu based on the Diffusion Transformer (DiT) architecture. The model is equipped with a lightweight prompt enhancer that transforms short inputs into structure-rich descriptions, achieving industry-leading generation results at an 8B parameter scale. It excels at handling complex text rendering, multi-object layout, and instruction-following tasks, while supporting efficient deployment on consumer-grade GPUs.

bilibili / Index-anisora

Index-AniSora is a powerful open-source framework designed specifically for high-quality anime video generation and animation production. The system features a comprehensive data processing pipeline, a controllable generation model with spatiotemporal masking, and a specialized evaluation benchmark. It supports diverse creative tasks including character 3D generation, video style transfer, and multimodal guidance for precise motion control.

ArcReel / ArcReel

ArcReel is an open-source AI video generation workbench that implements an automated pipeline from novel scripts to finished videos via a multi-agent architecture. The platform supports integration with various providers including Gemini, Volcengine Ark, Grok, and OpenAI, offering character consistency maintenance and narrative tracking features. Users can manage projects, track costs, and export Jianying drafts through a visual interface to achieve efficient AI-assisted video creation.

0x0funky / agent-sprite-forge

Agent Sprite Forge is a tool designed to convert natural-language prompts into game-ready 2D sprites and layered maps using Codex. It automates the asset pipeline by combining AI image generation with deterministic local post-processing for cleanup and export. The system supports various outputs, including animation sheets, transparent GIFs, collision data, and complex scene layouts.

calesthio / OpenMontage

OpenMontage is an open-source, agentic system that transforms AI coding assistants into comprehensive video production studios. It automates the entire creative workflow, including research, scripting, asset generation, editing, and final composition. The platform supports both AI-generated visuals and real-footage documentary montages using a variety of free and premium tools.

OpenBMB / VoxCPM

VoxCPM2 is a tokenizer-free, 2B parameter text-to-speech system that utilizes a diffusion autoregressive architecture to generate high-quality, expressive audio. The model supports 30 languages and offers advanced capabilities including voice design, controllable voice cloning, and studio-quality 48kHz output. It is fully open-source under the Apache-2.0 license and provides production-ready deployment options via vLLM-Omni and Nano-vLLM.

jd-opensource / JoyAI-Image

JoyAI-Image is a unified multimodal foundation model that integrates an 8B Multimodal Large Language Model with a 16B Multimodal Diffusion Transformer to support image understanding, generation, and editing. The model utilizes a closed-loop collaboration between understanding and generation to enhance spatial reasoning and controllable editing capabilities. It provides a scalable training pipeline and supports advanced features like multi-view generation and precise spatial manipulation.

PenglongHuang / chinese-novelist-skill

Chinese-novelist is a skill plugin for Claude Code that guides users through writing a complete Chinese novel. After the user answers five core questions, the AI generates a detailed outline and character profiles; once the user confirms the plan, it enters automatic creation mode and drafts the chapters of the novel. Automated chapter tracking, coherence management, professional writing principles, and quality checklists help keep the plot consistent and engaging.

microsoft / VibeVoice

VibeVoice is a family of open-source voice AI models that utilizes continuous speech tokenizers and next-token diffusion to achieve high-fidelity audio processing. The framework includes advanced tools for long-form speech recognition and real-time streaming text-to-speech generation. These models are designed for research purposes to advance collaboration and innovation within the speech synthesis community.

mnfst / awesome-free-llm-apis

This repository provides a curated list of LLM API providers that offer permanent free tiers for text inference. It categorizes services into direct provider APIs and third-party inference platforms, detailing model capabilities, context windows, and rate limits. The collection serves as a comprehensive resource for developers seeking cost-effective access to various large language models.

google-ai-edge / gallery

Google AI Edge Gallery is a mobile application designed to run powerful open-source Large Language Models directly on your device. It offers a fully offline and private environment for users to experience advanced generative AI capabilities, including the latest Gemma 4 family. The app provides a comprehensive suite of tools for model management, benchmarking, and interactive AI features.

Anil-matcha / Open-Generative-AI

Open Generative AI is a free, open-source platform providing an unrestricted alternative to commercial AI media tools. It supports over 200 state-of-the-art models for image, video, and lip-sync generation without content filters or subscription fees. Users can access these capabilities through a web-based interface or a desktop application that supports both local and remote inference.

hugohe3 / ppt-master

PPT Master is an open-source tool that converts documents like PDFs, DOCX files, and URLs into fully editable PowerPoint presentations. Unlike image-based AI tools, it generates native DrawingML shapes, text boxes, and charts that users can modify directly in PowerPoint. The workflow integrates with AI IDEs to provide a local, privacy-focused solution for creating professional decks.

fspecii / ace-step-ui

ACE-Step UI provides a professional, Spotify-inspired interface for the open-source ACE-Step 1.5 AI music generation model. It allows users to generate high-quality songs, instrumentals, and lyrics entirely locally without subscription fees or cloud restrictions. The platform includes advanced tools for audio editing, stem extraction, and batch processing to give creators full control over their music production.

// use cases by project

Toonflow-app

01 Novel-to-film adaptation and script development
02 Short video content creation and asset generation
03 AI-driven automated storyboarding and video production

ERNIE-Image

01 High-quality poster and infographic generation
02 Multi-object and layout control under complex instructions
03 Multi-style image creation and rapid inference acceleration

Index-anisora

01 Character 3D video generation from front-facing illustrations
02 Video style transfer and frame interpolation for anime production
03 Multimodal guidance for precise control over video motion and aesthetics

ArcReel

01 Multi-agent automated video generation workflow based on the Claude Agent SDK
02 Multi-provider image and video generation with character consistency and narrative tracking
03 Built-in visual workbench supporting project management, cost tracking, and one-click export of Jianying drafts

agent-sprite-forge

01 Generating character animations and spell effect sprite sheets
02 Creating layered RPG maps with collision data and transparent props
03 Building end-to-end playable game scenes with integrated assets

// related topics

Computer Vision (5) · Automation (4) · Video Generation (4) · Deep Learning (4) · AI (2)