// topic

Generative AI

17 trending in last 90 days · 17 all-time

// ecosystem

Computer Vision 5 · Automation 4 · Video Generation 4 · Deep Learning 4 · AI 2 · Generative AI
AI 17

// this week's top 9

01
HBAI-Ltd / Toonflow-app
Toonflow-app is an AI workbench for short-drama production that automates the full process from script to video through an infinite canvas and a three-layer agent collaboration system. The platform supports adaptation driven by a chapter event graph and provides a programmable provider system for integrating various AI models. Its persistent memory system and modular skill configuration are intended to improve the efficiency and consistency of short-drama creation.
73 · 7,460
02
baidu / ERNIE-Image
ERNIE-Image is an open-source text-to-image model developed by Baidu based on the Diffusion Transformer (DiT) architecture. The model is equipped with a lightweight prompt enhancer that transforms short inputs into structure-rich descriptions, achieving industry-leading generation results at an 8B parameter scale. It excels at handling complex text rendering, multi-object layout, and instruction-following tasks, while supporting efficient deployment on consumer-grade GPUs.
71 · 412
03
bilibili / Index-anisora
Index-AniSora is an open-source framework for high-quality anime video generation and animation production. The system features a data processing pipeline, a controllable generation model with spatiotemporal masking, and a specialized evaluation benchmark. It supports creative tasks including character 3D video generation, video style transfer, and multimodal guidance for precise motion control.
68 · 2,421
04
ArcReel / ArcReel
ArcReel is an open-source AI video generation workbench that implements an automated pipeline from novel scripts to finished videos via a multi-agent architecture. The platform supports integration with various providers including Gemini, Volcengine Ark, Grok, and OpenAI, offering character consistency maintenance and narrative tracking features. Users can manage projects, track costs, and export Jianying drafts through a visual interface to achieve efficient AI-assisted video creation.
66 · 2,045
05
0x0funky / agent-sprite-forge
Agent Sprite Forge is a tool designed to convert natural-language prompts into game-ready 2D sprites and layered maps using Codex. It automates the asset pipeline by combining AI image generation with deterministic local post-processing for cleanup and export. The system supports various outputs, including animation sheets, transparent GIFs, collision data, and complex scene layouts.
61 · 70
06
microsoft / VibeVoice
VibeVoice is a family of open-source voice AI models that use continuous speech tokenizers and next-token diffusion to achieve high-fidelity audio processing. The framework includes tools for long-form speech recognition and real-time streaming text-to-speech generation. The models are intended for research purposes, to advance collaboration and innovation within the speech synthesis community.
43 · 73
07
Anil-matcha / Open-Generative-AI
Open Generative AI is a free, open-source platform providing an unrestricted alternative to commercial AI media tools. It supports over 200 state-of-the-art models for image, video, and lip-sync generation without content filters or subscription fees. Users can access these capabilities through a web-based interface or a desktop application that supports both local and remote inference.
39 · 129
08
hugohe3 / ppt-master
PPT Master is an open-source tool that converts documents like PDFs, DOCX files, and URLs into fully editable PowerPoint presentations. Unlike image-based AI tools, it generates native DrawingML shapes, text boxes, and charts that users can modify directly in PowerPoint. The workflow integrates with AI IDEs to provide a local, privacy-focused solution for creating professional decks.
30 · 42
09
fspecii / ace-step-ui
ACE-Step UI provides a professional, Spotify-inspired interface for the open-source ACE-Step 1.5 AI music generation model. It allows users to generate high-quality songs, instrumentals, and lyrics entirely locally without subscription fees or cloud restrictions. The platform includes advanced tools for audio editing, stem extraction, and batch processing to give creators full control over their music production.
27 · 65

// all-time featured (17)

HBAI-Ltd / Toonflow-app
Toonflow-app is an AI workbench for short-drama production that automates the full process from script to video through an infinite canvas and a three-layer agent collaboration system. The platform supports adaptation driven by a chapter event graph and provides a programmable provider system for integrating various AI models. Its persistent memory system and modular skill configuration are intended to improve the efficiency and consistency of short-drama creation.
73
baidu / ERNIE-Image
ERNIE-Image is an open-source text-to-image model developed by Baidu based on the Diffusion Transformer (DiT) architecture. The model is equipped with a lightweight prompt enhancer that transforms short inputs into structure-rich descriptions, achieving industry-leading generation results at an 8B parameter scale. It excels at handling complex text rendering, multi-object layout, and instruction-following tasks, while supporting efficient deployment on consumer-grade GPUs.
71
bilibili / Index-anisora
Index-AniSora is an open-source framework for high-quality anime video generation and animation production. The system features a data processing pipeline, a controllable generation model with spatiotemporal masking, and a specialized evaluation benchmark. It supports creative tasks including character 3D video generation, video style transfer, and multimodal guidance for precise motion control.
68
ArcReel / ArcReel
ArcReel is an open-source AI video generation workbench that implements an automated pipeline from novel scripts to finished videos via a multi-agent architecture. The platform supports integration with various providers including Gemini, Volcengine Ark, Grok, and OpenAI, offering character consistency maintenance and narrative tracking features. Users can manage projects, track costs, and export Jianying drafts through a visual interface to achieve efficient AI-assisted video creation.
66
0x0funky / agent-sprite-forge
Agent Sprite Forge is a tool designed to convert natural-language prompts into game-ready 2D sprites and layered maps using Codex. It automates the asset pipeline by combining AI image generation with deterministic local post-processing for cleanup and export. The system supports various outputs, including animation sheets, transparent GIFs, collision data, and complex scene layouts.
61
bilibili / Index-anisora
Index-AniSora is a comprehensive open-source system developed by Bilibili for high-quality anime video generation. The project provides a controllable generation model, a specialized data processing pipeline, and an evaluation benchmark tailored for animation aesthetics. It supports advanced features such as character 3D video generation, video style transfer, and multimodal guidance to facilitate diverse animation production tasks.
61
calesthio / OpenMontage
OpenMontage is an open-source, agentic system that transforms AI coding assistants into comprehensive video production studios. It automates the entire creative workflow, including research, scripting, asset generation, editing, and final composition. The platform supports both AI-generated visuals and real-footage documentary montages using a variety of free and premium tools.
60
OpenBMB / VoxCPM
VoxCPM2 is a tokenizer-free, 2B-parameter text-to-speech system that uses a diffusion autoregressive architecture to generate high-quality, expressive audio. The model supports 30 languages and offers voice design, controllable voice cloning, and studio-quality 48 kHz output. It is fully open-source under the Apache-2.0 license and provides production-ready deployment options via vLLM-Omni and Nano-vLLM.
56
jd-opensource / JoyAI-Image
JoyAI-Image is a unified multimodal foundation model that integrates an 8B Multimodal Large Language Model with a 16B Multimodal Diffusion Transformer to support image understanding, generation, and editing. The model utilizes a closed-loop collaboration between understanding and generation to enhance spatial reasoning and controllable editing capabilities. It provides a scalable training pipeline and supports advanced features like multi-view generation and precise spatial manipulation.
52
PenglongHuang / chinese-novelist-skill
Chinese-novelist is a skill plugin designed for Claude Code, aimed at helping users complete the entire process of writing Chinese novels through simple interactions. Users only need to answer five core questions, and the AI can automatically generate detailed outlines, character profiles, and coherent chapter content. The tool incorporates professional writing principles and quality checklists to ensure the coherence and appeal of the novel's plot.
49
PenglongHuang / chinese-novelist-skill
Chinese-novelist is a skill plugin designed specifically for Claude Code, aimed at helping users quickly generate complete novel outlines and character profiles by answering five core questions. Through automated chapter tracking and coherence management, this tool ensures the creative process remains logically rigorous and the plot engaging. Once the user confirms the plan, the AI enters automatic creation mode to efficiently complete the first draft of the entire novel.
46
microsoft / VibeVoice
VibeVoice is a family of open-source voice AI models that use continuous speech tokenizers and next-token diffusion to achieve high-fidelity audio processing. The framework includes tools for long-form speech recognition and real-time streaming text-to-speech generation. The models are intended for research purposes, to advance collaboration and innovation within the speech synthesis community.
43
mnfst / awesome-free-llm-apis
This repository provides a curated list of LLM API providers that offer permanent free tiers for text inference. It categorizes services into direct provider APIs and third-party inference platforms, detailing model capabilities, context windows, and rate limits. The collection serves as a comprehensive resource for developers seeking cost-effective access to various large language models.
43
google-ai-edge / gallery
Google AI Edge Gallery is a mobile application designed to run powerful open-source Large Language Models directly on your device. It offers a fully offline and private environment for users to experience advanced generative AI capabilities, including the latest Gemma 4 family. The app provides a comprehensive suite of tools for model management, benchmarking, and interactive AI features.
41
Anil-matcha / Open-Generative-AI
Open Generative AI is a free, open-source platform providing an unrestricted alternative to commercial AI media tools. It supports over 200 state-of-the-art models for image, video, and lip-sync generation without content filters or subscription fees. Users can access these capabilities through a web-based interface or a desktop application that supports both local and remote inference.
39
hugohe3 / ppt-master
PPT Master is an open-source tool that converts documents like PDFs, DOCX files, and URLs into fully editable PowerPoint presentations. Unlike image-based AI tools, it generates native DrawingML shapes, text boxes, and charts that users can modify directly in PowerPoint. The workflow integrates with AI IDEs to provide a local, privacy-focused solution for creating professional decks.
30
fspecii / ace-step-ui
ACE-Step UI provides a professional, Spotify-inspired interface for the open-source ACE-Step 1.5 AI music generation model. It allows users to generate high-quality songs, instrumentals, and lyrics entirely locally without subscription fees or cloud restrictions. The platform includes advanced tools for audio editing, stem extraction, and batch processing to give creators full control over their music production.
27

// use cases by project

Toonflow-app
  • 01 Novel-to-film adaptation and script development
  • 02 Short video content creation and asset generation
  • 03 AI-driven automated storyboarding and video production
ERNIE-Image
  • 01 High-quality poster and infographic generation
  • 02 Multi-object and layout control under complex instructions
  • 03 Multi-style image creation and rapid inference acceleration
Index-anisora
  • 01 Character 3D video generation from front-facing illustrations
  • 02 Video style transfer and frame interpolation for anime production
  • 03 Multimodal guidance for precise control over video motion and aesthetics
ArcReel
  • 01 Multi-agent automated video generation workflow based on the Claude Agent SDK
  • 02 Support for multi-provider image and video generation with character consistency and narrative tracking capabilities
  • 03 Built-in visual workbench supporting project management, cost tracking, and one-click export of Jianying drafts
agent-sprite-forge
  • 01 Generating character animations and spell effect sprite sheets
  • 02 Creating layered RPG maps with collision data and transparent props
  • 03 Building end-to-end playable game scenes with integrated assets
