// topic

Generative AI

11 trending in the last 90 days · 11 all-time

// new this month

// this week's top 6

01
bilibili / Index-anisora
Index-AniSora is a powerful open-source framework designed specifically for high-quality anime video generation and animation production. The system features a comprehensive data processing pipeline, a controllable generation model with spatiotemporal masking, and a specialized evaluation benchmark. It supports diverse creative tasks including character 3D video generation, style transfer, and multimodal guidance for precise motion control.
88 · 2,411
02
ArcReel / ArcReel
ArcReel is an open-source video generation workbench powered by AI agents, designed to automate the production process from novel scripts to short video clips. The platform supports multi-vendor image and video generation models and maintains character consistency and visual coherence through a multi-agent architecture. Users can manage projects, track generation costs, and export Jianying drafts via a visual interface for an efficient video creation experience.
88 · 1,792
03
OpenBMB / VoxCPM
VoxCPM2 is a tokenizer-free, 2B-parameter text-to-speech system that uses a diffusion-autoregressive architecture to generate high-quality, expressive audio. The model supports 30 languages and offers advanced capabilities including natural-language voice design and controllable voice cloning. It is fully open-source under the Apache-2.0 license and provides production-ready features like real-time streaming and high-fidelity 48 kHz output.
78 · 46
04
baidu / ERNIE-Image
ERNIE-Image is an open-source text-to-image model developed by Baidu and built on a single-stream Diffusion Transformer architecture. The model is equipped with a lightweight prompt enhancer that expands short inputs into structurally rich, detailed descriptions. At 8B parameters, it demonstrates industry-leading performance in text rendering and instruction following while maintaining efficient deployment.
78 · 238
05
calesthio / OpenMontage
OpenMontage is an open-source, agentic system that transforms AI coding assistants into comprehensive video production studios. It automates the entire creative workflow, including research, scripting, asset generation, editing, and final composition. The platform supports both AI-generated visuals and real-footage documentary montages using a variety of free and premium tools.
78 · 68
06
jd-opensource / JoyAI-Image
JoyAI-Image is a unified multimodal foundation model that integrates an 8B Multimodal Large Language Model with a 16B Multimodal Diffusion Transformer to support image understanding, generation, and editing. The model uses a closed-loop collaboration between understanding and generation to enhance spatial reasoning and controllable editing capabilities. It provides a scalable training pipeline and supports advanced features like multi-view generation and precise spatial manipulation.
28 · 105

// all-time featured (11)

bilibili / Index-anisora
Index-AniSora is a powerful open-source framework designed specifically for high-quality anime video generation and animation production. The system features a comprehensive data processing pipeline, a controllable generation model with spatiotemporal masking, and a specialized evaluation benchmark. It supports diverse creative tasks including character 3D video generation, style transfer, and multimodal guidance for precise motion control.
88
ArcReel / ArcReel
ArcReel is an open-source video generation workbench powered by AI agents, designed to automate the production process from novel scripts to short video clips. The platform supports multi-vendor image and video generation models and maintains character consistency and visual coherence through a multi-agent architecture. Users can manage projects, track generation costs, and export Jianying drafts via a visual interface for an efficient video creation experience.
88
OpenBMB / VoxCPM
VoxCPM2 is a tokenizer-free, 2B-parameter text-to-speech system that uses a diffusion-autoregressive architecture to generate high-quality, expressive audio. The model supports 30 languages and offers advanced capabilities including natural-language voice design and controllable voice cloning. It is fully open-source under the Apache-2.0 license and provides production-ready features like real-time streaming and high-fidelity 48 kHz output.
78
baidu / ERNIE-Image
ERNIE-Image is an open-source text-to-image model developed by Baidu and built on a single-stream Diffusion Transformer architecture. The model is equipped with a lightweight prompt enhancer that expands short inputs into structurally rich, detailed descriptions. At 8B parameters, it demonstrates industry-leading performance in text rendering and instruction following while maintaining efficient deployment.
78
calesthio / OpenMontage
OpenMontage is an open-source, agentic system that transforms AI coding assistants into comprehensive video production studios. It automates the entire creative workflow, including research, scripting, asset generation, editing, and final composition. The platform supports both AI-generated visuals and real-footage documentary montages using a variety of free and premium tools.
78
PenglongHuang / chinese-novelist-skill
Chinese-novelist is a skill plugin designed specifically for Claude Code, aimed at helping users quickly generate complete novel outlines and character profiles by answering five core questions. Through automated chapter tracking and coherence management, this tool ensures the creative process remains logically rigorous and the plot engaging. Once the user confirms the plan, the AI enters automatic creation mode to efficiently complete the first draft of the entire novel.
76
PenglongHuang / chinese-novelist-skill
Chinese-novelist is a skill plugin designed for Claude Code, aimed at helping users complete the entire process of writing Chinese novels through simple interactions. Users only need to answer five core questions, and the AI can automatically generate detailed outlines, character profiles, and coherent chapter content. The tool incorporates professional writing principles and quality checklists to ensure the coherence and appeal of the novel's plot.
74
bilibili / Index-anisora
Index-AniSora is a comprehensive open-source system developed by Bilibili for high-quality anime video generation. The project provides a controllable generation model, a specialized data processing pipeline, and an evaluation benchmark tailored for animation aesthetics. It supports advanced features such as character 3D video generation, video style transfer, and multimodal guidance to facilitate diverse animation production tasks.
68
google-ai-edge / gallery
Google AI Edge Gallery is a mobile application designed to run powerful open-source Large Language Models directly on your device. It offers a fully offline and private environment for users to experience advanced generative AI capabilities, including the latest Gemma 4 family. The app provides a comprehensive suite of tools for model management, benchmarking, and interactive AI features.
42
microsoft / VibeVoice
VibeVoice is a collection of open-source voice AI models that use continuous speech tokenizers and a next-token diffusion framework to achieve high-fidelity audio processing. The project provides specialized models for long-form automatic speech recognition, real-time streaming text-to-speech, and multi-speaker synthesis. These models are designed for research purposes, offering capabilities like single-pass processing for hour-long audio and support for over 50 languages.
38
jd-opensource / JoyAI-Image
JoyAI-Image is a unified multimodal foundation model that integrates an 8B Multimodal Large Language Model with a 16B Multimodal Diffusion Transformer to support image understanding, generation, and editing. The model uses a closed-loop collaboration between understanding and generation to enhance spatial reasoning and controllable editing capabilities. It provides a scalable training pipeline and supports advanced features like multi-view generation and precise spatial manipulation.
28

// related topics