HubLensTopicsRAG
// topic

RAG

11 trending in last 90 days ·11 all-time

// new this month

// this week's top 7

01
Tencent / WeKnora
WeKnora is an intelligent knowledge management and Q&A framework that utilizes LLMs to provide enterprise-grade document understanding and semantic retrieval. The platform offers both a RAG-based Quick Q&A mode for fast queries and a ReACT Agent engine for complex, multi-source reasoning tasks. It features a highly modular architecture that supports various document formats, multiple LLM providers, and seamless integration with popular IM channels for private or local deployment.
8213,845
02
MemPalace / mempalace
MemPalace provides a local-first solution for storing and retrieving conversation history as verbatim text without the need for summarization or external API calls. The system organizes data into a structured hierarchy of wings, rooms, and drawers to enable precise, scoped semantic searches. It features a pluggable backend architecture and includes a temporal knowledge graph to manage entity relationships locally.
7853
03
opendataloader-project / opendataloader-pdf
OpenDataLoader PDF is a high-performance, open-source parser designed to convert PDF documents into structured formats like Markdown, JSON, and HTML for AI and RAG pipelines. It features a hybrid processing mode that combines deterministic local parsing with AI-driven analysis to achieve industry-leading extraction accuracy for complex tables, formulas, and scanned documents. Additionally, the project provides automated accessibility solutions, including end-to-end Tagged PDF generation compliant with international standards.
7866
04
bytedance / agentkit-samples
AgentKit Code Workshop is a companion sample repository for the AI Agent development platform launched by Volcano Engine, designed to help developers quickly master the agent construction and deployment process. The project provides a variety of code examples ranging from basic introductions to complex business scenarios, covering core functions such as multi-agent collaboration, RAG retrieval augmentation, and tool calling. Developers can use these tutorials to gain an in-depth understanding of the AgentKit development toolchain, thereby efficiently implementing various intelligent applications.
78310
05
anthropics / claude-cookbooks
The Claude Cookbooks provide a comprehensive collection of code snippets and guides to help developers integrate Claude into their own applications. The repository covers a wide range of topics including tool use, multimodal capabilities, and advanced techniques like prompt caching. These resources are designed to be easily adaptable for various programming languages and project requirements.
6892
06
HKUDS / DeepTutor
DeepTutor is an agent-native platform designed to provide a personalized, persistent, and autonomous tutoring experience. It features a unified workspace that integrates chat, deep research, quiz generation, and math visualization into a single, context-aware environment. The system supports flexible deployment options, including a guided interactive tour, manual local installation, and Docker-based setups.
28107
07
endee-io / endee
Endee is a high-performance, open-source vector database specifically engineered for AI search, RAG pipelines, and semantic retrieval workloads. It is implemented in C++ and optimized for modern CPU architectures to ensure production-grade performance and low-latency results. The platform supports flexible deployment options, including Docker and local builds, while providing advanced features like hybrid search and metadata-aware filtering.
2857

// all-time featured (11)

Tencent / WeKnora
WeKnora is an intelligent knowledge management and Q&A framework that utilizes LLMs to provide enterprise-grade document understanding and semantic retrieval. The platform offers both a RAG-based Quick Q&A mode for fast queries and a ReACT Agent engine for complex, multi-source reasoning tasks. It features a highly modular architecture that supports various document formats, multiple LLM providers, and seamless integration with popular IM channels for private or local deployment.
82
MemPalace / mempalace
MemPalace provides a local-first solution for storing and retrieving conversation history as verbatim text without the need for summarization or external API calls. The system organizes data into a structured hierarchy of wings, rooms, and drawers to enable precise, scoped semantic searches. It features a pluggable backend architecture and includes a temporal knowledge graph to manage entity relationships locally.
78
opendataloader-project / opendataloader-pdf
OpenDataLoader PDF is a high-performance, open-source parser designed to convert PDF documents into structured formats like Markdown, JSON, and HTML for AI and RAG pipelines. It features a hybrid processing mode that combines deterministic local parsing with AI-driven analysis to achieve industry-leading extraction accuracy for complex tables, formulas, and scanned documents. Additionally, the project provides automated accessibility solutions, including end-to-end Tagged PDF generation compliant with international standards.
78
bytedance / agentkit-samples
AgentKit Code Workshop is a companion sample repository for the AI Agent development platform launched by Volcano Engine, designed to help developers quickly master the agent construction and deployment process. The project provides a variety of code examples ranging from basic introductions to complex business scenarios, covering core functions such as multi-agent collaboration, RAG retrieval augmentation, and tool calling. Developers can use these tutorials to gain an in-depth understanding of the AgentKit development toolchain, thereby efficiently implementing various intelligent applications.
78
anthropics / claude-cookbooks
The Claude Cookbooks provide a comprehensive collection of code snippets and guides to help developers integrate Claude into their own applications. The repository covers a wide range of topics including tool use, multimodal capabilities, and advanced techniques like prompt caching. These resources are designed to be easily adaptable for various programming languages and project requirements.
68
pingcap / autoflow
AutoFlow is an open-source knowledge base tool that utilizes graph RAG technology built on TiDB Vector, LlamaIndex, and DSPy. The platform provides a Perplexity-style conversational search experience powered by an advanced built-in website crawler. Users can also integrate a customizable search widget into their own websites using a simple JavaScript snippet.
68
memvid / memvid
Memvid is a database-free, single-file memory layer designed to provide AI agents with instant retrieval and long-term memory capabilities. Through an innovative "smart frame" design, it encapsulates data, embeddings, and indexes into a single file, achieving efficient compression and parallel reading. The system is model-agnostic and requires zero infrastructure dependencies, supporting persistent memory in various offline or online scenarios.
42
tobi / qmd
QMD is an on-device search engine that indexes markdown notes, documentation, and transcripts for efficient local retrieval. It utilizes a hybrid approach combining BM25 full-text search, vector semantic search, and LLM-based re-ranking to deliver high-quality results. The tool is designed for agentic workflows, offering both a command-line interface and an MCP server for seamless integration with AI agents.
32
HKUDS / DeepTutor
DeepTutor is an agent-native platform designed to provide a personalized, persistent, and autonomous tutoring experience. It features a unified workspace that integrates chat, deep research, quiz generation, and math visualization into a single, context-aware environment. The system supports flexible deployment options, including a guided interactive tour, manual local installation, and Docker-based setups.
28
endee-io / endee
Endee is a high-performance, open-source vector database specifically engineered for AI search, RAG pipelines, and semantic retrieval workloads. It is implemented in C++ and optimized for modern CPU architectures to ensure production-grade performance and low-latency results. The platform supports flexible deployment options, including Docker and local builds, while providing advanced features like hybrid search and metadata-aware filtering.
28
onyx-dot-app / onyx
Onyx is a feature-rich open source AI platform designed to provide an easy-to-deploy application layer interface for large language models. The platform supports RAG, deep research, code execution, and various AI agent capabilities, while remaining compatible with mainstream self-hosted and proprietary LLMs. Users can deploy via the standard or lightweight versions to meet different needs ranging from personal use to enterprise-level collaboration.
28

// related topics