// summary
QMD is an on-device search engine that indexes markdown notes, documentation, and transcripts for efficient local retrieval. It utilizes a hybrid approach combining BM25 full-text search, vector semantic search, and LLM-based re-ranking to deliver high-quality results. The tool is designed for agentic workflows, offering both a command-line interface and an MCP server for seamless integration with AI agents.
// technical analysis
QMD is an on-device search engine designed to index and retrieve information from personal knowledge bases, such as markdown notes and documentation, using a hybrid architecture that combines BM25 full-text search, vector semantic search, and LLM-based re-ranking. By running locally via node-llama-cpp and GGUF models, it solves the challenge of maintaining private, agent-ready search capabilities without relying on external cloud APIs. The project employs a sophisticated retrieval pipeline featuring Reciprocal Rank Fusion (RRF) and position-aware blending, which balances the precision of keyword matching with the contextual understanding of semantic search to ensure high-quality results for agentic workflows.
// key highlights
// use cases
// getting started
To begin, install the tool globally using 'npm install -g @tobilu/qmd' or 'bun install -g @tobilu/qmd'. Once installed, create a collection from your local directory using 'qmd collection add [path] --name [name]', then run 'qmd embed' to generate the necessary search indexes. You can then perform searches using 'qmd search' for keywords, 'qmd vsearch' for semantic queries, or 'qmd query' for the full hybrid experience.