// summary
Secret Llama is an entirely in-browser chatbot that allows users to run open-source models like Llama 3 and Mistral locally. Because the application operates directly within the browser, all conversation data remains private and no server installation is required. The platform provides a user-friendly interface that functions offline while leveraging WebGPU technology for performance.
// technical analysis
Secret Llama is a browser-based LLM chatbot designed to provide a fully private, offline-capable AI experience by leveraging the WebGPU-powered web-llm inference engine. By executing models entirely within the user's browser, it eliminates the need for server-side infrastructure and ensures that sensitive conversation data never leaves the local machine. This architecture prioritizes user privacy and accessibility, though it necessitates a modern browser with WebGPU support and sufficient system RAM to handle the specific model sizes.
// key highlights
// use cases
// getting started
To begin using Secret Llama, simply visit the hosted website in a WebGPU-compatible browser like Chrome or Edge. If you wish to modify or build the project locally, clone the repository, run 'yarn' to install dependencies, and use 'yarn dev' to launch the development environment with live reload.