Mininglamp-AI

Mano-P

AI, #Agent, #Computer Vision, #Edge Computing, #LLM, #Automation

// summary

Mano-P is a GUI-VLA agent project for autonomous, private task execution on edge devices such as the Mac mini and MacBook. It combines reinforcement learning with edge-native inference to perform complex GUI automation, cross-system data integration, and long-task planning. Because everything runs locally, the project offers a secure, local-first solution that needs no cloud API calls, and it reports high performance across several benchmarks.

// technical analysis

Mano-P is a GUI-VLA (Vision-Language-Action) agent framework built for edge devices. It prioritizes privacy by running locally on Apple Silicon hardware, which allows autonomous, secure, and complex GUI automation without cloud-based APIs and removes the bottleneck of traditional human-in-the-loop workflows. For high-precision task execution, the project uses a 'think-act-verify' reasoning mechanism and a three-stage progressive training methodology. Its main technical trade-off is a focus on edge-native optimization: mixed-precision quantization and visual token pruning keep performance high on constrained hardware such as the Mac mini.
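
The 'think-act-verify' mechanism is only named here, not specified, so the following is a minimal Python sketch of what such a loop could look like in general. Every name in it (`ThinkActVerifyLoop`, `Step`, `run_demo`, the `click_plus` action) is hypothetical and not taken from the Mano-P codebase, and the "screen" is a toy counter standing in for a real GUI.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Step:
    action: str        # what the agent decided to do
    observation: str   # what the (toy) screen showed afterwards
    verified: bool     # whether the goal check passed after this step


@dataclass
class ThinkActVerifyLoop:
    """Hypothetical loop: propose an action, execute it, then check the result."""
    think: Callable[[str, str], str]    # (goal, observation) -> action
    act: Callable[[str], str]           # action -> new observation
    verify: Callable[[str, str], bool]  # (goal, observation) -> goal reached?
    max_steps: int = 10
    history: List[Step] = field(default_factory=list)

    def run(self, goal: str, observation: str) -> bool:
        for _ in range(self.max_steps):
            action = self.think(goal, observation)    # think
            observation = self.act(action)            # act
            done = self.verify(goal, observation)     # verify
            self.history.append(Step(action, observation, done))
            if done:
                return True
        return False  # step budget exhausted without reaching the goal


def run_demo() -> Tuple[bool, List[Step]]:
    # Toy environment (an assumption for illustration): a counter with a "+" button.
    screen = {"counter": 0}

    def think(goal: str, observation: str) -> str:
        return "click_plus"

    def act(action: str) -> str:
        if action == "click_plus":
            screen["counter"] += 1
        return f"counter={screen['counter']}"

    def verify(goal: str, observation: str) -> bool:
        return observation == goal

    loop = ThinkActVerifyLoop(think, act, verify, max_steps=5)
    ok = loop.run(goal="counter=3", observation="counter=0")
    return ok, loop.history


if __name__ == "__main__":
    ok, history = run_demo()
    print(ok, [step.observation for step in history])
```

In a real agent, `think` would presumably be a model call over a screenshot and `act` would drive the GUI. The explicit `verify` step is what lets the loop stop on confirmed success rather than on the model's own claim of completion.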

// key highlights

01
Achieves a 58.2% success rate on the OSWorld benchmark, ranking first among specialized GUI agent models.
02
Enables fully local execution on Apple M4 chips, ensuring that all sensitive screenshots and task data remain on-device.
03
Supports complex, autonomous business process automation, including end-to-end software construction pipelines.
04
Includes the Cider SDK, which provides W8A8/W4A8 activation-quantization primitives to deliver significant prefill speedups on Apple Silicon.
05
Demonstrates high-performance inference, with the 4B model reaching approximately 80 tokens/s on Apple M5 Pro hardware.
06
Provides an end-to-end autonomous application construction loop (Mano-AFK) that handles requirements, coding, deployment, and testing without human intervention.
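
For readers unfamiliar with the notation in highlight 04, W8A8 means 8-bit weights with 8-bit activations, and W4A8 means 4-bit weights with 8-bit activations. The sketch below shows the general idea using symmetric per-tensor quantization in plain Python. It is not the Cider SDK API; the function names `quantize_symmetric` and `quantized_dot` are hypothetical, and production kernels typically use per-channel or per-group scales and hardware integer instructions rather than Python lists.

```python
from typing import List, Tuple


def quantize_symmetric(values: List[float], bits: int) -> Tuple[List[int], float]:
    """Map floats to signed integers in [-qmax, qmax] using a single scale."""
    qmax = 2 ** (bits - 1) - 1            # 127 for 8 bits, 7 for 4 bits
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return quantized, scale


def quantized_dot(weights: List[float], activations: List[float],
                  weight_bits: int = 8, activation_bits: int = 8) -> float:
    """Dot product with integer accumulation, rescaled to float at the end."""
    qw, w_scale = quantize_symmetric(weights, weight_bits)
    qa, a_scale = quantize_symmetric(activations, activation_bits)
    acc = sum(w * a for w, a in zip(qw, qa))   # pure integer arithmetic
    return acc * w_scale * a_scale             # one float rescale per output


if __name__ == "__main__":
    weights = [0.4, -1.0, 0.2]
    activations = [1.0, 0.5, -3.0]
    exact = sum(w * a for w, a in zip(weights, activations))
    print("exact:", exact)
    print("W8A8 :", quantized_dot(weights, activations, 8, 8))
    print("W4A8 :", quantized_dot(weights, activations, 4, 8))
```

Quantizing activations as well as weights lets the inner loop run entirely on integers, which is the usual source of prefill speedups. The cost is approximation error, which grows as the weight bit width drops from 8 to 4.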

// use cases

01
Complex GUI automation for autonomous interface operations
02
End-to-end autonomous software construction pipelines
03
Private, local-side business process and task execution

// getting started

To get started with Mano-P, first review the project's phased open-source roadmap, which begins with the Mano-CUA Skills for constructing task workflows. Local deployment requires an Apple Silicon device (M4 chip or later) with at least 32GB of RAM. Specific SDK installation instructions and deployment guides, for both direct hardware usage and compute stick integration, are planned for future updates.
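
Until official installation instructions are published, the hardware requirement above can be checked with a small preflight script. This is an illustrative sketch using only the Python standard library, not part of the Mano-P tooling, and the function names are hypothetical. It confirms Apple Silicon (macOS on `arm64`) and total RAM, but it does not attempt to tell an M4 from earlier chip generations.

```python
import os
import platform
from typing import Optional

MIN_RAM_GIB = 32  # minimum RAM stated in the getting-started text


def total_ram_gib() -> Optional[float]:
    """Best-effort physical memory in GiB; None if the platform does not expose it."""
    try:
        pages = os.sysconf("SC_PHYS_PAGES")
        page_size = os.sysconf("SC_PAGE_SIZE")
    except (AttributeError, ValueError, OSError):
        return None
    if pages <= 0 or page_size <= 0:
        return None
    return pages * page_size / 2 ** 30


def meets_requirements(system: str, machine: str, ram_gib: Optional[float]) -> bool:
    """True for macOS on arm64 with at least MIN_RAM_GIB of memory."""
    is_apple_silicon = system == "Darwin" and machine == "arm64"
    has_enough_ram = ram_gib is not None and ram_gib >= MIN_RAM_GIB
    return is_apple_silicon and has_enough_ram


if __name__ == "__main__":
    ram = total_ram_gib()
    ok = meets_requirements(platform.system(), platform.machine(), ram)
    print(f"system={platform.system()} machine={platform.machine()} ram_gib={ram}")
    print("meets stated requirements" if ok else "does not meet stated requirements")
```

Keeping the decision logic in `meets_requirements` separate from the probing in `total_ram_gib` makes the check easy to test on any machine.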