ProAction is a multi-agent, proactive AI assistant that monitors and analyzes speech transcriptions to provide relevant insights, factual information, and assistance in real time through AugmentOS smartglasses.
- Multi-agent Architecture: Extensible system of specialized AI agents that each focus on different types of assistance
- Priority System: Configurable priority levels to control when and how information is presented
- Contextual Awareness: Maintains short-term and long-term memory to provide relevant information
- Voice Commands: Natural language interface for controlling the system
- Configurable: User-configurable agent settings and priority thresholds
- Factual Agent: Provides factual information and verifies statements
- Quiz Agent: Helps with studying and recalling information
- Personal Assistant: Manages reminders, tasks, and calendar events
- Set up environment variables:

      AUGMENTOS_API_KEY=your_api_key
      OPENAI_API_KEY=your_openai_key
      PORT=3000 (optional)
      AUGMENTOS_WS_URL=wss://staging.augmentos.org/tpa-ws (optional)

- Install dependencies:

      npm install

- Build and start the server:

      npm run build
      npm start
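
The environment variables from the setup steps could be read and validated at startup roughly as follows. This is a minimal sketch, not ProAction's actual startup code: `ServerConfig` and `loadConfig` are hypothetical names, and treating `3000` and the staging WebSocket URL as fallbacks for the two optional variables is an assumption based on the example values above.

```typescript
// Hypothetical config shape; field names are illustrative, not ProAction's real types.
interface ServerConfig {
  augmentosApiKey: string;
  openaiApiKey: string;
  port: number;
  wsUrl: string;
}

type Env = Record<string, string | undefined>;

function loadConfig(env: Env): ServerConfig {
  // Fail fast with a clear message when a required variable is absent or blank.
  const required = (name: string): string => {
    const value = env[name];
    if (value === undefined || value.trim() === "") {
      throw new Error(`Missing required environment variable: ${name}`);
    }
    return value;
  };

  // Assumption: the example values from the setup step act as fallbacks.
  const rawPort = env["PORT"] ?? "3000";
  const port = Number(rawPort);
  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`PORT must be an integer between 1 and 65535, got: ${rawPort}`);
  }

  return {
    augmentosApiKey: required("AUGMENTOS_API_KEY"),
    openaiApiKey: required("OPENAI_API_KEY"),
    port,
    wsUrl: env["AUGMENTOS_WS_URL"] ?? "wss://staging.augmentos.org/tpa-ws",
  };
}
```

In a Node.js process this would be called as `loadConfig(process.env)`. Passing the environment in as a parameter keeps the function easy to test without touching the real process environment.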
- "Show insights" - Display pending insights
- "Hide insights" - Clear the current display
- "Show agent settings" - View and configure agents
- "Enable [agent name]" - Enable a specific agent
- "Disable [agent name]" - Disable a specific agent
- "Help me" - Show help information
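
To illustrate how the phrases above might map onto actions, here is a small sketch of a command parser. `VoiceCommand` and `parseVoiceCommand` are hypothetical names, and exact-phrase matching is an assumption made for brevity; ProAction's natural language interface may well be more flexible than this.

```typescript
// Hypothetical command representation; not ProAction's actual types.
type VoiceCommand =
  | { kind: "showInsights" }
  | { kind: "hideInsights" }
  | { kind: "showAgentSettings" }
  | { kind: "setAgentEnabled"; agentName: string; enabled: boolean }
  | { kind: "help" };

function parseVoiceCommand(transcript: string): VoiceCommand | null {
  // Normalize: trim, lowercase, drop trailing punctuation, collapse whitespace.
  const text = transcript
    .trim()
    .toLowerCase()
    .replace(/[.,!?]+$/, "")
    .replace(/\s+/g, " ")
    .trim();

  switch (text) {
    case "show insights":
      return { kind: "showInsights" };
    case "hide insights":
      return { kind: "hideInsights" };
    case "show agent settings":
      return { kind: "showAgentSettings" };
    case "help me":
      return { kind: "help" };
  }

  // "enable <agent name>" / "disable <agent name>"
  const match = /^(enable|disable) (.+)$/.exec(text);
  if (match) {
    const [, verb = "", agentName = ""] = match;
    return { kind: "setAgentEnabled", agentName, enabled: verb === "enable" };
  }

  // Anything else is ordinary speech, not a command.
  return null;
}
```

A real implementation would likely also check the extracted agent name against the registered agents before acting, so that ordinary speech beginning with "enable" or "disable" is not mistaken for a command.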
ProAction uses a coordinator-based architecture where:
- Transcriptions are received from AugmentOS
- The coordinator processes the transcription
- Active agents analyze the transcription and generate insights
- Insights are prioritized and presented to the user based on configured thresholds
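
The flow above can be sketched as a small coordinator class. This is an illustration under stated assumptions rather than ProAction's implementation: `Insight`, `InsightAgent`, `Coordinator`, and `handleTranscription` are hypothetical names, priorities are assumed to be numbers where higher means more important, and agents are shown as synchronous even though agents backed by a language model would normally be asynchronous.

```typescript
// Hypothetical types; the real ProAction interfaces may differ.
interface Insight {
  agentName: string;
  text: string;
  priority: number; // Assumption: a higher number means more important.
}

interface InsightAgent {
  name: string;
  enabled: boolean;
  analyze(transcript: string): Insight[];
}

class Coordinator {
  private readonly agents: InsightAgent[] = [];
  private readonly displayThreshold: number;

  constructor(displayThreshold: number) {
    this.displayThreshold = displayThreshold;
  }

  register(agent: InsightAgent): void {
    this.agents.push(agent);
  }

  // Runs every enabled agent on the transcript, keeps the insights that
  // meet the threshold, and returns them ordered highest priority first.
  handleTranscription(transcript: string): Insight[] {
    const kept: Insight[] = [];
    for (const agent of this.agents) {
      if (!agent.enabled) continue;
      for (const insight of agent.analyze(transcript)) {
        if (insight.priority >= this.displayThreshold) {
          kept.push(insight);
        }
      }
    }
    return kept.sort((a, b) => b.priority - a.priority);
  }
}
```

Keeping the filtering and ordering in the coordinator means individual agents only need to produce insights with a priority; they do not need to know the user's thresholds.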
- Create a new agent class in src/agents/ that extends BaseAgent
- Implement the required methods
- Register the agent in src/server.ts
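
As a rough illustration of these three steps, the sketch below defines a stand-in `BaseAgent` and a toy agent that extends it. Everything here is an assumption: the real `BaseAgent` in src/agents/ may require different methods and signatures, `AgentInsight`, `AcronymAgent`, and `registeredAgents` are hypothetical names, and the actual registration call in src/server.ts may look different.

```typescript
// Hypothetical stand-in for ProAction's BaseAgent; check the real class
// in src/agents/ for the methods it actually requires.
interface AgentInsight {
  text: string;
  priority: number;
}

abstract class BaseAgent {
  enabled = true;
  abstract readonly name: string;
  abstract analyze(transcript: string): AgentInsight[];
}

// Steps 1 and 2: extend the base class and implement its required methods.
// This toy agent expands a few known acronyms it hears in the transcript.
class AcronymAgent extends BaseAgent {
  readonly name = "Acronym Agent";

  private readonly known = new Map<string, string>([
    ["api", "application programming interface"],
    ["llm", "large language model"],
  ]);

  analyze(transcript: string): AgentInsight[] {
    const insights: AgentInsight[] = [];
    const seen = new Set<string>();
    const words = transcript.toLowerCase().match(/[a-z]+/g) ?? [];
    for (const word of words) {
      const expansion = this.known.get(word);
      // Report each acronym at most once per transcript.
      if (expansion !== undefined && !seen.has(word)) {
        seen.add(word);
        insights.push({ text: `${word.toUpperCase()}: ${expansion}`, priority: 1 });
      }
    }
    return insights;
  }
}

// Step 3: register the agent (here, a plain array stands in for whatever
// registration mechanism src/server.ts actually uses).
const registeredAgents: BaseAgent[] = [];
registeredAgents.push(new AcronymAgent());
```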
Modify the thresholds in the ConfigManager to adjust when different insights are shown.
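
To make the effect of thresholds concrete, here is a minimal sketch of threshold-based routing. It does not reproduce the ConfigManager API, which this document does not describe: `PriorityThresholds`, `decideAction`, the two threshold names, and the 0 to 10 priority scale are all hypothetical.

```typescript
// Hypothetical routing outcomes for an insight.
type InsightAction = "display" | "queue" | "discard";

// Hypothetical threshold settings on an assumed 0-10 priority scale.
interface PriorityThresholds {
  displayNow: number; // at or above: show immediately
  queue: number; // at or above (but below displayNow): hold as pending
}

const defaultThresholds: PriorityThresholds = { displayNow: 8, queue: 4 };

function decideAction(
  priority: number,
  thresholds: PriorityThresholds = defaultThresholds
): InsightAction {
  if (priority >= thresholds.displayNow) return "display";
  if (priority >= thresholds.queue) return "queue";
  return "discard";
}
```

With these example values, lowering `displayNow` from 8 to 5 would cause a priority-5 insight to be shown immediately instead of being held until the user asks for pending insights.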
MIT