ROXY AI ASSISTANT
Nov 2025 - Jan 2026 • CLI / API
SNAPSHOT
Real-time voice AI assistant with full-duplex conversation, dynamic MCP tool usage, and a low-latency speech pipeline.
THE PROBLEM
Many voice assistants lack real-time, full-duplex conversation combined with contextual reasoning and dynamic tool use.
THE RESULT
Built a real-time voice assistant with on-device voice activity detection (VAD), speech-to-text (STT), and fast text-to-speech (TTS), enabling natural, low-latency conversations with LLM-powered tool use.
HOW IT WORKS
Developed a real-time voice assistant that supports full-duplex conversation using on-device VAD, STT, and fast TTS. Integrated a LangChain + LangGraph agent that calls MCP tools (via FastMCP) for contextual reasoning and dynamic tool use. Implemented a Kokoro TTS pipeline with optional acoustic echo cancellation (AEC) for natural, low-latency speech output. Built the system as swappable modules, so the VAD, STT, TTS, or LLM component can be replaced without changing the rest of the pipeline.
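The swappable-component design can be illustrated with a minimal sketch. This is not the project's actual code: the interface names (`VAD`, `STT`, `LLM`, `TTS`), the `VoicePipeline` class, and the stub implementations are all hypothetical, and the energy-threshold VAD is a stand-in for whatever detector the project uses. It only shows how one stage can be replaced without touching the others.

```python
from dataclasses import dataclass
from typing import Protocol

Frame = list[float]  # one chunk of audio samples (hypothetical representation)


# Hypothetical stage interfaces: any object with the right method fits.
class VAD(Protocol):
    def is_speech(self, frame: Frame) -> bool: ...


class STT(Protocol):
    def transcribe(self, frames: list[Frame]) -> str: ...


class LLM(Protocol):
    def reply(self, text: str) -> str: ...


class TTS(Protocol):
    def synthesize(self, text: str) -> bytes: ...


class EnergyVAD:
    """Toy VAD: a frame counts as speech if its mean energy exceeds a threshold."""

    def __init__(self, threshold: float = 0.01) -> None:
        self.threshold = threshold

    def is_speech(self, frame: Frame) -> bool:
        if not frame:
            return False
        return sum(s * s for s in frame) / len(frame) > self.threshold


class StubSTT:
    """Placeholder for a real speech-to-text model."""

    def transcribe(self, frames: list[Frame]) -> str:
        return f"<{len(frames)} speech frames>"


class EchoLLM:
    """Placeholder for the agent; a real one would reason and call tools."""

    def reply(self, text: str) -> str:
        return f"You said: {text}"


class StubTTS:
    """Placeholder for a real synthesizer; returns text bytes instead of audio."""

    def synthesize(self, text: str) -> bytes:
        return text.encode("utf-8")


@dataclass
class VoicePipeline:
    vad: VAD
    stt: STT
    llm: LLM
    tts: TTS

    def run_turn(self, frames: list[Frame]) -> bytes:
        # Keep only frames the VAD marks as speech, then run STT -> LLM -> TTS.
        speech = [f for f in frames if self.vad.is_speech(f)]
        if not speech:
            return b""
        text = self.stt.transcribe(speech)
        return self.tts.synthesize(self.llm.reply(text))


if __name__ == "__main__":
    pipeline = VoicePipeline(EnergyVAD(), StubSTT(), EchoLLM(), StubTTS())
    silence, loud = [0.0] * 160, [0.5] * 160
    print(pipeline.run_turn([silence, loud, loud]))
```

Swapping a component means passing a different object to `VoicePipeline`, for example a stricter `EnergyVAD(threshold=0.5)`. A real full-duplex loop would run these stages concurrently on streaming audio rather than once per turn.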