California, United States
Architected low-latency, real-time voice AI systems and frameworks for autonomous agent collaboration and task execution.
Optimized high-throughput telemetry using ClickHouse for real-time feedback loops; streamlined delivery via AI-generated code under strict human-reviewed architecture standards.
Integrated GraphRAG to enhance context accuracy and response density across agentic pipelines.
Engineered real-time streaming voice responses over SSE and WebSockets; built multi-turn context-aware NLP for high-performance interactions.
Designed LLM evaluation frameworks for automated output scoring and regression testing; implemented prompt versioning and model routing to reduce inference costs.
Built AI safety guardrails — PII redaction, content filtering, and anomaly detection — across production agentic pipelines.