Impact at a glance
Featured · funded · awarded
What I do
Four archetypes I get hired for
Voice AI, LLMOps, distributed backends, and engineering leadership — usually two or three at the same time.
Real-time voice AI
What it covers (6)
End-to-end voice agents and avatars: ASR, intent, TTS, transport, latency optimization. Production-grade across 40k+ locations.
End-to-end voice agents and avatars: ASR, intent, TTS, transport, latency optimization. Production-grade across 40k+ locations.
→ −41.8% E2E latency · paper
LLM pipelines & MCP
What it covers (6)
RAG, agent orchestration, evals, MCP server design. Anthropic SDK + multi-model routing. Author of MCP Server Architecture paper.
RAG, agent orchestration, evals, MCP server design. Anthropic SDK + multi-model routing. Author of MCP Server Architecture paper.
→ MCP architecture paper · 5+ AI systems shipped
Distributed backends
What it covers (6)
Event-driven NestJS / FastAPI / Hono backends, geospatial pipelines, queues, observability. The plumbing that keeps AI products up.
Event-driven NestJS / FastAPI / Hono backends, geospatial pipelines, queues, observability. The plumbing that keeps AI products up.
→ 40,000+ locations · 99.9% uptime
Engineering leadership
What it covers (6)
Lead 6+ engineers through architecture reviews, hiring, and shipping cadence. Cut release cycles weeks → hours via infra automation.
Lead 6+ engineers through architecture reviews, hiring, and shipping cadence. Cut release cycles weeks → hours via infra automation.
→ Team of 6 at Ôdasie · founder × 2
Speaking
On stage at BITS Law, Mumbai
3-hour AI for Marketing Masterclass — creative AI tooling, content + distribution automation, zero-code CRM pipelines, hands-on with LLMs.
Featured
Production systems shipped
- Real-time AI voice mock interviews — Pipecat + LiveKit + Claude + ElevenLabs + Deepgram
- LLM-driven resume tailoring per JD with RAG over candidate documents
- Job matching, application tracker, interview prep, upskilling paths
- Resume parsing pipeline: Tesseract OCR + pdf-parse + mammoth
- Multi-payment integration: Razorpay, Stripe, Apple Pay
- Super Admin, Branch Admin, Dashboard, Faculty, Student PWA, Parent PWA + 2 RN apps
- Hono backend (18 route modules) with Drizzle ORM and 25+ tables
- Claude API for AI question generation across JEE / NEET / CLAT / IPMAT
- Full RBAC + JWT + OTP, MSG91 SMS/WhatsApp, FCM push, Sentry monitoring
- Event-driven Claude Code agent orchestration in development workflow
AI Voice Caddie ft. Andy North (2× US Open Champion)
Senior Software Developer · VoiceQube
- Real-time voice caddie via LiveKit + WebRTC stack
- MongoDB geospatial indexing for live GPS course position
- AWS Lambda + MediaConvert for shot video capture and processing
- Production-grade across 40,000+ golf courses (event-driven backend)
- Voice + intent stack with Anthropic Claude in the loop
- Architected end-to-end across iOS, Android, web, vendor ecosystem
- ML-powered planning workflows and intelligent vendor matching
- Viral QR photo-sharing for live event capture
- Live on App Store and Google Play; multi-market customer base
- Established engineering culture, CI/CD automation, technical roadmap
Mobile apps
Live on the App Store and Play Store
React Native / Expo across health-tech and consumer products. Multi-market customers downloading them today.
CELABE
AI-Powered Wedding & Events Platform
Co-founded multi-market wedding / events platform. iOS, Android, web; ML-powered planning, vendor marketplace, viral QR photo-sharing. Founding engineering leader.
SmokeMukti
ICMR-Funded Tobacco Cessation App
Behavior-change app for tobacco cessation funded by the Indian Council of Medical Research (ICMR). Personalized AI chat coach, progress tracking, goal setting, and 24/7 motivation pathway. Built under guidance of Dr. Anil V. Ankola; ICMR STS-backed research project.
Research
Six papers on the systems I ship
Voice AI latency, ASR intent detection, MCP architecture, LLM optimization — measured on production workloads, not benchmarks.
MCP Server Architecture Patterns
Design patterns for production MCP servers — Gamma-format taxonomy, anti-patterns, cross-cutting concerns. 100% classification accuracy at ≤10 tools; Proxy Aggregator pattern when tool count exceeds threshold.
Target: ACM / IEEE Software
Latency Optimization for Voice AI
Systems-level latency study of a production voice AI platform (ElevenLabs TTS + Anthropic Claude intent + NestJS over WebSocket). Concurrency between intent detection and TTS is the highest-leverage optimization, not per-stage speed.
Target: ICASSP / IEEE Access
ASR Intent Detection at Scale
Using LLMs as a post-ASR intent classifier on noisy production transcripts. Prompt + structured-output classifier tested across ~120 prompt variants. Lifted intent accuracy without retraining the base ASR.
Target: EMNLP / IEEE NLP
LLM Optimization for TensorFlow
Evaluation framework and fine-tuning recipe lifting benchmark task performance by 5.6 pp. Translates research improvements into production systems.
Target: NeurIPS / IEEE TNNLS
Experience
Where I've shipped
- Co-founded and architected an AI-powered wedding / events platform
- Led full lifecycle from concept to production launch (iOS, Android, web, vendor ecosystem)
- ML-powered planning workflows, viral QR photo-sharing, real-time vendor coordination
- Established engineering culture, CI/CD automation, and the technical roadmap
- Led a team of 6 engineers through architecture reviews, code reviews, and technical mentorship
- Architected and shipped 5+ production AI systems: LLM pipelines, voice agents, MCP servers, geospatial processors
- Real-time AI avatar with LiveKit + WebRTC (bi-directional voice + video + data)
- Heavy n8n automation — LLM agent orchestration, API integrations, approval workflows
- Adopted Claude Code in production engineering workflow; cut release cycles weeks → hours
- Shipped three production AI products end-to-end: Phiny.ai, Crosslane, Northway Tech
- Architected NestJS + AWS backend serving real-time interactions across 40,000+ locations
- Live AI avatars and voice agents — LiveKit, Pipecat, ElevenLabs TTS, Deepgram ASR, Anthropic intent
- Geospatial indexing + media pipelines (Lambda + MediaConvert + CloudFront)
- Smart Recruiter backend: 516 TS files / ~101k LoC NestJS + ~17k LoC Python WebRTC microservice
- Engineered enterprise IoT desktop applications — 20k+ LoC, 1,000+ Jest tests
- Architected real-time MQTT communication and concurrent schedulers — ~3× performance improvement
- Stack: Angular, Electron, RxJS, NestJS, RxDB, Python, MQTT pub/sub
Stack
What I build with
Voice AI and LLMOps day-to-day. Heavy lean on Claude, MCP, and the agentic stack.
Voice AI
AI / LLMs
AI Dev Tools
Backend
Frontend
Cloud / DevOps
Data
Observability
Writing
Latest articles
Available for senior AI / contract / FDE work
Building something with AI?
Voice agents, MCP servers, LLM pipelines, agentic workflows — pick a slot, drop a message, or send your email and I'll reply within a day.
Replies within ~24 hours · Bengaluru, India · IST friendly