Skip to Content
Solution PlaysPlay 33: Play 33 β€” Voice AI Agent πŸŽ™οΈ

Play 33 β€” Voice AI Agent πŸŽ™οΈ

Autonomous voice agent with dialog state, multi-turn conversations, and proactive actions.

An AI voice agent that goes beyond Play 04’s inbound call handling. Dialog state machine manages multi-turn conversations, intent detection with entity extraction enables complex requests, and proactive actions let the agent initiate (reminders, follow-ups, notifications) β€” not just respond.

Quick Start

cd solution-plays/33-voice-ai-agent az deployment group create -g $RG -f infra/main.bicep -p infra/parameters.json code . # Use @builder for dialog/voice, @reviewer for compliance audit, @tuner for prosody/speed

How It Differs from Play 04 (Call Center)

AspectPlay 04 (Call Center)Play 33 (Voice Agent)
ModeInbound phone callsAny interface (phone, web, IoT)
ConversationSingle-intentMulti-turn stateful dialog
StateStateless per callPersistent state machine
ActionsRespond + escalateProactive (initiate tasks)
MemoryCurrent call onlyRemembers past interactions

Architecture

ServicePurpose
Azure Speech (STT + TTS)Voice recognition + neural voice synthesis
Azure OpenAI (gpt-4o + mini)Dialog intelligence + intent classification
Cosmos DBDialog state persistence across sessions
Container AppsVoice agent runtime

πŸ“ Full architecture details

Key Metrics

  • Intent accuracy: β‰₯92% Β· Dialog completion: β‰₯80% Β· Latency: <2s Β· Voice MOS: β‰₯4.0/5.0

DevKit (Voice Agent-Focused)

PrimitiveWhat It Does
3 agentsBuilder (dialog state/intent/voice loop), Reviewer (compliance/latency/UX), Tuner (prosody/flow/cost)
3 skillsDeploy (109 lines), Evaluate (107 lines), Tune (106 lines)
4 prompts/deploy (dialog + voice), /test (interaction loop), /review (compliance), /evaluate (intent/quality)

Cost

ServiceDevProdEnterprise
Azure AI Speech$5 (Free+overflow)$200 (Standard S0)$800 (S0+Custom Voice)
Azure OpenAI$45 (PAYG)$350 (PAYG)$1,200 (PTU)
Communication Services$15 (PAYG)$150 (PAYG)$500 (Direct Routing)
Container Apps$15 (Consumption)$120 (Dedicated)$350 (Dedicated HA)
Redis Cache$15 (Basic C0)$55 (Standard C1)$200 (Premium P1)
Cosmos DB$5 (Serverless)$65 (800 RU/s)$300 (4000 RU/s)
Key Vault$1 (Standard)$3 (Standard)$10 (Premium HSM)
Application Insights$0 (Free)$30 (Pay-per-GB)$120 (Pay-per-GB)
Total$101/mo$973/mo$3,480/mo

πŸ’° Full cost breakdown

πŸ“– Full docs Β· 🌐 frootai.dev/solution-plays/33-voice-ai-agentΒ 

FAI Manifest

FieldValue
Play33-voice-ai-agent
Version1.0.0
KnowledgeF1-GenAI-Foundations, R2-RAG-Architecture, T2-Responsible-AI
WAF Pillarssecurity, reliability, cost-optimization, responsible-ai
Groundednessβ‰₯ 85%
Safety0 violations max
Last updated on