Skip to Content
Solution PlaysPlay 94: Play 94 — AI Podcast Generator 🎙️

Play 94 — AI Podcast Generator 🎙️

AI podcast production — script writing, multi-voice Neural TTS with SSML prosody, audio post-processing, music integration, automated publishing.

Build an AI podcast generator. GPT-4o writes conversational scripts in 4 formats (interview, monologue, debate, panel), Azure Neural TTS synthesizes multiple distinct voices with SSML prosody control, pydub handles post-processing (crossfades, -16 LUFS normalization, intro/outro music), and RSS feeds auto-publish to podcast directories.

Quick Start

cd solution-plays/94-ai-podcast-generator az deployment group create -g $RG -f infra/main.bicep -p infra/parameters.json code . # Use @builder to implement, @reviewer to audit, @tuner to optimize

Architecture

📐 Full architecture details

ServicePurpose
Azure OpenAI (gpt-4o)Script generation + research (temp 0.7)
Azure Speech ServiceMulti-voice Neural TTS with SSML
Azure Content SafetyScript content moderation
Azure AI Search (Basic)Source material for research grounding
Azure Storage + CDNAudio hosting + delivery
Container AppsGeneration API + management dashboard

Pre-Tuned Defaults

  • Script: 4 formats · 150 words/min · source citations · conversational markers · humor
  • Voice: 7 Neural voices · SSML prosody (rate, pitch, express-as) · 800ms speaker pauses
  • Audio: -16 LUFS · 320kbps MP3 · 15s intro + 20s outro · crossfade transitions
  • Publishing: RSS 2.0 with iTunes tags · SRT transcript · CDN delivery

DevKit (AI-Assisted Development)

PrimitiveWhat It Does
agent.mdRoot orchestrator with builder→reviewer→tuner handoffs
copilot-instructions.mdPodcast domain (script formats, SSML, TTS voices, audio production)
3 agentsBuilder (gpt-4o), Reviewer (gpt-4o-mini), Tuner (gpt-4o-mini)
3 skillsDeploy (215+ lines), Evaluate (110+ lines), Tune (230+ lines)
4 prompts/deploy, /test, /review, /evaluate with agent routing

Cost Estimate

ServiceDev/moProd/moEnterprise/mo
Azure AI Speech$5 (Free tier)$250 (Standard Neural)$800 (Custom Neural Voice)
Azure OpenAI$20 (PAYG)$280 (PAYG)$1,000 (PTU Reserved)
Blob Storage$2 (Hot LRS)$40 (Hot LRS)$150 (Hot GRS)
Azure CDN$5 (Standard Microsoft)$120 (Standard Verizon)$400 (Premium Verizon)
Azure Functions$0 (Consumption)$180 (Premium EP2)$450 (Premium EP3)
Key Vault$1 (Standard)$5 (Standard)$10 (Standard)
Application Insights$0 (Free)$25 (Pay-per-GB)$80 (Pay-per-GB)
Total$33$900$2,890

💰 Full cost breakdown

vs. Play 04 (Call Center Voice AI)

AspectPlay 04Play 94
FocusReal-time call handlingBatch podcast production
VoiceSTT + TTS (bidirectional)TTS only (script → audio)
FormatCustomer call (reactive)Multi-format podcast (creative)
ProcessingReal-time streamingBatch post-processing + publishing

📖 Full documentation · 🌐 frootai.dev/solution-plays/94-ai-podcast-generator  · 📦 FAI Protocol

FAI Manifest

FieldValue
Play94-ai-podcast-generator
Version1.0.0
KnowledgeF1-GenAI-Foundations, T3-Production-Patterns, T2-Responsible-AI
WAF Pillarsresponsible-ai, performance-efficiency, cost-optimization, reliability
Last updated on