Play 25: Conversation Memory Layer 🧠💾

Persistent AI memory with tiered storage: short-term, long-term, and episodic recall.

Give your AI agent persistent memory across conversations. Redis handles short-term session state, Cosmos DB stores compressed long-term summaries, and a vector store enables episodic recall of key facts and preferences. The layer is PII-aware and supports GDPR-compliant deletion.
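
As a minimal sketch of what PII scrubbing and a GDPR-style delete could look like, the Python below redacts email addresses and phone-like numbers before a turn is stored, then removes every record for one user from all three tiers. The regexes and the names `scrub_pii` and `forget_user` are hypothetical, and plain dicts stand in for Redis, Cosmos DB, and the vector store; real PII detection would need much broader coverage.

```python
import re

# Illustrative patterns only (assumption); production PII detection needs more coverage.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


def scrub_pii(text: str) -> str:
    """Replace email addresses and phone-like numbers with placeholders."""
    text = EMAIL_RE.sub("[EMAIL]", text)
    return PHONE_RE.sub("[PHONE]", text)


def forget_user(user_id: str, tiers: dict[str, dict[str, list[str]]]) -> int:
    """Delete every record for user_id from each tier; return the number removed."""
    removed = 0
    for store in tiers.values():
        removed += len(store.pop(user_id, []))
    return removed


# Plain dicts stand in for Redis, Cosmos DB, and the vector store (assumption).
tiers = {
    "short_term": {"u1": [scrub_pii("Mail me at ana@example.com")]},
    "long_term": {"u1": ["Prefers email contact"], "u2": ["Likes dark mode"]},
    "episodic": {"u1": ["Decided on the Pro plan"]},
}
```

Calling `forget_user("u1", tiers)` clears that user from every tier in one pass and leaves other users untouched, which is the property a delete request needs regardless of the backing stores.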

Quick Start

cd solution-plays/25-conversation-memory-layer
az deployment group create -g $RG -f infra/main.bicep -p infra/parameters.json
code .  # Use @builder for memory tiers, @reviewer for privacy audit, @tuner for compression

Architecture

| Service | Purpose |
|---|---|
| Redis Cache | Short-term memory (session state, 15-min TTL) |
| Cosmos DB | Long-term memory (compressed summaries, 90-day TTL) |
| Vector Store | Episodic memory (key facts, similarity-based recall) |
| Azure OpenAI (gpt-4o-mini) | Conversation compression |
| Azure OpenAI (embedding) | Memory embedding for vector recall |
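
Similarity-based recall from the vector store can be illustrated with cosine similarity over embeddings. This is only a sketch under stated assumptions: the 0.8 threshold, the `top_k` of 3, the function names, and the toy 3-dimensional vectors are all hypothetical, and a real deployment would use embeddings from the embedding model and the vector store's own search.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def recall(query_vec, memories, threshold=0.8, top_k=3):
    """Return up to top_k (score, fact) pairs at or above threshold, best first."""
    scored = [(cosine(query_vec, vec), fact) for fact, vec in memories]
    hits = [pair for pair in scored if pair[0] >= threshold]
    return sorted(hits, key=lambda pair: pair[0], reverse=True)[:top_k]


# Toy 3-dimensional vectors (assumption); real embeddings have far more dimensions.
memories = [
    ("Prefers vegetarian recipes", [0.9, 0.1, 0.0]),
    ("Works in the Berlin office", [0.0, 0.2, 0.9]),
    ("Allergic to peanuts", [0.8, 0.3, 0.1]),
]
results = recall([1.0, 0.2, 0.0], memories)
```

Here the two food-related facts clear the threshold and the office fact does not, so raising or lowering `threshold` trades recall breadth against precision.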

Memory Tiers

| Tier | Storage | TTL | What It Stores |
|---|---|---|---|
| Short-term | Redis | 15 min | Current conversation turns |
| Long-term | Cosmos DB | 90 days | Compressed summaries |
| Episodic | Vector store | 1 year | Key facts, preferences, decisions |
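
The per-tier TTLs can be sketched with an in-memory stand-in that expires entries lazily on read. The class name `TieredMemory`, the injected clock, and treating "1 year" as 365 days are assumptions for illustration; in a real deployment the stores' own expiry mechanisms would do this work.

```python
import time

# TTLs in seconds, taken from the tier table; "1 year" is assumed to mean 365 days.
TIER_TTL = {
    "short_term": 15 * 60,
    "long_term": 90 * 24 * 3600,
    "episodic": 365 * 24 * 3600,
}


class TieredMemory:
    """In-memory stand-in for the three tiers, with lazy expiry on read."""

    def __init__(self, clock=time.time):
        self._clock = clock
        self._items = {tier: {} for tier in TIER_TTL}

    def put(self, tier, key, value):
        """Store value under key and stamp it with the tier's expiry time."""
        expires_at = self._clock() + TIER_TTL[tier]
        self._items[tier][key] = (value, expires_at)

    def get(self, tier, key):
        """Return the value, or None if it is missing or past its TTL."""
        entry = self._items[tier].get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self._clock() >= expires_at:
            del self._items[tier][key]
            return None
        return value


# A controllable clock makes expiry easy to demonstrate.
now = [0.0]
mem = TieredMemory(clock=lambda: now[0])
mem.put("short_term", "session-1", ["hi", "hello"])
mem.put("long_term", "user-1", "Summary of earlier chats")
now[0] = 16 * 60  # 16 minutes later
expired_turns = mem.get("short_term", "session-1")
kept_summary = mem.get("long_term", "user-1")
```

After 16 minutes the short-term entry is gone while the long-term summary is still readable, which mirrors the intended hand-off from session state to compressed history.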

Key Metrics

  • Recall precision: ≥85% · Compression: 4000→200 tokens · PII scrub: 100% · Latency: <200 ms
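
To make the targets concrete, the sketch below works through the arithmetic: 4000→200 tokens is a 20× compression ratio, or a 95% reduction. It also computes recall precision under the assumption that the term means the fraction of recalled memories that are actually relevant; the source does not define the metric, and the function names are hypothetical.

```python
def compression_ratio(tokens_before: int, tokens_after: int) -> float:
    """How many times smaller the compressed text is."""
    return tokens_before / tokens_after


def reduction_pct(tokens_before: int, tokens_after: int) -> float:
    """Percentage of tokens removed by compression."""
    return 100 * (tokens_before - tokens_after) / tokens_before


def recall_precision(retrieved: list[str], relevant: set[str]) -> float:
    """Fraction of retrieved memories that are relevant (assumed definition)."""
    if not retrieved:
        return 0.0
    return sum(1 for item in retrieved if item in relevant) / len(retrieved)


ratio = compression_ratio(4000, 200)
saved = reduction_pct(4000, 200)
# Hypothetical evaluation case: 4 memories recalled, 3 of them relevant.
precision = recall_precision(["a", "b", "c", "d"], {"a", "b", "c"})
meets_target = precision >= 0.85
```

In this hypothetical case the precision is 0.75, so the recall step would fall short of the ≥85% target and the similarity threshold would be a natural knob to revisit.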

DevKit (Memory-Focused)

| Primitive | What It Does |
|---|---|
| 3 agents | Builder (tiers/compression/recall), Reviewer (privacy/PII/consent), Tuner (thresholds/TTL/cost) |
| 3 skills | Deploy (102 lines), Evaluate (103 lines), Tune (101 lines) |
| 4 prompts | /deploy (tiered memory), /test (recall), /review (PII/privacy), /evaluate (compression quality) |

Note: This is a memory/context management play. TuneKit covers compression prompts, recall similarity thresholds, per-tier TTLs, embedding model selection, and storage cost per user (~$0.04/user/mo with pruning), not AI inference quality.
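
One way the compression step might be triggered is sketched below: once the buffered turns exceed a token budget, they are collapsed into a single summary. Everything here is an assumption for illustration, including the 4000-token budget as a trigger, the whitespace-based token estimate, and the injected `summarize` callable, which stands in for a call to the compression model.

```python
from typing import Callable


def count_tokens(text: str) -> int:
    """Rough estimate: whitespace-separated words (assumption, not a real tokenizer)."""
    return len(text.split())


def compress_if_needed(
    turns: list[str],
    summarize: Callable[[str], str],
    budget: int = 4000,
) -> list[str]:
    """Collapse turns into a single summary once they exceed the token budget."""
    total = sum(count_tokens(turn) for turn in turns)
    if total <= budget:
        return turns
    return [summarize("\n".join(turns))]


def fake_summarize(transcript: str) -> str:
    """Stand-in summarizer; a real one would call the compression model."""
    return "SUMMARY: " + transcript[:40]


under_budget = compress_if_needed(["a b c"], fake_summarize, budget=5)
over_budget = compress_if_needed(["a b c", "d e f"], fake_summarize, budget=5)
```

Passing the summarizer in as a callable keeps the trigger logic testable without a model call, and lets the compression prompt be tuned independently.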

Cost

| Dev | Prod (100K users) |
|---|---|
| $30–80/mo | ~$4,000/mo ($0.04/user) |
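
The production figure follows from the per-user rate: 100,000 users × $0.04 = $4,000 per month. The sketch below projects the same rate onto other user counts. Linear scaling is an assumption, since fixed infrastructure costs would dominate at small scale, and the function name is hypothetical.

```python
from decimal import Decimal

# Per-user monthly storage cost quoted in this play (with pruning).
COST_PER_USER = Decimal("0.04")


def monthly_storage_cost(users: int, cost_per_user: Decimal = COST_PER_USER) -> Decimal:
    """Projected monthly cost in USD, assuming cost scales linearly with users."""
    return users * cost_per_user


for users in (1_000, 10_000, 100_000):
    print(f"{users:>7,} users -> ${monthly_storage_cost(users):,.2f}/mo")
```

`Decimal` is used instead of floats so that currency arithmetic stays exact.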

📖 Full docs · 🌐 frootai.dev/solution-plays/25-conversation-memory-layer

FAI Manifest

| Field | Value |
|---|---|
| Play | 25-conversation-memory-layer |
| Version | 1.0.0 |
| Knowledge | R2-RAG-Architecture, O2-Agent-Coding, O1-Semantic-Kernel, F1-GenAI-Foundations |
| WAF Pillars | security, reliability, performance-efficiency, cost-optimization, responsible-ai |
| Groundedness | ≥ 85% |
| Safety | 0 violations max |