Skip to main content

FAI Model Serving AKS Tuner

Model Serving AKS tuner โ€” GPU SKU selection, vLLM memory/batching optimization, quantization decisions, autoscaling thresholds, and inference cost analysis.

Overviewโ€‹

PropertyValue
TypeAgent
Fileagents/fai-play-12-tuner.agent.md
Toolscodebase, terminal
Modelgpt-4o, gpt-4o-mini
WAF Alignmentcost-optimization, performance-efficiency
Compatible Plays12-model-serving-aks
Lines56

Usageโ€‹

In VS Code (GitHub Copilot)โ€‹

@fai-play-12-tuner How can you help me?

In fai-manifest.jsonโ€‹

{
"primitives": {
"agents": ["../../agents/fai-play-12-tuner.agent.md"]
}
}

Sourceโ€‹


Auto-generated from the FrootAI primitive catalog. Last updated: 2026-04-20.