Skip to main content

FAI AI Infra Expert

AI infrastructure expert โ€” GPU compute sizing (A100/H100), VRAM estimation, model serving (vLLM/TensorRT-LLM/Triton), AKS node pool design, PTU vs PAYG cost modeling, and quantization strategies.

Overviewโ€‹

PropertyValue
TypeAgent
Fileagents/fai-ai-infra-expert.agent.md
Toolscodebase, terminal, azure
Modelgpt-4o, gpt-4o-mini
WAF Alignmentperformance-efficiency, cost-optimization, reliability
Compatible Plays02-ai-landing-zone, 12-model-serving-aks
Lines107

Usageโ€‹

In VS Code (GitHub Copilot)โ€‹

@fai-ai-infra-expert How can you help me?

In fai-manifest.jsonโ€‹

{
"primitives": {
"agents": ["../../agents/fai-ai-infra-expert.agent.md"]
}
}

Sourceโ€‹


Auto-generated from the FrootAI primitive catalog. Last updated: 2026-04-20.