Operations: observability and HA

Production deployments need redundancy and instrumentation. N+1 HA doubles your GPU footprint but is non-negotiable for revenue workloads.

Customer Support RAG

Observability level
High availability