- Contract Tests (Service-to-Service)
- End-to-End Inference Path Tests (Known Input → Expected Output + Logs)
- Regression Tests — Golden Dataset with Per-Subgroup Cases
- Human Oversight Interface Testing (Selenium/Playwright/Cypress Automation)
- Load Testing (Locust, k6) — Latency & Throughput Under Load
- Chaos & Fault Injection Testing (Gremlin, Litmus) — Graceful Degradation