PMM Tooling The PMM tooling landscape spans several categories. Metric collection and visualisation: Prometheus for metric ingestion and alerting, Grafana for dashboards, Datadog or New Relic for integrated observability. Drift and performance estimation: Evidently AI (open-source, computes drift using PSI, JS divergence, KS test; generates structured reports), NannyML (open-source, CBPE performance estimation without ground truth, drift detection). Fairness monitoring: Fairlearn for fairness metric computation, Aequitas for automated bias reporting. LLM monitoring: RAGAS and Trulens for hallucination and quality evaluation, Lakera Guard and NeMo Guardrails for safety monitoring. For smaller organisations, the open-source stack (Prometheus, Grafana, Evidently AI, NannyML, Fairlearn) provides comprehensive capability with no licence cost. For larger organisations, integrated platforms (Datadog, Arize, WhyLabs, Arthur) provide managed infrastructure at scale. The PMM plan documents the chosen tooling, the rationale for selection, and the integration architecture connecting the tools into a coherent monitoring pipeline. Key outputs
- Tooling selection documented with rationale
- Open-source stack for smaller organisations
- Integrated platforms for larger portfolios
- Integration architecture connecting tools into coherent pipeline