Changelog

Unreleased

Reframed documentation around private RAG/agent release control.
Renumbered ADRs by priority and scope.
Clarified Pinecone-first reference runtime vs backend-neutral release evidence layer.
Added security/compliance boundary and public-proof evidence rules.

Initial evaluation suite release.
100-item golden eval dataset from open-license sources.
Retrieval metrics: Recall@k, MRR, NDCG@k.
Generation scoring for faithfulness, relevance, completeness, and overall quality.
Regression testing with baseline comparison.
Latency benchmarking harness.
Branded SVG badge generation.
CLI via python -m raghelm eval.
CI integration with GitHub Actions.
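
For readers unfamiliar with the retrieval metrics named above, the following is a minimal sketch of Recall@k, reciprocal rank (MRR is its mean over queries), and NDCG@k with binary relevance. It is not taken from the raghelm codebase: the function names, the binary-relevance assumption, and the sample document IDs are all hypothetical, and the suite's own implementation may differ (for example, by using graded relevance).

```python
import math
from typing import Sequence, Set


def recall_at_k(ranked: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of relevant documents that appear in the top k results."""
    if not relevant:
        return 0.0
    hits = len(set(ranked[:k]) & relevant)
    return hits / len(relevant)


def reciprocal_rank(ranked: Sequence[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant result, or 0.0 if none is retrieved.

    MRR is the mean of this value across all queries in the dataset.
    """
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def ndcg_at_k(ranked: Sequence[str], relevant: Set[str], k: int) -> float:
    """NDCG@k with binary gains (assumption: each doc is relevant or not).

    Assumes `ranked` contains no duplicate document IDs.
    """
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, doc_id in enumerate(ranked[:k], start=1)
        if doc_id in relevant
    )
    # Ideal ordering places every relevant document first.
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical query: four retrieved IDs, two of which are relevant.
ranked = ["d3", "d1", "d7", "d2"]
relevant = {"d1", "d2"}

print(recall_at_k(ranked, relevant, k=3))
print(reciprocal_rank(ranked, relevant))
print(ndcg_at_k(ranked, relevant, k=3))
```

In this example only one of the two relevant documents falls within the top three, so Recall@3 is 0.5, and the first relevant hit sits at rank 2, so the reciprocal rank is also 0.5.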