Regression Testing¶
The evaluation framework automatically detects performance regressions.
How It Works¶
- First run saves
data/baseline.jsonwith current metrics - Each subsequent run compares against the baseline
- If any metric drops significantly, the run fails
CLI Output¶
When a regression is detected:
Exit code is non-zero, failing CI pipelines.