Contributing

Development Setup

uv sync
uv run lefthook install

Linting

uv run ruff check .

Testing

uv run pytest tests/ -v

Adding to the Golden Dataset

  1. Add examples to raghelm/eval/golden_dataset.json
  2. Use unique IDs matching the pattern source_NNN
  3. Use valid categories: factual_lookup, comparison, synthesis, temporal, contradictory
  4. Run validation: uv run python -c "from raghelm.eval.golden_dataset import load_golden_dataset, validate_dataset; d=load_golden_dataset('raghelm/eval/golden_dataset.json'); print(validate_dataset(d))"
  5. Run the test suite to ensure nothing breaks
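
For a quick local pre-check of the ID and category rules in steps 2 and 3, something like the following sketch may help. It does not replace validate_dataset. It assumes each example is a dict with "id" and "category" keys, and that source_NNN means a source name followed by an underscore and three digits (e.g. wiki_001). Both are guesses about the schema, and the helper name precheck_examples is hypothetical.

```python
import re
from collections import Counter

# Assumption: IDs look like "<source>_<three digits>", e.g. "wiki_001".
ID_PATTERN = re.compile(r"^[A-Za-z0-9]+_\d{3}$")

# Categories listed in step 3.
VALID_CATEGORIES = {
    "factual_lookup",
    "comparison",
    "synthesis",
    "temporal",
    "contradictory",
}


def precheck_examples(examples):
    """Return a list of problem descriptions; an empty list means none were found.

    Hypothetical helper: assumes each example is a dict with "id" and
    "category" keys, which may not match the real dataset schema.
    """
    problems = []
    id_counts = Counter(ex.get("id") for ex in examples)
    for ex in examples:
        ex_id = ex.get("id")
        category = ex.get("category")
        if not isinstance(ex_id, str) or not ID_PATTERN.fullmatch(ex_id):
            problems.append(f"bad id: {ex_id!r}")
        elif id_counts[ex_id] > 1:
            # Reported once per occurrence of the duplicated ID.
            problems.append(f"duplicate id: {ex_id!r}")
        if category not in VALID_CATEGORIES:
            problems.append(f"bad category for {ex_id!r}: {category!r}")
    return problems


if __name__ == "__main__":
    sample = [
        {"id": "wiki_001", "category": "comparison"},
        {"id": "wiki_001", "category": "temporal"},
        {"id": "wiki-2", "category": "trivia"},
    ]
    for problem in precheck_examples(sample):
        print(problem)
```

The function returns a list instead of raising, so every problem in a batch of new examples shows up in one pass.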