civicsetu / eval

Commit History

feat(eval): RAGAS evaluation framework + RAG pipeline improvements
f8b04c3

adeshboudh16 Claude Sonnet 4.6 commited on

fix: improve RAG retrieval quality and reduce generator hallucination
98da9ee

adeshboudh16 Claude Sonnet 4.6 commited on

eval: fix MULTI-XREF-001 ground truth and add MULTI-TEMP-001 row
2139758

adeshboudh16 Claude Sonnet 4.6 commited on

eval: add 30-row golden dataset for RAGAS benchmark
aedfb7f

adeshboudh16 commited on