Commit History

docs: add Solidity Eval 2026 pass@1 leaderboard at top — 46.5% beats Claude Opus 4.7 by +7.5pp
9dc92d9
verified

samscrack commited on

Trim eval to self-consistent forge metrics; remove reference-oracle rows (irrelevant to model performance)
82e0c50
verified

samscrack commited on

Add model card with full pipeline details + Stage 4 RFT eval results
566dbe3
verified

samscrack commited on