GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities Paper • 2507.12367 • Published Jul 16, 2025 • 7
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper • 2509.25531 • Published Sep 29, 2025 • 10
Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics Paper • 2603.01209 • Published Mar 1
FreshBrew: A Benchmark for Evaluating AI Agents on Java Code Migration Paper • 2510.04852 • Published Oct 13, 2025
ontocord/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-merged Feature Extraction • 2B • Updated 4 days ago • 315
ontocord/0.4b-mixturevitae-v1-decontaminated-300B-4096-longsft_16k Feature Extraction • 0.4B • Updated 8 days ago • 28
ontocord/0.4b-mixturevitae-v1-decontaminated-300B-4096-longsft_16k Feature Extraction • 0.4B • Updated 8 days ago • 28
ontocord/1.7b-MixtureVitae-web_curated-100BT-longsft_16k Feature Extraction • 2B • Updated 8 days ago • 33
ontocord/1.7b-MixtureVitae-web_curated-100BT-longsft_16k Feature Extraction • 2B • Updated 8 days ago • 33
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 19
ontocord/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k Feature Extraction • 2B • Updated 9 days ago • 25
ontocord/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k Feature Extraction • 2B • Updated 9 days ago • 25
ontocord/1.7b-Comma0.1-300BT-longsft_16k Feature Extraction • 2B • Updated 10 days ago • 32 • 1
ontocord/1.7b-Comma0.1-300BT-longsft_16k Feature Extraction • 2B • Updated 10 days ago • 32 • 1