Infinity Instruct Collection: Scaling Instruction Selection and Synthesis to Enhance Language Models • 17 items • Updated Feb 4 • 11
OctoPack: Instruction Tuning Code Large Language Models Paper • 2308.07124 • Published Aug 14, 2023 • 33
ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget Paper • 2604.01195 • Published 12 days ago • 3
MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios Paper • 2603.28130 • Published 14 days ago • 11