Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published Jan 30 • 60
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published Feb 27 • 88
Qwen/Qwen3.5-397B-A17B Image-Text-to-Text • 403B • Updated about 1 month ago • 782k • 1.44k