Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 4 days ago • 136
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 10 days ago • 278
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 11 days ago • 316
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 29 days ago • 338
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 20 days ago • 340
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 23 days ago • 354
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 16 days ago • 361
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 17 days ago • 480
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 32
Welcome Gemma 4: Frontier multimodal intelligence on device Article • 17 days ago • 864
Refusal in Language Models Is Mediated by a Single Direction Paper • 2406.11717 • Published Jun 17, 2024 • 9
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning Paper • 2603.04597 • Published Mar 4 • 210
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published Mar 17 • 308
Coding Datasets Collection These are the best coding corpora for making LLMs strong enough to surpass proprietary ones; they can be used in both pre-training and post-training. • 15 items • Updated 19 days ago • 1
Distillation Datasets Collection These datasets can be used to fine-tune small LLMs to reach the level of closed models and large open LLMs. • 41 items • Updated 16 days ago • 2