SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Paper
• 2509.22944
• Published • 81
Robot Learning: A Tutorial
Paper
• 2510.12403
• Published • 129
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE
Paper
• 2510.13344
• Published • 64
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
Paper
• 2510.06308
• Published • 55
Training-Free Group Relative Policy Optimization
Paper
• 2510.08191
• Published • 46
Detect Anything via Next Point Prediction
Paper
• 2510.12798
• Published • 50
RLP: Reinforcement as a Pretraining Objective
Paper
• 2510.01265
• Published • 45
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
Paper
• 2510.06710
• Published • 43
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Paper
• 2510.11341
• Published • 35
Paper
• 2510.13998
• Published • 59
Agentic Entropy-Balanced Policy Optimization
Paper
• 2510.14545
• Published • 108