-
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics
Paper • 2512.12602 • Published • 44 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132 -
DoPE: Denoising Rotary Position Embedding
Paper • 2511.09146 • Published • 98 -
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
Paper • 2510.25602 • Published • 80
Collections
Discover the best community collections!
Collections including paper arxiv:2511.03276
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 128 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 88
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 47 -
MinerU: An Open-Source Solution for Precise Document Content Extraction
Paper • 2409.18839 • Published • 41 -
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 159 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 140
-
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 58 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
Attention Sinks in Diffusion Language Models
Paper • 2510.15731 • Published • 50 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132
-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 170 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 134 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 70
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 263 -
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper • 2507.15846 • Published • 135 -
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Paper • 2507.22827 • Published • 101 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 218
-
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics
Paper • 2512.12602 • Published • 44 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132 -
DoPE: Denoising Rotary Position Embedding
Paper • 2511.09146 • Published • 98 -
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
Paper • 2510.25602 • Published • 80
-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 170 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 134 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 70
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 128 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 88
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 47 -
MinerU: An Open-Source Solution for Precise Document Content Extraction
Paper • 2409.18839 • Published • 41 -
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 159 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 140
-
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 58 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
Attention Sinks in Diffusion Language Models
Paper • 2510.15731 • Published • 50 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 132
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 263 -
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper • 2507.15846 • Published • 135 -
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Paper • 2507.22827 • Published • 101 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 218