Collections

Collections including paper arxiv:2510.13998

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59
BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 107
BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 85
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

Paper • 2502.11880 • Published Feb 17, 2025 • 17

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158
Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 140
mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 322
BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59

Interesting Papers

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15, 2025 • 33
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published Oct 16, 2025 • 42
BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59
GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Paper • 2510.19430 • Published Oct 22, 2025 • 53

Run on CPU Optimizations

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 80

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59

Selected_Trending_Papers

TradingAgents: Multi-Agents LLM Financial Trading Framework

Paper • 2412.20138 • Published Dec 28, 2024 • 47
MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27, 2024 • 41
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 160
Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 140

Paper2Web: Let's Make Your Paper Alive!

Paper • 2510.15842 • Published Oct 17, 2025 • 27
Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 120
BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published Oct 31, 2025 • 13

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26, 2025 • 81
Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 130
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published Oct 15, 2025 • 64
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7, 2025 • 55

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59
