Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2512.12602

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44
Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132
DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 80

YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Paper • 2512.23273 • Published Dec 29, 2025 • 15
A 58-Addition, Rank-23 Scheme for General 3x3 Matrix Multiplication

Paper • 2512.21980 • Published Dec 26, 2025 • 3
Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 87
SAM Audio: Segment Anything in Audio

Paper • 2512.18099 • Published Dec 19, 2025 • 24

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 110

数学严谨 or 启发性高的人工智能论文

侧重于世界观、方法论

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

Representation & Optimization

Understanding about representation sheds light on optimization

Nuclear Norm Regularization for Deep Learning

Paper • 2405.14544 • Published May 23, 2024 • 1
Token embeddings violate the manifold hypothesis

Paper • 2504.01002 • Published Apr 1, 2025 • 1
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers

Paper • 2403.10476 • Published Mar 15, 2024 • 1
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning

Paper • 2504.00254 • Published Mar 31, 2025 • 1

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44
Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132
DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 80

数学严谨 or 启发性高的人工智能论文

侧重于世界观、方法论

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Paper • 2512.23273 • Published Dec 29, 2025 • 15
A 58-Addition, Rank-23 Scheme for General 3x3 Matrix Multiplication

Paper • 2512.21980 • Published Dec 26, 2025 • 3
Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 87
SAM Audio: Segment Anything in Audio

Paper • 2512.18099 • Published Dec 19, 2025 • 24

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 110

Representation & Optimization

Understanding about representation sheds light on optimization

Nuclear Norm Regularization for Deep Learning

Paper • 2405.14544 • Published May 23, 2024 • 1
Token embeddings violate the manifold hypothesis

Paper • 2504.01002 • Published Apr 1, 2025 • 1
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers

Paper • 2403.10476 • Published Mar 15, 2024 • 1
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning

Paper • 2504.00254 • Published Mar 31, 2025 • 1

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs