Collections

Collections including paper arxiv:2510.19779

Xtra-Computing/XtraGPT-14B

Text Generation • Updated Dec 8, 2025 • 1.26k • 3
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published Jan 16 • 67
Molecular Contrastive Learning with Chemical Element Knowledge Graph

Paper • 2112.00544 • Published Dec 1, 2021 • 1
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

Paper • 2404.00884 • Published Apr 1, 2024 • 1

The Builder's Arsenal

Turning theory into weapons

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published Oct 22, 2025 • 70
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published Oct 22, 2025 • 62
InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published Oct 28, 2025 • 99

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8, 2025 • 196 • 99
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 39
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published Oct 21, 2025 • 115
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published Oct 22, 2025 • 62
Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 114

Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Paper • 2506.19697 • Published Jun 24, 2025 • 44
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Paper • 2509.23873 • Published Sep 28, 2025 • 68
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 42
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26, 2025 • 81
