- Xtra-Computing/XtraGPT-14B
  Text Generation • Updated • 1.26k • 3
- ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
  Paper • 2601.11077 • Published • 67
- Molecular Contrastive Learning with Chemical Element Knowledge Graph
  Paper • 2112.00544 • Published • 1
- Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
  Paper • 2404.00884 • Published • 1
Collections
Collections including paper arxiv:2510.19779

- Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1
  Paper • 2510.19600 • Published • 70
- AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
  Paper • 2510.19779 • Published • 62
- InteractComp: Evaluating Search Agents With Ambiguous Queries
  Paper • 2510.24668 • Published • 99

- lusxvr/nanoVLM-222M
  Image-Text-to-Text • 0.2B • Updated • 196 • 99
- Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
  Paper • 2503.09516 • Published • 39
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
  Paper • 2505.24863 • Published • 97
- QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
  Paper • 2505.17667 • Published • 88

- LightMem: Lightweight and Efficient Memory-Augmented Generation
  Paper • 2510.18866 • Published • 115
- AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
  Paper • 2510.19779 • Published • 62
- Emu3.5: Native Multimodal Models are World Learners
  Paper • 2510.26583 • Published • 114

- Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models
  Paper • 2506.19697 • Published • 44
- Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning
  Paper • 2509.23873 • Published • 68
- Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
  Paper • 2510.00515 • Published • 42
- SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
  Paper • 2509.22944 • Published • 81