Collections
Collections including paper arxiv:2602.10809
-
UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented Generation
Paper • 2504.08761 • Published • 7 -
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Paper • 2512.23959 • Published • 111 -
DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal
Paper • 2601.18081 • Published • 8 -
nvidia/nemotron-colembed-vl-8b-v2
Visual Document Retrieval • 9B • Updated • 1.3k • 39
-
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 106 -
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 121 -
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Paper • 2512.23447 • Published • 99 -
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
Paper • 2512.23576 • Published • 66
-
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning
Paper • 2506.22434 • Published • 10 -
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
Paper • 2507.13348 • Published • 79 -
RewardDance: Reward Scaling in Visual Generation
Paper • 2509.08826 • Published • 73 -
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
Paper • 2510.18876 • Published • 37
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 153 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
Endless Terminals: Scaling RL Environments for Terminal Agents
Paper • 2601.16443 • Published • 18 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
Paper • 2601.18778 • Published • 42
-
Benchmark^2: Systematic Evaluation of LLM Benchmarks
Paper • 2601.03986 • Published • 34 -
BabyVision: Visual Reasoning Beyond Language
Paper • 2601.06521 • Published • 201 -
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors
Paper • 2601.07226 • Published • 33 -
CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty
Paper • 2601.22027 • Published • 85
-
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
Paper • 2509.24107 • Published • 80 -
Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
Paper • 2510.08276 • Published • 10 -
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
Paper • 2510.20168 • Published • 28 -
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Paper • 2510.17797 • Published • 11
-
Open Deep Search: Democratizing Search with Open-source Reasoning Agents
Paper • 2503.20201 • Published • 48 -
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Paper • 2503.19470 • Published • 19 -
Spacer: Towards Engineered Scientific Inspiration
Paper • 2508.17661 • Published • 32 -
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Paper • 2509.01396 • Published • 58