-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 152 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 149 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 153 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 119
Collections
Discover the best community collections!
Collections including paper arxiv:2601.21204
-
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
Paper • 2602.01058 • Published • 44 -
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss
Paper • 2602.02493 • Published • 46 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
Paper • 2602.04804 • Published • 50
-
Endless Terminals: Scaling RL Environments for Terminal Agents
Paper • 2601.16443 • Published • 18 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
Paper • 2601.18778 • Published • 42
-
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
jina-embeddings-v5-text: Task-Targeted Embedding Distillation
Paper • 2602.15547 • Published • 26 -
ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation
Paper • 2602.20093 • Published • 29 -
Distribution-Conditioned Transport
Paper • 2603.04736 • Published • 3
-
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 290 -
Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs
Paper • 2602.02338 • Published • 42 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 204
-
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery
Paper • 2601.19325 • Published • 81 -
TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers
Paper • 2601.14133 • Published • 61 -
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods
Paper • 2601.21821 • Published • 62
-
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security
Paper • 2601.18491 • Published • 125 -
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents
Paper • 2601.16746 • Published • 91 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Paper • 2512.23959 • Published • 111
-
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
Paper • 2601.10527 • Published • 26 -
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
Paper • 2601.10657 • Published • 20 -
TranslateGemma Technical Report
Paper • 2601.09012 • Published • 20 -
Recursive Language Models
Paper • 2512.24601 • Published • 94
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 152 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 149 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 153 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 119
-
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 290 -
Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs
Paper • 2602.02338 • Published • 42 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 204
-
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
Paper • 2602.01058 • Published • 44 -
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss
Paper • 2602.02493 • Published • 46 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
Paper • 2602.04804 • Published • 50
-
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery
Paper • 2601.19325 • Published • 81 -
TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers
Paper • 2601.14133 • Published • 61 -
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods
Paper • 2601.21821 • Published • 62
-
Endless Terminals: Scaling RL Environments for Terminal Agents
Paper • 2601.16443 • Published • 18 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
Paper • 2601.18778 • Published • 42
-
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security
Paper • 2601.18491 • Published • 125 -
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents
Paper • 2601.16746 • Published • 91 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Paper • 2512.23959 • Published • 111
-
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
jina-embeddings-v5-text: Task-Targeted Embedding Distillation
Paper • 2602.15547 • Published • 26 -
ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation
Paper • 2602.20093 • Published • 29 -
Distribution-Conditioned Transport
Paper • 2603.04736 • Published • 3
-
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
Paper • 2601.10527 • Published • 26 -
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
Paper • 2601.10657 • Published • 20 -
TranslateGemma Technical Report
Paper • 2601.09012 • Published • 20 -
Recursive Language Models
Paper • 2512.24601 • Published • 94