From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models Paper • 2604.09459 • Published 3 days ago • 9
NVIDIA Ising Collection NVIDIA Ising is a new Model Family to enable building useful Quantum Computers with AI. • 4 items • Updated about 9 hours ago • 10
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 3 days ago • 22
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation Paper • 2604.08570 • Published 22 days ago • 115
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models Paper • 2604.10949 • Published 3 days ago • 36
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 5 days ago • 66
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 8 days ago • 70
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 3 days ago • 61
Qualixar OS: A Universal Operating System for AI Agent Orchestration Paper • 2604.06392 • Published 9 days ago • 16
Automating Database-Native Function Code Synthesis with LLMs Paper • 2604.06231 • Published 14 days ago • 17
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 7 days ago • 41
Structured Causal Video Reasoning via Multi-Objective Alignment Paper • 2604.04415 • Published 10 days ago • 8
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks Paper • 2604.08539 • Published 7 days ago • 46
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 8 days ago • 51
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper • 2603.27490 • Published 18 days ago • 16
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 7 days ago • 229
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 8 days ago • 33