Cocoa's picture

6

Cocoa

LeconCoca32

AI & ML interests

None yet

Organizations

None yet

upvoted 6 papers 6 months ago

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Paper • 2510.18941 • Published Oct 21, 2025 • 12

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 117

LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22, 2025 • 63

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85

Language Models are Injective and Hence Invertible

Paper • 2510.15511 • Published Oct 17, 2025 • 70

Attention Sinks in Diffusion Language Models

Paper • 2510.15731 • Published Oct 17, 2025 • 50