Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 19
Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models Paper • 2602.01849 • Published Feb 2 • 5
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 18 days ago • 68
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 10 days ago • 38
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 10 days ago • 38
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published Mar 16 • 185
Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models Paper • 2601.18129 • Published Jan 26 • 11
Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models Paper • 2601.18129 • Published Jan 26 • 11
On the Robustness of Answer Formats in Medical Reasoning Models Paper • 2509.20866 • Published Sep 25, 2025 • 2
Extending Audio Context for Long-Form Understanding in Large Audio-Language Models Paper • 2510.15231 • Published Oct 17, 2025
ThaiOCRBench: A Task-Diverse Benchmark for Vision-Language Understanding in Thai Paper • 2511.04479 • Published Nov 6, 2025 • 1
AudioJudge: Understanding What Works in Large Audio Model Based Speech Evaluation Paper • 2507.12705 • Published Jul 17, 2025
Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition Paper • 2601.13044 • Published Jan 19 • 12
Typhoon OCR: Open Vision-Language Model For Thai Document Extraction Paper • 2601.14722 • Published Jan 21 • 15
Mangosteen: An Open Thai Corpus for Language Model Pretraining Paper • 2507.14664 • Published Jul 19, 2025 • 7