Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation Paper • 2603.12793 • Published Mar 13 • 38
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation Paper • 2603.12793 • Published Mar 13 • 38
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation Paper • 2603.12793 • Published Mar 13 • 38
TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution Paper • 2602.09662 • Published Feb 10 • 6
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published Jan 29 • 51
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe Paper • 2509.18154 • Published Sep 16, 2025 • 56
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8, 2025 • 31
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation Paper • 2501.06598 • Published Jan 11, 2025 • 2
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation Paper • 2501.06598 • Published Jan 11, 2025 • 2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators Paper • 2502.14752 • Published Feb 20, 2025 • 1