junyuan

Carkham

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

opendatalab/MinerU2.5-Pro-2604-1.2B

authored a paper 14 days ago

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

authored a paper 14 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Organizations

None yet

liked a model 3 days ago

opendatalab/MinerU2.5-Pro-2604-1.2B

Image-Text-to-Text • 1B • Updated 3 days ago • 798 • 38

authored 5 papers 14 days ago

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

Paper • 2502.11494 • Published Feb 17, 2025

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 156

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 42

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Paper • 2512.01248 • Published Dec 1, 2025 • 12

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

Paper • 2512.10619 • Published Dec 11, 2025

upvoted a paper 2 months ago

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published Jan 27 • 81

updated a Space 4 months ago

TRivia-3B

⭐

Convert table images into HTML tags with TRivia-3B

liked a Space 4 months ago

TRivia-3B

⭐

Convert table images into HTML tags with TRivia-3B

liked a model 4 months ago

opendatalab/TRivia-3B

Image-Text-to-Text • 4B • Updated Dec 2, 2025 • 509 • 8

upvoted a paper 4 months ago

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Paper • 2512.01248 • Published Dec 1, 2025 • 12

commented a paper 4 months ago

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Paper • 2512.01248 • Published Dec 1, 2025 • 12

updated a model 4 months ago

opendatalab/TRivia-3B

Image-Text-to-Text • 4B • Updated Dec 2, 2025 • 509 • 8

published a Space 4 months ago

TRivia-3B

⭐

Convert table images into HTML tags with TRivia-3B

published a model 4 months ago

opendatalab/TRivia-3B

Image-Text-to-Text • 4B • Updated Dec 2, 2025 • 509 • 8

upvoted 5 papers 6 months ago

Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition

Paper • 2510.01068 • Published Oct 1, 2025 • 21

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 42
