Anurag Yadav
harryadav3
AI & ML interests
None yet
Recent Activity
updated a model 1 day ago: harryadav3/Qwen3-30B-A3B-REAP-50
published a model 1 day ago: harryadav3/Qwen3-30B-A3B-REAP-50
updated a model 3 days ago: harryadav3/nanochat-d24-chat
Organizations
None yet
web-crawlers
llms-mlm
3d-4d/embodied
LLMS
- A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
  Paper • 2508.21148 • Published • 142
- InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
  Paper • 2504.10479 • Published • 308
- Fara-7B: An Efficient Agentic Model for Computer Use
  Paper • 2511.19663 • Published • 17
audio
RL
mech-inter
videogeneration
agentic ai
- The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
  Paper • 2509.02547 • Published • 238
- AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
  Paper • 2508.16279 • Published • 61
- Scaling Agents via Continual Pre-training
  Paper • 2509.13310 • Published • 117
- Agent Lightning: Train ANY AI Agents with Reinforcement Learning
  Paper • 2508.03680 • Published • 140
parsing
- MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
  Paper • 2509.22186 • Published • 156
- PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
  Paper • 2510.14528 • Published • 124
- MinerU: An Open-Source Solution for Precise Document Content Extraction
  Paper • 2409.18839 • Published • 41