MinerU Document Extraction Tools
📚
582
Easy converting PDF and Office docs into Markdown and JSON
OpenDataLab provides high-quality open datasets and tools for large models. China Large model corpus Data Alliance open source data service designated platform
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale
The Trinity of Consistency as a Defining Principle for General World Models