MinerU-Diffusion: Document OCR Rethought as Inverse Rendering, 3x Faster

OpenDataLab’s 2.5B-parameter model reframes document OCR as inverse rendering, using parallel diffusion decoding to achieve up to 3.26x faster text extraction at near-perfect accuracy.
artificial-intelligence
Author

Kabui, Charles

Published

2026-03-30

Keywords

document-ocr, diffusion-decoding, inverse-rendering, mineru-diffusion, document-parsing