PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend
PaddleOCR 3.5 adds a Transformers inference backend, so developers can run its OCR and document parsing models directly within the Hugging Face ecosystem, lowering the integration barrier for applications such as RAG.
Hugging Face Blog · May 18, 2026
Introducing ParseBench: The First Document Parsing Benchmark for AI Agents
LlamaIndex releases ParseBench, the first document parsing benchmark designed for AI agents, showing that the traditional OCR standard of 'human-readable' output falls short of the 'absolute correctness' that agents require.
LlamaIndex Blog
LiteParse v2.0 Runs Everywhere
LlamaIndex rewrote its lightweight PDF parser, LiteParse, in Rust. The rewrite runs across languages and platforms (including the browser) with up to 100x performance gains, giving real-time AI applications a critical piece of infrastructure.
LlamaIndex Blog
LlamaIndex Newsletter 2026-04-14
LlamaIndex releases ParseBench, the first OCR benchmark for AI agents, alongside tools tackling structural loss and security in document parsing, marking a paradigm shift from text extraction to contextual understanding.
LlamaIndex Blog
LlamaIndex Newsletter 5-19-26
LlamaIndex introduces ParseBench, the first OCR benchmark designed specifically for AI agents, and open-sources a local document parsing server and a secure sandboxed CLI agent, signaling a shift in document processing toward agent-native infrastructure.
LlamaIndex Blog