PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend
PaddleOCR 3.5 adds a Transformers inference backend, enabling developers to seamlessly use its OCR and document parsing models within the Hugging Face ecosystem, lowering integration barriers for building applications like RAG.
Hugging Face Blog · May 18, 2026
Agentic OCR for Receipts: Why Traditional Pipelines Break
The article argues that receipt processing is not a simple OCR task but a document intelligence challenge that stress-tests systems with non-standard, complex layouts, where traditional rule-based pipelines break down and AI agent-driven architectures prove more robust.
LlamaIndex Blog ·
How Agentic AI Improves Document Extraction Accuracy and Automation
The article explains how Agentic AI overcomes the limitations of template-based OCR by mimicking human expert reasoning through a 'plan-act-verify' loop, enabling robust document understanding and automation.
LlamaIndex Blog ·
LlamaIndex Newsletter 5-19-26
LlamaIndex introduces ParseBench, the first OCR benchmark designed specifically for AI agents, alongside open-sourcing a local document parsing server and a secure sandboxed CLI agent, signaling a shift in document processing towards agent-native infrastructure.
LlamaIndex Blog ·
OCR for Tables: How to Extract Structured Data from Documents
The article delves into the challenges of extracting table data from documents, highlighting that it's not just about character recognition, but also involves layout analysis, structural reconstruction, and contextual reasoning, marking a key step towards intelligent document processing.
LlamaIndex Blog ·