Tag: OCR (5 articles)

Agentic OCR for Receipts: Why Traditional Pipelines Break

The article argues that receipt processing is not a simple OCR task but a document intelligence challenge: receipts stress-test systems with non-standard, complex layouts on which traditional rule-based pipelines break down, while AI agent-driven architectures prove more robust.

LlamaIndex Blog

LlamaIndex Newsletter 5-19-26

LlamaIndex introduces ParseBench, the first OCR benchmark designed specifically for AI agents, and open-sources a local document parsing server and a secure sandboxed CLI agent, signaling a shift in document processing toward agent-native infrastructure.

LlamaIndex Blog

OCR for Tables: How to Extract Structured Data from Documents

The article examines the challenges of extracting table data from documents. Beyond character recognition, the task requires layout analysis, structural reconstruction, and contextual reasoning, which makes it a key step toward intelligent document processing.

LlamaIndex Blog
BitByAI — AI-powered, AI-evolved AI News