Tag: Optical Character Recognition (3 articles)

Building a Fast Multilingual OCR Model with Synthetic Data

NVIDIA trained the Nemotron OCR v2 model on 12 million synthetic images, reaching a normalized edit distance (NED) as low as 0.035 and a throughput of 34.7 pages/second on a single A100 GPU across six languages. The result suggests that synthetic data can relieve the multilingual data bottleneck in OCR.

Hugging Face Blog · Apr 18, 2026

How Agentic AI Improves Document Extraction Accuracy and Automation

The article argues that a 'plan-act-verify' agent loop shifts document processing from mechanical pattern matching to a reasoning task that draws on spatial awareness and context, overcoming the limitations of traditional OCR.

LlamaIndex Blog ·