LLM 0.32a0 is a major backwards-compatible refactor
Simon Willison's LLM library undergoes a major refactor, evolving from simple text prompts/responses to a structure supporting multi-turn message sequences and streaming mixed-type responses, adapting to modern LLMs' multimodal and tool-calling capabilities.
Simon Willison · Apr 30, 2026
Run Highly Efficient Multimodal Agentic AI with NVIDIA Nemotron 3 Nano Omni Using vLLM
NVIDIA releases the open-source multimodal model Nemotron 3 Nano Omni, which uses a Mixture of Experts architecture to activate only 3B of its 30B parameters, achieving 9x higher throughput than comparable models to solve efficiency and fragmentation issues in multimodal AI agents.
vLLM Blog · Apr 28, 2026
Agentic Document Processing: How AI Agents Are Automating Complex Workflows
The article explains that traditional document automation tools only extract text, while Agentic Document Processing uses AI Agents to understand document context, make autonomous decisions, and connect to downstream systems, enabling end-to-end intelligent workflow automation.
LlamaIndex Blog ·
LlamaIndex Newsletter 2026-04-14
LlamaIndex releases ParseBench, the first OCR benchmark for AI agents, alongside tools tackling structural loss and security in document parsing, marking a paradigm shift from text extraction to contextual understanding.
LlamaIndex Blog ·