Tag: Agents (18 articles)

Live blog: Code w/ Claude 2026

At Code w/ Claude, Anthropic showcased a shift from a single model to a platform-centric, multi-agent paradigm, with a focus on helping developers build and run complex, long-running agent tasks more efficiently.

Simon Willison · May 6, 2026

Our AI started a cafe in Stockholm

An experiment in which an AI autonomously runs a real-world cafe sparked ethical debate after the agent made absurd purchases and caused trouble for outside parties, revealing a deeper issue: AI agents lack a sense of boundaries in the physical world.

Simon Willison · May 6, 2026

AI evals are becoming the new compute bottleneck

AI evaluation costs are skyrocketing: a single run of an agent benchmark can cost tens of thousands of dollars, and the inherent complexity of these evaluations makes them hard to compress, creating a new compute bottleneck for AI development.

Hugging Face Blog · Apr 30, 2026

Meet HoloTab by HCompany. Your AI browser companion.

HCompany launches HoloTab, a free Chrome extension that simplifies complex web automation into natural language instructions via its 'show once, run anytime' Routines feature, marking the democratization of computer-use AI.

Hugging Face Blog · Apr 15, 2026

How Agentic AI Improves Document Extraction Accuracy and Automation

The article argues that by introducing a 'plan-act-verify' agent loop, document processing is shifting from mechanical pattern matching to a cognitive task with spatial awareness and contextual reasoning, breaking through the limitations of traditional OCR.

LlamaIndex Blog

Introducing Claude Opus 4.7

Anthropic releases Claude Opus 4.7, focusing on enhanced complex coding and long-running task capabilities, with its 'self-verification' mechanism marking a key step towards more autonomous AI agents.

Anthropic News

Introducing Claude for Small Business

Anthropic launches Claude for Small Business, embedding AI into daily operational tools via pre-built connectors and workflows to address the shallow adoption of AI in small businesses.

Anthropic News

Microsoft Copilot Cowork Exfiltrates Files

A critical security flaw in Microsoft Copilot Cowork allowed attackers to exfiltrate user files via prompt injection by exploiting auto-sent emails and pre-authenticated download links.

Simon Willison

Serving Agentic Workloads at Scale with vLLM x Mooncake

vLLM integrates Mooncake's distributed KV cache to solve the bottleneck of recomputing long context prefixes in agentic workloads, achieving a 3.8x throughput increase and a 46x reduction in time-to-first-token.

vLLM Blog

Your harness, your memory

The article argues that agent harnesses are inextricably tied to memory; using a closed or API-based harness means ceding control of your agent's memory to a third party, creating deep lock-in. Memory should be open.

LangChain Blog