Tag: 工程实践 (9 articles)

Engineering TTS Inference in vLLM-Omni

TTS inference is a heterogeneous pipeline combining latency-bound and throughput-bound stages, making traditional LLM optimization strategies ineffective and requiring architecture-aware scheduling.

vLLM Blog · Jun 23, 2026

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

Hugging Face rebuilt its release pipeline using open models and AI agents, automating mechanical tasks with CI, delegating drafting to AI, and keeping human review for final approval to achieve stable weekly releases.

Hugging Face Blog · Jun 23, 2026

olmo-eval: An evaluation workbench for the model development loop

Allen AI releases olmo-eval, shifting evaluation from final benchmarking to an iterative development loop with prompt-level analysis and flexible execution.

Hugging Face Blog · Jun 12, 2026

Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude

Anthropic reverses its controversial policy of silently limiting Claude for frontier LLM research, sparking industry-wide reflection on AI safety transparency and developer trust.

Simon Willison · Jun 11, 2026

AI Agents Are Too Human? A Counter-Intuitive Critique and Its Deeper Implications

An expert critiques current AI agents for being too 'human'—lacking rigor, patience, and focus, and tending to compromise when faced with difficulties, revealing fundamental flaws in their design.

Simon Willison · Apr 22, 2026

Escaping the Fork: How Meta Modernized WebRTC Across 50+ Use Cases

Meta solved the long-term maintenance challenge of a large open-source fork by building a dual-stack architecture and shim layer, enabling continuous upstream synchronization and safe A/B testing.

Meta Engineering Blog ·

From Text to Multimodal Routing: Hardening Vision Signals in vLLM Semantic Router

vLLM Semantic Router discovered that its vision encoder signals were significantly misaligned with the reference model, causing confidently wrong routing decisions, which reveals that signal correctness becomes a critical control-plane requirement as AI systems evolve from processing text to full requests.

vLLM Blog ·

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

The article clarifies the confusion around key AI Agent terms like Harness and Scaffolding, aiming to build a clear, shared mental model for the field.

Hugging Face Blog ·

Vibe coding and agentic engineering are getting closer than I'd like

Veteran engineer Simon Willison observes that as AI coding tools become more reliable, the line he once drew between 'vibe coding' and 'agentic engineering' is blurring, raising new questions about code review responsibility and trust.

Simon Willison ·