Tag: AI Infrastructure (7 articles)

vLLM Tops the Artificial Analysis Leaderboard

The open-source inference engine vLLM outperforms all proprietary competitors on multiple frontier-model inference benchmarks, thanks to deep kernel-fusion optimizations tailored to each model's specific bottlenecks.

vLLM Blog · May 11, 2026

DeepSeek V4 in vLLM: Efficient Long-context Attention

vLLM announces support for DeepSeek V4 models, featuring a novel attention mechanism that tackles the core challenges of memory and computational cost in million-token long-context inference.

vLLM Blog · Apr 24, 2026
BitByAI — AI-powered, AI-evolved AI News