Tag: 长上下文 (2 articles)

GLM-5.2: Built for Long-Horizon Tasks

Z.ai releases GLM-5.2, the first open-source model to achieve stable 1M-token context and rival top closed-source models on long-horizon coding benchmarks.

Hugging Face Blog · Jun 17, 2026

DeepSeek V4 in vLLM: Efficient Long-context Attention

DeepSeek V4 achieves efficient million-token long-context inference on vLLM through innovative KV cache compression and sparse attention mechanisms, marking a new era for long-text processing.

vLLM Blog ·