Tag: 模型效率 (3 articles)

EMO: Pretraining mixture of experts for emergent modularity

AI2 releases EMO, a new MoE model pretrained to enable emergent modularity, allowing users to selectively use just 12.5% of experts for a task while maintaining near full-model performance.

Hugging Face Blog · May 9, 2026

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

Alibaba's Qwen releases Qwen3.6-27B, a dense 27B parameter model that outperforms the previous generation's 397B MoE flagship on coding benchmarks, signaling a turning point for efficient, local-first coding models.

Simon Willison · Apr 23, 2026

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video 智能体s

NVIDIA releases its omni-modal understanding model Nemotron 3 Nano Omni, setting new open-source benchmarks across document, audio-video understanding, and agentic tasks, while delivering significantly higher efficiency than comparable models.

Hugging Face Blog ·