EMO: Pretraining mixture of experts for emergent modularity
AI2 releases EMO, a new MoE model pretrained for emergent modularity, letting users run just 12.5% of its experts on a given task while retaining near-full-model performance.
Hugging Face Blog · May 9, 2026
Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model
Alibaba's Qwen releases Qwen3.6-27B, a dense 27B-parameter model that outperforms the previous generation's 397B MoE flagship on coding benchmarks, signaling a turning point for efficient, local-first coding models.
Simon Willison · Apr 23, 2026
Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents
NVIDIA releases Nemotron 3 Nano Omni, an omni-modal understanding model that sets new open-source benchmark results in document understanding, audio-video understanding, and agentic tasks while running significantly more efficiently than comparable models.
Hugging Face Blog ·