Granite 4.1 LLMs: How They’re Built
IBM's Granite 4.1 series demonstrates that a meticulously engineered data pipeline and multi-stage training can enable an 8B dense model to match or exceed the performance of a previous 32B MoE model, highlighting a paradigm shift where data quality trumps parameter count.
Hugging Face Blog · Apr 29, 2026
Introducing talkie: a 13B vintage language model from 1930
A 13B model trained exclusively on pre-1931 text aims to explore AI's reasoning, creativity, and 're-discovery' abilities within knowledge boundaries, sparking new discussions on data copyright and model purity.
Simon Willison · Apr 28, 2026
Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents
This work extends reinforcement learning environments from logic puzzles to e-commerce conversations, using 8 algorithmically verifiable scenarios to train AI agents from 'chatting well' to 'getting things done'.
Hugging Face Blog · Apr 16, 2026
TRL v1.0: Post-Training Library Built to Move with the Field
The release of TRL v1.0 marks a significant shift in post-training libraries, designed to cope with the rapidly changing AI landscape while offering a stable yet experimental development environment.
Hugging Face Blog · Mar 31, 2026