Tag: 模型训练 (4 articles)

Granite 4.1 LLMs: How They’re Built

IBM's Granite 4.1 series demonstrates that a meticulously engineered data pipeline and multi-stage training can enable an 8B dense model to match or exceed the performance of a previous 32B MoE model, highlighting a paradigm shift where data quality trumps parameter count.

Hugging Face Blog · Apr 29, 2026

Introducing talkie: a 13B vintage language model from 1930

A 13B model trained exclusively on pre-1931 text aims to explore AI's reasoning, creativity, and 're-discovery' abilities within knowledge boundaries, sparking new discussions on data copyright and model purity.

Simon Willison · Apr 28, 2026

Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational 智能体s

This work extends reinforcement learning environments from logic puzzles to e-commerce conversations, using 8 algorithmically verifiable scenarios to train AI agents from 'chatting well' to 'getting things done'.

Hugging Face Blog · Apr 16, 2026

TRL v1.0: Post-Training Library Built to Move with the Field

The release of TRL v1.0 marks a significant shift in post-training libraries, designed to cope with the rapidly changing AI landscape while offering a stable yet experimental development environment.

Hugging Face Blog · Mar 31, 2026