Announcing VeRL-Omni: Easy, Fast, and Stable RL Training for Diffusion and Omni-Modality Models
VeRL-Omni is a reinforcement learning training framework designed for multimodal generative models, addressing the engineering challenges of efficient and stable RL training on diffusion and omni-modality models, extending the LLM RL training paradigm to image, video, and audio generation.
vLLM Blog · May 14, 2026