Tag

#flow-grpo

1 article

Microsoft Research’s World-R1 Uses Flow-GRPO and 3D-Aware Rewards to Inject Geometric Consistency Into Wan 2.1 Without Architectural Changes

This article explains how Microsoft Research's World-R1 uses reinforcement learning and 3D-aware rewards to improve geometric consistency in text-to-video generation without changing the underlying model architecture.

Apr 3043