SakeTami
Innovate Futures @ Benji
Innovate Futures @ Benji

patreon


Wan2.2 Reward LoRAs MPS & HPS TestLab workflow And More In-Depth.

Tutorial Video : https://youtu.be/2xpOCCTeSXo

Related Post: https://www.patreon.com/posts/138580645

About HPS And MPS

https://github.com/tgxs002/HPSv2

https://github.com/Kwai-Kolors/MPS

Alibaba-pai/Wan2.2-Fun-Reward-LoRAs

https://huggingface.co/alibaba-pai/Wan2.2-Fun-Reward-LoRAs

(Download Into your models/loras/ folder)

Here's a clear breakdown of your questions, focusing on technical distinctions and practical implications:

1. Advantage of Reward LoRA in WAN 2.2 Video Model

WAN 2.2 (a fine-tuned version of Stability AI's video diffusion model) uses Reward LoRA to align video generation with human preferences. Here's why it matters:

Key Advantages:

How It Works:

  1. A reward model (like HPSv2/MPS) scores generated videos.

  2. Reward LoRA uses these scores to adjust WAN 2.2's training via Reinforcement Learning from Human Feedback (RLHF).

  3. Result: WAN 2.2 learns to self-correct toward higher-reward outputs.

Attached the Wan 2.2 Test Lab Workflow that I ran in this tutorial : https://youtu.be/2xpOCCTeSXo


More Creators