Add Wan2.2-Animate: Unified Character Animation and Replacement with Holistic Replication #12442

tolgacangoz · 2025-10-06T18:56:19Z

This PR is fixing #12441.

Project Page: https://humanaigc.github.io/wan-animate/

TODOs:

⏳ WanAnimatePipeline
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Try WanAnimatePipeline!

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

- Introduced WanAnimateTransformer3DModel and WanAnimatePipeline. - Updated get_transformer_config to handle the new model type. - Modified convert_transformer to instantiate the correct transformer based on model type. - Adjusted main execution logic to accommodate the new Animate model type.

…l guidance

…prove error handling for undefined parameters

…work for character animation and replacement - Added Wan 2.2 Animate 14B model to the documentation. - Introduced the Wan-Animate framework, detailing its capabilities for character animation and replacement. - Included example usage for the WanAnimatePipeline with preprocessing steps and guidance on input requirements.

- Introduced `WanAnimateGGUFSingleFileTests` to validate functionality. - Added dummy input generation for testing model behavior.

- Introduced `EncoderApp`, `Encoder`, `Direction`, `Synthesis`, and `Generator` classes for enhanced motion and appearance encoding. - Added `FaceEncoder`, `FaceBlock`, and `FaceAdapter` classes to integrate facial motion processing. - Updated `WanTimeTextImageMotionEmbedding` to utilize the new `Generator` for motion embedding. - Enhanced `WanAnimateTransformer3DModel` with additional face adapter and pose patch embedding for improved model functionality.

- Introduced `pad_video` method to handle padding of video frames to a target length. - Updated video processing logic to utilize the new padding method for `pose_video`, `face_video`, and conditionally for `background_video` and `mask_video`. - Ensured compatibility with existing preprocessing steps for video inputs.

…roved video processing - Added optional parameters: `conditioning_pixel_values`, `refer_pixel_values`, `refer_t_pixel_values`, `bg_pixel_values`, and `mask_pixel_values` to the `prepare_latents` method. - Updated the logic in the denoising loop to accommodate the new parameters, enhancing the flexibility and functionality of the pipeline.

…eneration - Updated the calculation of `num_latent_frames` and adjusted the shape of latent tensors to accommodate changes in frame processing. - Enhanced the `get_i2v_mask` method for better mask generation, ensuring compatibility with new tensor shapes. - Improved handling of pixel values and device management for better performance and clarity in the video processing pipeline.

…and mask generation - Consolidated the handling of `pose_latents_no_ref` to improve clarity and efficiency in latent tensor calculations. - Updated the `get_i2v_mask` method to accept batch size and adjusted tensor shapes accordingly for better compatibility. - Enhanced the logic for mask pixel values in the replacement mode, ensuring consistent processing across different scenarios.

…nced processing - Introduced custom QR decomposition and fused leaky ReLU functions for improved tensor operations. - Implemented upsampling and downsampling functions with native support for better performance. - Added new classes: `FusedLeakyReLU`, `Blur`, `ScaledLeakyReLU`, `EqualConv2d`, `EqualLinear`, and `RMSNorm` for advanced neural network layers. - Refactored `EncoderApp`, `Generator`, and `FaceBlock` classes to integrate new functionalities and improve modularity. - Updated attention mechanism to utilize `dispatch_attention_fn` for enhanced flexibility in processing.

tolgacangoz added 5 commits October 6, 2025 21:46

template1

3529a0a

temp2

4f2ee5e

up

778fb54

up

d77b6ba

fix-copies

2fc6ac2

tolgacangoz changed the title ~~Add Wan-Animate: Unified Character Animation and Replacement with Holistic Replication~~ Add Wan2.2-Animate: Unified Character Animation and Replacement with Holistic Replication Oct 6, 2025

tolgacangoz and others added 23 commits October 7, 2025 11:14

style

6182d44

Refactor WanAnimate model components

8c9fd89

Enhance WanAnimatePipeline with new parameters for mode and tempora…

d01e941

…l guidance

Update WanAnimatePipeline to require additional video inputs and im…

7af953b

…prove error handling for undefined parameters

Add unit test template for WanAnimatePipeline functionality

05a01c6

Add unit tests for WanAnimateTransformer3DModel in GGUF format

22b83ce

- Introduced `WanAnimateGGUFSingleFileTests` to validate functionality. - Added dummy input generation for testing model behavior.

style

7fb6732

Update WanAnimatePipeline

624a314

style

fc0edb5

Refactor test for WanAnimatePipeline to include new input structure

eb7eedd

from einops to torch

8968b42

Merge branch 'main' into integrations/wan2.2-animate

dce83a8

style

802896e

up

84768f6

style

b8337c6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Wan2.2-Animate: Unified Character Animation and Replacement with Holistic Replication #12442

Add Wan2.2-Animate: Unified Character Animation and Replacement with Holistic Replication #12442

tolgacangoz commented Oct 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

Add Wan2.2-Animate: Unified Character Animation and Replacement with Holistic Replication #12442

Are you sure you want to change the base?

Add Wan2.2-Animate: Unified Character Animation and Replacement with Holistic Replication #12442

Conversation

tolgacangoz commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Who can review?

Uh oh!

Uh oh!

tolgacangoz commented Oct 6, 2025 •

edited

Loading