
Wan Animate Pipeline#367

Open
csgoogle wants to merge 1 commit intomainfrom
sagarchapara/wananimate-pipeline

Conversation

@csgoogle
Collaborator

@csgoogle csgoogle commented Mar 28, 2026

Wan Animate Pipeline

This CL adds the Wan Animate pipeline.

  • Reused the existing Wan attention operator for face encoder cross attention.
  • Swept Flash Attention block-size configurations to identify the best inference setting.
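The block-size sweep described above can be sketched roughly as follows. This is an illustrative stand-in, not the PR's actual code: `chunked_attention` and `sweep_block_sizes` are hypothetical names, the candidate tile sizes are made up, and a plain chunked softmax attention stands in for the real Wan Flash Attention kernel, which would take `(block_q, block_kv)` tile parameters directly.

```python
import time
import jax
import jax.numpy as jnp

def chunked_attention(q, k, v, block_q):
    # Reference attention computed in query chunks of size block_q.
    # Illustrative only: the real kernel is the Wan Flash Attention
    # operator, whose tile sizes change performance, not results.
    outs = []
    for start in range(0, q.shape[0], block_q):
        qb = q[start:start + block_q]
        scores = qb @ k.T / jnp.sqrt(jnp.float32(q.shape[-1]))
        outs.append(jax.nn.softmax(scores, axis=-1) @ v)
    return jnp.concatenate(outs)

def sweep_block_sizes(q, k, v, candidates=(64, 128, 256)):
    # Time each candidate configuration after a warm-up/compile call
    # and keep the fastest one.
    best_bq, best_dt = None, float("inf")
    for bq in candidates:
        fn = jax.jit(lambda q, k, v, bq=bq: chunked_attention(q, k, v, bq))
        jax.block_until_ready(fn(q, k, v))  # warm-up: trigger compilation
        t0 = time.perf_counter()
        jax.block_until_ready(fn(q, k, v))
        dt = time.perf_counter() - t0
        if dt < best_dt:
            best_bq, best_dt = bq, dt
    return best_bq
```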

Links

Performance

  • compile_time: 292.74
  • generation_time: 157.69

Configuration

  • cp: 8 (v6e8)
  • cfg: 1.0
  • prev_segments: 5
  • resolution: 1280x720
  • fps: 24
  • generated_frames: 77


@csgoogle csgoogle marked this pull request as ready for review April 6, 2026 16:33
@csgoogle csgoogle requested a review from entrpn as a code owner April 6, 2026 16:33
@csgoogle csgoogle force-pushed the sagarchapara/wananimate-pipeline branch from 67233e9 to e281524 Compare April 13, 2026 08:49
@csgoogle csgoogle force-pushed the sagarchapara/wananimate-pipeline branch from e281524 to 349d080 Compare April 13, 2026 09:10
```python
sigmas = 1.0 - alphas
sigmas = jnp.flip(self.config.flow_shift * sigmas / (1 + (self.config.flow_shift - 1) * sigmas))[:-1].copy()
timesteps = (sigmas * self.config.num_train_timesteps).copy().astype(jnp.int64)
sigmas = jnp.linspace(1.0, 1.0 / self.config.num_train_timesteps, num_inference_steps + 1)[:-1]
```
Collaborator

LGTM

```python
sigmas = self.config.flow_shift * sigmas / (1 + (self.config.flow_shift - 1) * sigmas)
eps = 1e-6
sigmas = sigmas.at[0].set(jnp.where(jnp.abs(sigmas[0] - 1.0) < eps, sigmas[0] - eps, sigmas[0]))
timesteps = (sigmas * self.config.num_train_timesteps).copy().astype(jnp.int32)
```
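A minimal sketch of why the `eps` clamp matters (with assumed, illustrative values `num_train_timesteps=1000` and `flow_shift=5.0`; the PR's actual config may differ): one plausible reading is that without the clamp, `sigmas[0]` is exactly 1.0, so the derived timestep equals `num_train_timesteps`, one past the last valid index.

```python
import jax.numpy as jnp

num_train_timesteps = 1000  # assumed value
flow_shift = 5.0            # assumed value
eps = 1e-6

sigmas = jnp.linspace(1.0, 1.0 / num_train_timesteps, 51)[:-1]
sigmas = flow_shift * sigmas / (1 + (flow_shift - 1) * sigmas)
# Without the clamp, sigmas[0] == 1.0 and the first timestep would
# be num_train_timesteps itself; the clamp nudges it just below 1.0.
sigmas = sigmas.at[0].set(
    jnp.where(jnp.abs(sigmas[0] - 1.0) < eps, sigmas[0] - eps, sigmas[0])
)
timesteps = (sigmas * num_train_timesteps).astype(jnp.int32)
```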
Collaborator

Why move from jnp.int64 to jnp.int32?

Collaborator Author

Timesteps fit within int32, so I cast to int32 only.
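For reference, the cast is safe because every timestep is bounded by `num_train_timesteps` (1000 below is an assumed value), far below the int32 ceiling. Note also that under JAX's default configuration (`jax_enable_x64=False`), a requested int64 dtype is canonicalized down to int32, so the earlier `astype(jnp.int64)` was likely never taking effect anyway.

```python
import jax.numpy as jnp

num_train_timesteps = 1000  # assumed config value
sigmas = jnp.linspace(1.0, 1.0 / num_train_timesteps, 50)
timesteps = (sigmas * num_train_timesteps).astype(jnp.int32)

# Timesteps live in [1, num_train_timesteps], nowhere near the
# int32 maximum of 2**31 - 1, so int32 loses nothing here.
assert int(timesteps.max()) <= num_train_timesteps

# With x64 disabled (the JAX default), an int64 request is
# silently canonicalized to int32.
assert (sigmas * num_train_timesteps).astype(jnp.int64).dtype == jnp.int32
```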

Collaborator

Could we move the assets to a public GCS path or use an existing hf dataset link?

Comment on lines +960 to +993
```python
noise_pred = animate_transformer_forward_pass(
    graphdef,
    state,
    rest_of_state,
    seg_latents,
    reference_latents,
    pose_latents,
    face_seg,
    timestep,
    prompt_embeds,
    image_embeds,
    motion_encode_batch_size=motion_encode_batch_size,
)

if do_classifier_free_guidance:
    # Blank face pixels (all -1) for the unconditional pass.
    face_seg_uncond = face_seg * 0 - 1
    noise_uncond = animate_transformer_forward_pass(
        graphdef,
        state,
        rest_of_state,
        seg_latents,
        reference_latents,
        pose_latents,
        face_seg_uncond,
        timestep,
        negative_prompt_embeds,
        image_embeds,
        motion_encode_batch_size=motion_encode_batch_size,
    )
    noise_pred = noise_uncond + guidance_scale * (noise_pred - noise_uncond)

noise_pred = noise_pred.astype(seg_latents.dtype)
seg_latents, scheduler_state = self.scheduler.step(scheduler_state, noise_pred, t, seg_latents, return_dict=False)
```
Collaborator

Can we batch the cfg and prompt?

Ref: 1 and 2
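The suggested batching could look roughly like this sketch, with a toy stand-in for `animate_transformer_forward_pass` (the real change would stack `face_seg`/`face_seg_uncond` and the prompt embeddings along a new leading axis and run one forward pass instead of two; names and shapes here are illustrative):

```python
import jax
import jax.numpy as jnp

def forward(latents, embeds):
    # Toy stand-in for animate_transformer_forward_pass.
    return latents + embeds.mean()

def cfg_two_passes(latents, cond, uncond, scale):
    # Current pattern: two sequential forward passes.
    noise_cond = forward(latents, cond)
    noise_uncond = forward(latents, uncond)
    return noise_uncond + scale * (noise_cond - noise_uncond)

def cfg_batched(latents, cond, uncond, scale):
    # Suggested pattern: stack the conditional and unconditional
    # inputs on a leading axis and run one batched forward pass.
    embeds = jnp.stack([cond, uncond])
    lat = jnp.broadcast_to(latents, (2, *latents.shape))
    noise_cond, noise_uncond = jax.vmap(forward)(lat, embeds)
    return noise_uncond + scale * (noise_cond - noise_uncond)
```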

```python
sigmas = 1.0 - alphas
sigmas = jnp.flip(self.config.flow_shift * sigmas / (1 + (self.config.flow_shift - 1) * sigmas))[:-1].copy()
timesteps = (sigmas * self.config.num_train_timesteps).copy().astype(jnp.int64)
sigmas = jnp.linspace(1.0, 1.0 / self.config.num_train_timesteps, num_inference_steps + 1)[:-1]
```
Collaborator

LGTM

```python
    f"{_frame_summary('mask', mask_video)}"
)

animate_settings = _get_animate_inference_settings(config)
```
Collaborator

Could you also add LoRA support?

Collaborator

I wonder if there is a need for a separate generate script? Can we add this to existing generate_wan.py file?

@Perseus14
Collaborator

Please resolve conflicts and enable support for diagnostics and profiling as in this PR
