[feat] World model training using third person games by mignonjia · Pull Request #1443 · hao-ai-lab/FastVideo

mignonjia · 2026-06-08T23:02:03Z

Purpose

Add world-model training helpers for MatrixGame2/Zelda, including LongLive-style streaming long tuning, Zelda validation utilities, action overlays, and synthetic optical-flow validation wiring.

Fixes #

Changes

Add StreamingLongTuningMethod for LongLive-style streaming long tuning from a self-forcing checkpoint, with long overlapping chunks and streaming context.
Add MatrixGame2 causal attention sink support for long-context streaming.
Add Zelda world-model configs, validation metric wiring, and synthetic optical-flow calibration.
Add validation action overlays and artifact-save handling.
Document Zelda training/validation data downloads and the suggested data/zeldam2-clean local path.
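
As a rough illustration of what "long overlapping chunks" could look like, the sketch below splits a latent sequence into fixed-size windows that share a fixed number of latents with the previous window. The function name `overlapping_chunks` and its parameters are hypothetical and not taken from this PR; the actual StreamingLongTuningMethod may schedule its chunks differently.

```python
from typing import List, Tuple


def overlapping_chunks(total: int, chunk_size: int, overlap: int) -> List[Tuple[int, int]]:
    """Return [start, end) windows of length `chunk_size` that overlap by `overlap`.

    Hypothetical helper for illustration only. A trailing partial window is
    dropped here; a real scheduler might pad or shorten it instead.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    stride = chunk_size - overlap  # number of new latents contributed by each window
    windows: List[Tuple[int, int]] = []
    start = 0
    while start + chunk_size <= total:
        windows.append((start, start + chunk_size))
        start += stride
    return windows
```

With 10 latents, a chunk size of 4, and an overlap of 2, each window starts 2 latents after the previous one, so consecutive windows share their boundary latents.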

Test Plan

PYTHONPATH=. pytest fastvideo/tests/train/methods/test_streaming_long_tuning.py -q
PYTHONPATH=. pytest fastvideo/tests/train/callbacks/test_validation.py -q
pre-commit run --files docs/training/train_infra.md examples/train/scenario/worldmodel/README.md examples/train/scenario/worldmodel/self_forcing_causal_i2v_zelda.yaml examples/train/scenario/worldmodel/streaming_long_tuning_causal_i2v.yaml fastvideo/train/methods/distribution_matching/streaming_long_tuning.py fastvideo/tests/train/callbacks/test_validation.py fastvideo/tests/train/methods/test_streaming_long_tuning.py PR_doc.md

Test Results

Test output

PYTHONPATH=. pytest fastvideo/tests/train/methods/test_streaming_long_tuning.py -q
..                                                                       [100%]
2 passed, 14 warnings in 0.55s

PYTHONPATH=. pytest fastvideo/tests/train/callbacks/test_validation.py -q
...........................                                              [100%]
27 passed, 14 warnings in 0.56s

pre-commit run --files ...
yapf.....................................................................Passed
ruff (legacy alias)......................................................Passed
codespell................................................................Passed
PyMarkdown...............................................................Passed
Lint GitHub Actions workflow files...................(no files to check)Skipped
mypy.....................................................................Passed
Check for spaces in all filenames........................................Passed
Suggestion...............................................................Passed

Review Notes

Kept the independent no-grad critic rollout in _losses_for_batch, matching existing DMD2Method and SelfForcingMethod behavior.
Left _dmd_loss_masked denominator unchanged because it already clamps with clamp_min(1e-6).

Checklist

I ran pre-commit on the changed PR files and fixed all issues
I added or updated tests for my changes
I updated documentation if needed
I considered GPU memory impact of my changes

For model/pipeline changes, also check:

I verified SSIM regression tests pass
I updated the support matrix if adding a new model

mergify · 2026-06-08T23:02:43Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🔴 PR merge requirements

Waiting for

#approved-reviews-by>=1
check-success=fastcheck-passed
check-success=full-suite-passed
check-success~=pre-commit

This rule is failing.

#approved-reviews-by>=1
check-success=fastcheck-passed
check-success=full-suite-passed
check-success~=pre-commit
title~=(?i)^\[(feat|feature|bugfix|fix|refactor|perf|ci|doc|docs|misc|chore|kernel|new.?model|skill|skills|infra)\]
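
The title rule above can be checked locally before pushing. The sketch below assumes the pattern behaves the same under Python's `re` module as it does in Mergify; the helper name `title_passes` is hypothetical.

```python
import re

# Pattern copied from the merge-protection rule; the leading (?i) makes it case-insensitive.
TITLE_RULE = re.compile(
    r"(?i)^\[(feat|feature|bugfix|fix|refactor|perf|ci|doc|docs|misc|chore|kernel"
    r"|new.?model|skill|skills|infra)\]"
)


def title_passes(title: str) -> bool:
    """Return True when the PR title starts with one of the accepted [tag] prefixes."""
    return TITLE_RULE.search(title) is not None
```

Under this assumption, an untagged title such as "World model training" fails the rule, while "[feat] World model training using third person games" passes.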

gemini-code-assist

Code Review

This pull request introduces a LongLive-style multi-stage self-forcing method (StreamingLongTuningMethod) for streaming rollouts, along with training-time validation metrics and action overlays (keyboard/mouse) on validation frames. It also adds several configurations, documentation, and tests supporting these features. The review feedback highlights three key areas for improvement: optimizing the student rollout in _losses_for_batch to avoid redundant computations when update_student is enabled, clamping the denominator in _dmd_loss_masked to prevent division-by-zero and gradient explosion, and replacing copy.deepcopy with copy.copy on attn_metadata to avoid performance and memory overhead.

gemini-code-assist · 2026-06-08T23:04:26Z

+        if update_student:
+            generator_pred_x0 = self._student_rollout(
+                training_batch,
+                with_grad=True,
+            )
+            student_ctx = (
+                training_batch.timesteps,
+                training_batch.attn_metadata_vsa,
+            )
+            generator_loss = self._dmd_loss_masked(
+                generator_pred_x0,
+                training_batch,
+                chunk_mask=chunk_mask,
+            )
+
+        with torch.no_grad():
+            generator_pred_x0 = self._student_rollout(
+                training_batch,
+                with_grad=False,
+            )
+
+        fake_score_loss, critic_ctx, critic_outputs = (self._critic_flow_matching_loss_for_x0(
+            generator_pred_x0,
+            training_batch,
+            chunk_mask=chunk_mask,
+        ))


When update_student is True, the student rollout is executed twice: once with gradients enabled (with_grad=True) and once with gradients disabled (with_grad=False). Since DiT rollouts are computationally expensive, we can optimize this by only running the second rollout when update_student is False, and simply detaching generator_pred_x0 when update_student is True.

        if update_student:
            generator_pred_x0 = self._student_rollout(
                training_batch,
                with_grad=True,
            )
            student_ctx = (
                training_batch.timesteps,
                training_batch.attn_metadata_vsa,
            )
            generator_loss = self._dmd_loss_masked(
                generator_pred_x0,
                training_batch,
                chunk_mask=chunk_mask,
            )
            critic_pred_x0 = generator_pred_x0.detach()
        else:
            with torch.no_grad():
                critic_pred_x0 = self._student_rollout(
                    training_batch,
                    with_grad=False,
                )

        fake_score_loss, critic_ctx, critic_outputs = (self._critic_flow_matching_loss_for_x0(
            critic_pred_x0,
            training_batch,
            chunk_mask=chunk_mask,
        ))
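
The compute trade-off between the two control flows can be sketched with a stub that only counts rollouts. Everything here (`RolloutCounter`, `independent_rollouts`, `reused_rollout`) is a hypothetical stand-in written for illustration; plain lists take the place of tensors, and copying the list stands in for `.detach()`.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RolloutCounter:
    """Stub model that records how many rollouts were requested."""
    calls: int = 0

    def rollout(self, with_grad: bool) -> List[float]:
        self.calls += 1
        return [0.0, 1.0]  # placeholder for a predicted x0 tensor


def independent_rollouts(model: RolloutCounter, update_student: bool) -> List[float]:
    """Pattern shown in the diff: the critic always gets its own no-grad rollout."""
    if update_student:
        model.rollout(with_grad=True)  # used for the generator loss
    return model.rollout(with_grad=False)  # used for the critic loss


def reused_rollout(model: RolloutCounter, update_student: bool) -> List[float]:
    """Reviewer's suggestion: reuse the grad rollout for the critic when available."""
    if update_student:
        pred = model.rollout(with_grad=True)
        return list(pred)  # stands in for pred.detach()
    return model.rollout(with_grad=False)
```

In this stub the reuse variant performs one rollout per student update instead of two. The two variants are not equivalent if the rollout is stochastic, since the independent variant feeds the critic a separate sample; the Review Notes in the PR description state that the independent rollout was kept to match DMD2Method and SelfForcingMethod.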

gemini-code-assist · 2026-06-08T23:04:26Z

+            denom = torch.abs(generator_pred_x0 - real_cfg_x0).mean()
+            grad = (faker_x0 - real_cfg_x0) / denom


If generator_pred_x0 and real_cfg_x0 are identical or extremely close, denom will be zero or near-zero. This causes division by zero, resulting in inf or NaN values in grad. While torch.nan_to_num is called afterwards, it replaces inf with the maximum representable float value (e.g., 3.4e38), which will cause overflow and gradient explosion during the backward pass. Clamping denom with a small epsilon (e.g., 1e-8) prevents this instability.

Suggested change

-            denom = torch.abs(generator_pred_x0 - real_cfg_x0).mean()
-            grad = (faker_x0 - real_cfg_x0) / denom
+            denom = torch.abs(generator_pred_x0 - real_cfg_x0).mean().clamp_min(1e-8)
+            grad = (faker_x0 - real_cfg_x0) / denom
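
The effect of clamping the denominator can be shown with plain Python floats standing in for tensors. This is a minimal sketch, not the PR's `_dmd_loss_masked`: the function name `normalized_grad` is hypothetical, `max(denom, eps)` plays the role of `clamp_min(eps)`, and the default `eps=1e-6` is the value the Review Notes say the code already uses. In PyTorch, the unclamped division would produce `inf` or `NaN` instead of raising an exception.

```python
from typing import List, Sequence


def normalized_grad(
    fake: Sequence[float],
    real: Sequence[float],
    pred: Sequence[float],
    eps: float = 1e-6,
) -> List[float]:
    """Compute (fake - real) / max(mean(|pred - real|), eps) element-wise."""
    denom = sum(abs(p - r) for p, r in zip(pred, real)) / len(real)
    denom = max(denom, eps)  # scalar analogue of tensor.clamp_min(eps)
    return [(f - r) / denom for f, r in zip(fake, real)]
```

When `pred` equals `real`, the mean absolute difference is zero and the clamp keeps the result finite, bounded by roughly `|fake - real| / eps`.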

gemini-code-assist · 2026-06-08T23:04:26Z

+        batch.attn_metadata = None
+        batch.attn_metadata_vsa = None
+        build_metadata(batch)
+        batch.attn_metadata_vsa = copy.deepcopy(batch.attn_metadata)


Using copy.deepcopy on attn_metadata can be extremely slow and memory-intensive because it deep-copies the underlying PyTorch tensors. As documented in MatrixGame2Model.prepare_batch, a shallow copy (copy.copy) should be used instead to keep the lru_cache'd index fields shared while allowing modifications to metadata fields like VSA_sparsity.

Suggested change

-        batch.attn_metadata_vsa = copy.deepcopy(batch.attn_metadata)
+        batch.attn_metadata_vsa = copy.copy(batch.attn_metadata)
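
The difference between the two copy modes can be seen with the standard library alone. `FakeAttnMetadata` below is a hypothetical stand-in, not the real attention-metadata class; a list plays the role of the cached index tensors.

```python
import copy
from dataclasses import dataclass, field
from typing import List


@dataclass
class FakeAttnMetadata:
    """Hypothetical stand-in for an attention-metadata object."""
    index: List[int] = field(default_factory=lambda: [0, 1, 2])  # large, cacheable field
    VSA_sparsity: float = 0.0  # small scalar that differs per use


base = FakeAttnMetadata()
shallow = copy.copy(base)      # new object, nested fields shared with `base`
deep = copy.deepcopy(base)     # new object, nested fields duplicated

# Rebinding a field on the shallow copy does not touch `base`.
shallow.VSA_sparsity = 0.9
```

A shallow copy is only safe if the shared fields are never mutated in place: appending to `shallow.index` here would also change `base.index`.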

mergify · 2026-06-10T03:48:04Z

Pre-commit checks failed

Hi @mignonjia, the pre-commit checks have failed. To fix them locally:

# Install pre-commit if you haven't already
uv pip install pre-commit
pre-commit install

# Run all checks and auto-fix what's possible
pre-commit run --all-files

Common fixes:

yapf: yapf -i <file> (formatting)
ruff: ruff check --fix <file> (linting)
codespell: codespell --write-changes <file> (spelling)

After fixing, commit and push the changes. The checks will re-run automatically.

For future commits, pre-commit will run automatically on changed files before each commit.

mergify · 2026-06-10T04:02:39Z

Pre-commit checks failed

mergify · 2026-06-10T21:11:51Z

Pre-commit checks failed

mergify · 2026-06-10T21:39:37Z

Pre-commit checks failed

mergify · 2026-06-12T04:21:03Z

This PR has merge conflicts with the base branch. Please rebase:

git fetch origin main
git rebase origin/main
# Resolve any conflicts, then:
git push --force-with-lease

mergify · 2026-06-16T17:32:43Z

Pre-commit checks failed

mergify · 2026-06-16T18:06:25Z

Pre-commit checks failed

H1yori233 · 2026-06-20T04:18:25Z

+        state.previous_latents = full_chunk.detach()[:, -chunk_size:]
+
+        if not dist.is_initialized() or dist.get_rank() == 0:
+            print(


This should use the logger instead of print.
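
A minimal sketch of rank-0 logging with the standard `logging` module is shown below. The logger name and the helper `log_on_rank0` are hypothetical; the condition mirrors the `not dist.is_initialized() or dist.get_rank() == 0` guard in the diff, with the distributed state passed in as plain arguments so the example runs without `torch.distributed`. In the actual code the project's own logger would take the place of `logging.getLogger`.

```python
import logging

logger = logging.getLogger("streaming_long_tuning_sketch")  # hypothetical logger name


def log_on_rank0(message: str, *, is_distributed: bool, rank: int) -> bool:
    """Log `message` at INFO level on rank 0, or always when not distributed.

    Returns True when the message was handed to the logger.
    """
    if not is_distributed or rank == 0:
        logger.info(message)
        return True
    return False
```

Routing the message through a logger lets the log level and handlers control the output, which a bare `print` cannot do.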

H1yori233 · 2026-06-20T04:25:10Z

    )


+def _retain_kv_with_sink(


There are duplicate functions in fastvideo/models/dits/matrixgame2/causal_model.py; they should be merged.
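
For readers unfamiliar with the idea behind an attention sink, the sketch below shows one common way to trim a key/value cache: keep the first few entries (the sink) plus the most recent entries, and drop the middle. This is an assumption about the general technique, not the body of `_retain_kv_with_sink`, which is not visible in this conversation; the name `retain_with_sink` and its parameters are hypothetical, and a list stands in for the cached tensors.

```python
from typing import List, TypeVar

T = TypeVar("T")


def retain_with_sink(cache: List[T], max_len: int, sink_len: int) -> List[T]:
    """Keep the first `sink_len` entries plus the newest ones, at most `max_len` in total."""
    if not 0 <= sink_len <= max_len:
        raise ValueError("sink_len must satisfy 0 <= sink_len <= max_len")
    if len(cache) <= max_len:
        return list(cache)  # nothing to evict yet

    recent = max_len - sink_len  # slots left for the most recent entries
    return cache[:sink_len] + cache[len(cache) - recent:]
```

With a cache of 10 entries, `max_len=6`, and `sink_len=2`, entries 0 and 1 are kept as the sink and entries 6 through 9 as the recent window.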

H1yori233 · 2026-06-20T04:26:03Z

+                        where=("multi_phased_distill_schedule"
+                               f"[{idx}].streaming_fixed_overlap_latents"),
+                    )),
+                    train_first_chunk=_as_bool(


train_first_chunk is read but never used, so it should be deleted.

mignonjia added 9 commits May 29, 2026 07:01

long live init

05b4b15

Merge branch 'hao-ai-lab:main' into mhuo/longlive-nvl

81dfb53

[feat]: add eval metrics during validation training

8347ee8

[feat]: add ptlflow validation metrics during training

49d1eeb

[feat]: add fvd validation metric during training

7ed428c

[bugfix]: map zelda matrixgame models to mg2 config

8667398

[feat] add MatrixGame2 validation action overlay

61e6371

Merge branch 'hao-ai-lab:main' into mhuo/longlive-nvl

d515b00

Merge branch 'hao-ai-lab:main' into mhuo/longlive-nvl

7c13819

mignonjia requested a review from alexzms June 8, 2026 23:02

mergify Bot added scope: training Training pipeline, methods, configs scope: infra CI, tests, Docker, build scope: model Model architecture (DiTs, encoders, VAEs) labels Jun 8, 2026

gemini-code-assist Bot reviewed Jun 8, 2026

mignonjia added 2 commits June 9, 2026 00:25

[misc]: prune MatrixGame2 registry extras

bb5a1d5

long tuning for mg2

0b3d3ec

mergify Bot added the scope: inference Inference pipeline, serving, CLI label Jun 9, 2026

mignonjia added 2 commits June 9, 2026 20:21

[misc]: rename worldmodel flow calibration

08f0f8c

registry

acaa379

mignonjia requested review from H1yori233 and SolitaryThinker June 9, 2026 22:22

mignonjia self-assigned this Jun 9, 2026

long tuning

3448462

mergify Bot added the scope: docs Documentation label Jun 9, 2026

mignonjia marked this pull request as ready for review June 9, 2026 22:29

mignonjia changed the title ~~World model training helper functions~~ World model training Jun 9, 2026

mignonjia changed the title ~~World model training~~ [feat] World model training using third person games Jun 9, 2026

mergify Bot added the type: feat New feature or capability label Jun 9, 2026

add test for validation during train

3da8b02

Merge branch 'hao-ai-lab:main' into mhuo/longlive-nvl

b8c09e3

hao-ai-lab deleted a comment from mergify Bot Jun 10, 2026

revert MG self forcing orig

028c00e

update keyward scale

8c50db4

mergify Bot added the scope: data Data preprocessing, datasets label Jun 10, 2026

add data mix logic

4003e70

mignonjia force-pushed the mhuo/longlive-nvl branch from a212635 to 4003e70 Compare June 10, 2026 21:37

mignonjia marked this pull request as draft June 10, 2026 21:40

mignonjia added 5 commits June 11, 2026 16:54

Merge branch 'hao-ai-lab:main' into mhuo/longlive-nvl

b55f472

update data path

07494fe

[bugfix]: limit synthetic flow validation metrics

c63beb0

[feat]: support wandb entity in trainer config

2cb90fe

yaml files for wm

7ea3ac1

mignonjia marked this pull request as ready for review June 12, 2026 01:50

mergify Bot added the needs-rebase PR has merge conflicts label Jun 12, 2026

mignonjia removed their assignment Jun 16, 2026

Merge branch 'main' into mhuo/longlive-nvl

48109b4

update merge

594a685

mergify Bot removed the needs-rebase PR has merge conflicts label Jun 16, 2026

H1yori233 reviewed Jun 20, 2026
