[bugfix] fix qwen3_vl#73
Conversation
Code Review
This pull request modifies the transformer block to support keyword arguments during activation checkpointing by wrapping the forward function and converting kwargs into positional arguments. This ensures that tensor keyword arguments are correctly tracked by the autograd graph. Feedback suggests that other tensor inputs currently captured via closure, such as attention_bias and packed_seq_params, should also be passed as explicit arguments to ensure robust autograd tracking during recomputation.
```python
def wrapped_forward(hidden_states, attention_mask, context, context_mask, rotary_pos_emb, padding_mask,
                    *extra_args):
    extra_kwargs = dict(zip(extra_kwargs_keys, extra_args))
    return forward_func(
        hidden_states,
        attention_mask,
        context,
        context_mask,
        rotary_pos_emb,
        padding_mask,
        **extra_kwargs,
    )
```
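For readers outside the Megatron codebase, the following is a minimal, self-contained sketch of the same kwargs-to-positional trick using plain torch.utils.checkpoint rather than the project's own checkpoint wrapper; forward_func, extra_kwargs_keys, and the tensor shapes here are illustrative and not the PR's actual call site.

```python
import torch
from torch.utils.checkpoint import checkpoint

# Stand-in for the real layer forward, which takes tensor keyword arguments.
def forward_func(hidden_states, attention_mask, **extra_kwargs):
    out = hidden_states * 2
    if extra_kwargs.get("attention_bias") is not None:
        out = out + extra_kwargs["attention_bias"]
    return out

hidden_states = torch.randn(4, 8, requires_grad=True)
attention_mask = None
attention_bias = torch.randn(4, 8, requires_grad=True)

# Flatten the tensor kwargs into a fixed key order plus a positional tail.
extra_kwargs_keys = ["attention_bias"]
extra_args = (attention_bias,)

def wrapped_forward(hidden_states, attention_mask, *extra_args):
    # Rebuild the kwargs dict from the positional tail before calling the real forward.
    extra_kwargs = dict(zip(extra_kwargs_keys, extra_args))
    return forward_func(hidden_states, attention_mask, **extra_kwargs)

# Every tensor enters checkpoint() positionally, so it is recorded as an
# autograd input and replayed correctly when the block is recomputed.
out = checkpoint(wrapped_forward, hidden_states, attention_mask, *extra_args,
                 use_reentrant=True)
out.sum().backward()
print(hidden_states.grad is not None, attention_bias.grad is not None)  # True True
```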
The wrapped_forward function correctly converts the positional arguments back into keyword arguments, working around the torch.utils.checkpoint limitation that only positional inputs are forwarded to the checkpointed function and tracked as autograd inputs. However, note that attention_bias and packed_seq_params (from the outer _checkpointed_forward scope) are still captured via closure inside custom_forward. While this may be acceptable if they never require gradients, it is generally safer to pass all tensor inputs as explicit arguments to the checkpointed function so the autograd engine tracks them correctly during activation recomputation.
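As a rough sketch of the direction this comment points at (the names forward_func, extra_kwargs_keys, attention_bias, and packed_seq_params are assumed from the diff above; this is not the merged implementation), the closure-captured values could be promoted to explicit parameters of the wrapper:

```python
# Sketch only: one way _checkpointed_forward could build the wrapper so that
# attention_bias and packed_seq_params are explicit checkpoint inputs instead
# of closure captures, as the review comment suggests.
def make_wrapped_forward(forward_func, extra_kwargs_keys):
    def wrapped_forward(hidden_states, attention_mask, context, context_mask,
                        rotary_pos_emb, padding_mask, attention_bias,
                        packed_seq_params, *extra_args):
        # Rebuild the remaining kwargs from the positional tail, then forward
        # the now-explicit inputs on to the layer forward as keywords.
        extra_kwargs = dict(zip(extra_kwargs_keys, extra_args))
        return forward_func(
            hidden_states,
            attention_mask,
            context,
            context_mask,
            rotary_pos_emb,
            padding_mask,
            attention_bias=attention_bias,
            packed_seq_params=packed_seq_params,
            **extra_kwargs,
        )
    return wrapped_forward
```

The corresponding checkpoint call would then pass attention_bias and packed_seq_params positionally alongside the other inputs, so nothing the recomputation depends on is hidden in a closure.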