[multimodal] add language_model_only flag for models like qwen3.5 by erictang000 · Pull Request #1487 · NovaSky-AI/SkyRL

erictang000 · 2026-04-09T21:08:06Z

Add `language_model_only` flag for multimodal models (Qwen3.5)

Summary

Add language_model_only config flag across policy, ref, and inference engine configs to skip vision encoder initialization for multimodal models like Qwen3.5, reducing GPU memory usage
Fix FSDP weight sync: remap CausalLM param names (model.layers.*) to vLLM's expected namespace (language_model.model.layers.*) via new weight_prefix in FSDPWeightExtractor
Make FSDP wrap policy resilient to missing vision-only layer classes (warn + skip instead of crash)
Add flash-linear-attention and causal-conv1d dependencies; unblock causal-conv1d install override -- required for performant GDN layer execution
Add run_qwen3.5_0.8b.sh example with use_sample_packing=false (GDN layers are incompatible with packing)

Runs

FSDP and megatron reward matching

Test plan

Run run_qwen3.5_0.8b.sh on 4 GPUs -- verify weight sync, no GDN fallback warnings, avg_final_rewards trends up
Run existing non-multimodal FSDP test to confirm no regression
Verify config validation rejects mismatched language_model_only across policy/ref/generator

…uage_model_only

devin-ai-integration

Devin Review found 1 potential issue.

View 5 additional findings in Devin Review.

…uage_model_only

erictang000 · 2026-04-13T21:45:51Z

cc: @nithinvc PR adding language_model_only flag - this shouldn't effect any of your runs since it's false by default but just heads up

add language_model_only flag for models like qwen3.5

4ab2f2e

This comment was marked as resolved.

Sign in to view

x

929ef5f

This comment was marked as resolved.

Sign in to view

x

d5535e1

This comment was marked as resolved.

Sign in to view

erictang000 added 3 commits April 13, 2026 20:17

x

201a7c9

x

0f2962a

Merge branch 'main' of https://github.com/erictang000/SkyRL into lang…

63745c7

…uage_model_only

devin-ai-integration bot reviewed Apr 13, 2026

View reviewed changes

erictang000 added 2 commits April 13, 2026 21:25

Merge branch 'main' of https://github.com/erictang000/SkyRL into lang…

f3839be

…uage_model_only

X

8b87240

This comment was marked as resolved.

Sign in to view

x

97ea50b

erictang000 merged commit 5cf22c5 into NovaSky-AI:main Apr 13, 2026
5 of 6 checks passed

erictang000 deleted the language_model_only branch April 13, 2026 21:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[multimodal] add language_model_only flag for models like qwen3.5#1487

[multimodal] add language_model_only flag for models like qwen3.5#1487
erictang000 merged 9 commits intoNovaSky-AI:mainfrom
erictang000:language_model_only

erictang000 commented Apr 9, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

This comment was marked as resolved.

Uh oh!

erictang000 commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

erictang000 commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Add language_model_only flag for multimodal models (Qwen3.5)

Summary

Runs

Test plan

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as resolved.

Uh oh!

erictang000 commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

erictang000 commented Apr 9, 2026 •

edited

Loading

Add `language_model_only` flag for multimodal models (Qwen3.5)