Update to cute dsl 4.6.0.dev0 by anakinxc · Pull Request #94 · inclusionAI/cuLA

anakinxc · 2026-06-12T11:06:08Z

📌 Description

Fixes compatibility issues with cute dsl 4.6.0.dev0.

This change is not compatible with versions below 4.6.0.

🔍 Related Issues

🚀 Pull Request Checklist

Thank you for contributing to cuLA! Before we review your pull request, please make sure the following items are complete.

✅ Pre-commit Checks

I have installed pre-commit by running pip install pre-commit (or used your preferred method).
I have installed the hooks with pre-commit install.
I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.

If you are unsure about how to set up pre-commit, see the pre-commit documentation.

🧪 Tests

Tests have been added or updated as needed.
All tests are passing.

⚡ Performance

Reviewer Notes

gemini-code-assist

Code Review

This pull request updates various SM100 operations to adapt to the new nvidia-cutlass-dsl version (bumped to >=4.6.0.dev0), including direct imports of OperandMajorMode, intrinsic cleanups, and passing separate operand data types to make_trivial_tiled_mma. The review feedback highlights several instances in the fully fused KDA, lightning attention, and linear attention modules where the newly added second data type argument was incorrectly duplicated (e.g., passing the same type twice) instead of correctly specifying the distinct data types for both operands (such as q_dtype, k_dtype, v_dtype, or io_dtype).
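The operand pairing that the review comments describe can be written down as a small lookup table. This is only a sketch under stated assumptions: `EXPECTED_OPERANDS` and `check_operands` are hypothetical helpers that do not exist in cuLA or nvidia-cutlass-dsl, and the pairs are copied from the review comments rather than verified against the kernels.

```python
# Hypothetical lookup table: MMA name -> (first operand dtype, second operand dtype).
# The pairs are taken from the review comments, not from the kernel sources.
EXPECTED_OPERANDS = {
    "qk_tiled_mma": ("q_dtype", "k_dtype"),
    "sq_tiled_mma": ("io_dtype", "q_dtype"),
    "ks_tiled_mma": ("io_dtype", "k_dtype"),
    "kv_tiled_mma": ("v_dtype", "k_dtype"),
    "fake_kv_tiled_mma_acc32": ("v_dtype", "k_dtype"),
}


def check_operands(mma_name, a_attr, b_attr):
    """Return None if the pair matches the review's expectation, else a message."""
    expected = EXPECTED_OPERANDS[mma_name]
    if (a_attr, b_attr) == expected:
        return None
    return f"{mma_name}: got ({a_attr}, {b_attr}), review expects {expected}"


# A duplicated dtype is reported; the pair suggested by the review passes.
print(check_operands("qk_tiled_mma", "q_dtype", "q_dtype"))
print(check_operands("qk_tiled_mma", "q_dtype", "k_dtype"))
```

When both operands share one dtype at runtime, the duplicated form behaves the same, so a check like this matters only if the operand dtypes can differ.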

gemini-code-assist · 2026-06-12T11:08:23Z

+            self.q_dtype,
            self.q_dtype,


For qk_tiled_mma, the operands are Q and K. The first two arguments should be self.q_dtype and self.k_dtype respectively, rather than passing self.q_dtype twice.

Suggested change

-            self.q_dtype,
-            self.q_dtype,
+            self.q_dtype,
+            self.k_dtype,

gemini-code-assist · 2026-06-12T11:08:23Z

+            self.io_dtype,
            self.io_dtype,


For sq_tiled_mma, the operands are State and Q. The first two arguments should be self.io_dtype and self.q_dtype respectively.

Suggested change

-            self.io_dtype,
-            self.io_dtype,
+            self.io_dtype,
+            self.q_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.io_dtype,
            self.io_dtype,


For ks_tiled_mma, the operands are State and K. The first two arguments should be self.io_dtype and self.k_dtype respectively.

Suggested change

-            self.io_dtype,
-            self.io_dtype,
+            self.io_dtype,
+            self.k_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.k_dtype,
            self.k_dtype,


For fake_kv_tiled_mma_acc32, the operands are V and K. The first two arguments should be self.v_dtype and self.k_dtype respectively.

Suggested change

-            self.k_dtype,
-            self.k_dtype,
+            self.v_dtype,
+            self.k_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.q_dtype,
            self.q_dtype,


For qk_tiled_mma, the operands are Q and K. The first two arguments should be self.q_dtype and self.k_dtype respectively.

Suggested change

-            self.q_dtype,
-            self.q_dtype,
+            self.q_dtype,
+            self.k_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.k_dtype,
            self.k_dtype,


For fake_kv_tiled_mma_acc32, the operands are V and K. The first two arguments should be self.v_dtype and self.k_dtype respectively.

Suggested change

-            self.k_dtype,
-            self.k_dtype,
+            self.v_dtype,
+            self.k_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.q_dtype,
            self.q_dtype,


For qk_tiled_mma, the operands are Q and K. The first two arguments should be self.q_dtype and self.k_dtype respectively.

Suggested change

-            self.q_dtype,
-            self.q_dtype,
+            self.q_dtype,
+            self.k_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.k_dtype,
            self.k_dtype,


For kv_tiled_mma, the operands are V and K. The first two arguments should be self.v_dtype and self.k_dtype respectively.

Suggested change

-            self.k_dtype,
-            self.k_dtype,
+            self.v_dtype,
+            self.k_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.io_dtype,
            self.io_dtype,


For sq_tiled_mma, the operands are State and Q. The first two arguments should be self.io_dtype and self.q_dtype respectively.

Suggested change

-            self.io_dtype,
-            self.io_dtype,
+            self.io_dtype,
+            self.q_dtype,

gemini-code-assist · 2026-06-12T11:08:24Z

+            self.k_dtype,
            self.k_dtype,


For fake_kv_tiled_mma_acc32, the operands are V and K. The first two arguments should be self.v_dtype and self.k_dtype respectively.

Suggested change

-            self.k_dtype,
-            self.k_dtype,
+            self.v_dtype,
+            self.k_dtype,
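All of the comments above flag the same pattern: a `self.<name>_dtype,` argument line that repeats the line directly above it. A rough way to list such spots for manual review is a line-based scan. This is a minimal sketch, not part of the PR; `find_duplicated_dtype_args` is a hypothetical helper, and it assumes one argument per line, as in the quoted hunks.

```python
import re

# Matches an argument line such as "            self.q_dtype,".
DTYPE_LINE = re.compile(r"^\s*(self\.\w+_dtype),\s*$")


def find_duplicated_dtype_args(source: str):
    """Return (line_number, expression) for each dtype argument line
    that repeats the dtype argument on the line directly above it."""
    hits = []
    previous = None
    for lineno, line in enumerate(source.splitlines(), start=1):
        match = DTYPE_LINE.match(line)
        current = match.group(1) if match else None
        if current is not None and current == previous:
            hits.append((lineno, current))
        previous = current
    return hits


# Illustrative input only; the call shape is not the real helper signature.
sample = (
    "mma = make_mma(\n"
    "    self.q_dtype,\n"
    "    self.q_dtype,\n"
    "    acc_dtype,\n"
    ")\n"
)
print(find_duplicated_dtype_args(sample))
```

A hit is only a candidate: where both operands really share a dtype, the repeated argument is correct.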

Copilot

Pull request overview

Updates cuLA to be compatible with nvidia-cutlass-dsl 4.6.0.dev0, primarily by adapting SM100 (Blackwell) CuteDSL kernel code to API changes in operand major-mode enums, MMA helper signatures, and NVVM tcgen05 MLIR op bindings.

Changes:

Bump nvidia-cutlass-dsl dependency to >=4.6.0.dev0.
Update multiple SM100 ops to use OperandMajorMode (instead of tcgen05.OperandMajorMode) and pass explicit operand dtypes into sm100_utils.make_trivial_tiled_mma(...).
Adjust SM100 NVVM tcgen05 wrapper calls to match updated MLIR op argument names/signatures (e.g., val= for stores, drop num= where no longer accepted).

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| tests/conftest.py | Minor collection logic formatting; maintains existing skip behavior. |
| pyproject.toml | Bumps Cutlass DSL dependency to `>=4.6.0.dev0`. |
| cula/ops/linear_attn_sm100.py | Updates major-mode enum usage and MMA helper argument list for Cutlass DSL 4.6.0. |
| cula/ops/lightning_attn_sm100.py | Same Cutlass DSL 4.6.0 compatibility adjustments (enum + MMA helper signature). |
| cula/ops/kda_fully_fused_sm100_wip.py | Same Cutlass DSL 4.6.0 compatibility adjustments across KDA fused path. |
| cula/ops/intrinsics_sm100.py | Updates NVVM tcgen05 wrapper bindings to new MLIR op APIs (`val=`, vector extract changes, etc.). |
| cula/ops/fwd_o_sm100.py | Updates MMA setup to new major-mode enum + MMA helper signature. |
| cula/ops/cp/pre_scan.py | Updates MMA setup to new major-mode enum + MMA helper signature. |
| cula/ops/chunk_wy_dqkg_sm100.py | Updates multiple MMA setups to new major-mode enum + MMA helper signature. |
| cula/ops/chunk_delta_h_sm100.py | Updates MMA setup to new major-mode enum + MMA helper signature. |

 dependencies = [
-    "nvidia-cutlass-dsl>=4.4.2",
+    "nvidia-cutlass-dsl>=4.6.0.dev0",
    "apache-tvm-ffi>=0.1.9",
 ]
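Because the new floor is a dev release, ordering matters: `4.6.0.dev0` sorts before `4.6.0` but after every `4.5.x` and `4.4.x`. The sketch below illustrates that ordering with a deliberately simplified comparison. `version_key` and `satisfies_minimum` are hypothetical names, and the parser handles only `X.Y.Z` and `X.Y.Z.devN` strings, not the full PEP 440 grammar.

```python
import re

MINIMUM = "4.6.0.dev0"


def version_key(version: str):
    """Sort key for 'X.Y.Z' and 'X.Y.Z.devN' strings only (simplified)."""
    match = re.fullmatch(r"(\d+(?:\.\d+)*)(?:\.dev(\d+))?", version)
    if match is None:
        raise ValueError(f"unsupported version string: {version!r}")
    release = [int(part) for part in match.group(1).split(".")]
    # Drop trailing zeros so that "4.6" and "4.6.0" compare equal.
    while len(release) > 1 and release[-1] == 0:
        release.pop()
    dev = match.group(2)
    # A dev release sorts before the final release with the same number.
    if dev is not None:
        return (tuple(release), 0, int(dev))
    return (tuple(release), 1, 0)


def satisfies_minimum(installed: str, minimum: str = MINIMUM) -> bool:
    """True if `installed` is at least `minimum` under the simplified ordering."""
    return version_key(installed) >= version_key(minimum)


for candidate in ("4.4.2", "4.6.0.dev0", "4.6.0"):
    print(candidate, satisfies_minimum(candidate))
```

In a real environment the installed version string could be read with `importlib.metadata.version("nvidia-cutlass-dsl")`; for anything beyond these two version shapes, a full PEP 440 implementation is the safer choice.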


icavan

LGTM, will merge this PR once flashinfer has cutedsl 4.6 enabled.

anakinxc added 2 commits June 11, 2026 21:24

Update to cute dsl 4.6.0.dev0

abc695f

Format+lint

c20faf8

gemini-code-assist Bot reviewed Jun 12, 2026

tongke6 requested a review from Copilot June 12, 2026 12:02

Copilot started reviewing on behalf of tongke6 June 12, 2026 12:03

Copilot AI reviewed Jun 12, 2026

Comment thread pyproject.toml

Comment on lines 13 to 16

 dependencies = [
-    "nvidia-cutlass-dsl>=4.4.2",
+    "nvidia-cutlass-dsl>=4.6.0.dev0",
     "apache-tvm-ffi>=0.1.9",
 ]

icavan requested review from icavan and tongke6 June 14, 2026 04:30

icavan approved these changes Jun 14, 2026
