Recover Conv/ConvTranspose rank from weight when input shape is unknown by fanchenkong1 · Pull Request #29149 · microsoft/onnxruntime

fanchenkong1 · 2026-06-18T04:25:48Z

Recover Conv/ConvTranspose rank from weight when input shape is unknown, enabling layout transformation to NHWC for more nodes.

Description

The layout transformer skips converting a node to NHWC when input[0] has no inferred shape.

For Conv and ConvTranspose operators, the data input (input[0]) and the weight (input[1]) always share the same rank. When the input rank is unknown, recover it from the weight.

Performance Impact

Measured on Kokoro-82M-v1.0-ONNX text-to-speech model (onnx-community/Kokoro-82M-v1.0-ONNX) with WebGPU ep,

Platform	Latency reduction	Speedup
Intel Wildcat Lake	−32.0%	1.47×
Intel Panther Lake	−20.0%	1.25×

This change yields a 1.2–1.5× speedup on the Kokoro-82M text-to-speech model.

The layout transformer skips converting a node to NHWC when input[0] has no inferred shape. But the NCHW<->NHWC permutation depends only on rank. For Conv/ConvTranspose the data input and weight share the same rank, so when input[0]'s rank is unknown, recover it from the weight at input[1].

fanchenkong1 · 2026-06-18T05:11:41Z

@qjia7 @guschmue This change is ready for review. PTAL, thanks!

qjia7

Correctness

The change is sound. Per ONNX spec, Conv and ConvTranspose require W (input[1])
at the same rank as X (input[0]) — [M, C/group, k1..kn] vs [N, C, d1..dn].
Falling back to the weight's rank when the data input's rank is unknown is safe.
Downstream (ChannelFirstToLastPerm / ChannelLastToFirstPerm) only needs the
rank, not the full shape. FusedConv is covered via the existing op_type
normalization to "Conv".

Simplicity

The defensive guard node->Inputs().size() > 1 && !node->Inputs()[1].empty() is
redundant. Per ONNX spec, W is a mandatory input for both Conv and
ConvTranspose — a node missing it would already be malformed and rejected
upstream. The empty-string convention only applies to optional inputs (like
B).

Suggest simplifying to:

if (!input_rank.has_value() && (op_type == "Conv" || op_type == "ConvTranspose")) {
  input_rank = api_graph->GetValueInfo(node->Inputs()[1])->ShapeRank();
}

Using ShapeRank() over Shape()->size() is the right API choice.

Security

No new attack surface. Reads an existing graph value-info; no allocation, no
unchecked arithmetic.

Testing

A unit test that constructs a Conv with unknown input[0] rank but known weight
rank, runs the layout transformer, and asserts the transpose is inserted would
lock the behavior in.

Verdict

Approve. One optional simplification (drop the redundant guard) and an optional
test.

fanchenkong1 · 2026-06-24T04:42:23Z

Correctness

The change is sound. Per ONNX spec, Conv and ConvTranspose require W (input[1]) at the same rank as X (input[0]) — [M, C/group, k1..kn] vs [N, C, d1..dn]. Falling back to the weight's rank when the data input's rank is unknown is safe. Downstream (ChannelFirstToLastPerm / ChannelLastToFirstPerm) only needs the rank, not the full shape. FusedConv is covered via the existing op_type normalization to "Conv".

Simplicity

The defensive guard node->Inputs().size() > 1 && !node->Inputs()[1].empty() is redundant. Per ONNX spec, W is a mandatory input for both Conv and ConvTranspose — a node missing it would already be malformed and rejected upstream. The empty-string convention only applies to optional inputs (like B).

Suggest simplifying to:
if (!input_rank.has_value() && (op_type == "Conv" || op_type == "ConvTranspose")) {
  input_rank = api_graph->GetValueInfo(node->Inputs()[1])->ShapeRank();
}
Using ShapeRank() over Shape()->size() is the right API choice.

Security

No new attack surface. Reads an existing graph value-info; no allocation, no unchecked arithmetic.

Testing

A unit test that constructs a Conv with unknown input[0] rank but known weight rank, runs the layout transformer, and asserts the transpose is inserted would lock the behavior in.

Verdict

Approve. One optional simplification (drop the redundant guard) and an optional test.

@qjia7, addressed your comments. PTAL, thanks!

Copilot

Pull request overview

This PR improves the layout transformation pass so it can still convert Conv/ConvTranspose nodes to NHWC even when the data input shape (input[0]) has no inferred rank, by recovering the rank from the weight input (input[1]). This enables more nodes to be transformed and allows downstream transpose optimization to reduce overhead, particularly benefiting WebGPU.

Changes:

Update layout transformation to use ShapeRank() and, for Conv/ConvTranspose, fall back to the weight’s rank when the data input rank is unknown.
Add a unit test that constructs a Conv with cleared input shape and verifies layout transformation proceeds (via inserted Transpose nodes).

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
onnxruntime/core/optimizer/layout_transformation/layout_transformation.cc	Recover `Conv`/`ConvTranspose` rank from weight when input rank is unknown, enabling NHWC conversion in more cases.
onnxruntime/test/optimizer/transpose_optimizer_test.cc	Adds coverage verifying rank recovery from weights allows layout transformation to insert transposes for a `Conv` with unknown input shape.

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

…is unknown

qjia7

LGTM with one nit.

fanchenkong1 · 2026-06-25T07:01:01Z

The Windows GPU CUDA CI Pipeline Test Job encountered a network error when trying to download a test model, which seems to be a infra-related failure.

fanchenkong1 marked this pull request as ready for review June 18, 2026 04:44

qjia7 reviewed Jun 22, 2026

View reviewed changes

fanchenkong1 added 2 commits June 24, 2026 11:15

Simplify rank recovery check by removing redundant guard

9eb512d

Add test for Conv rank recovery from weight when input rank is unknown

88889e3

qjia7 requested a review from Copilot June 24, 2026 07:35

Copilot started reviewing on behalf of qjia7 June 24, 2026 07:35 View session

Copilot AI reviewed Jun 24, 2026

View reviewed changes

Comment thread onnxruntime/core/optimizer/layout_transformation/layout_transformation.cc Outdated

Comment thread onnxruntime/test/optimizer/transpose_optimizer_test.cc Outdated

fanchenkong1 and others added 2 commits June 24, 2026 15:44

Update comments

5c934f1

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Add test for ConvTranspose rank recovery from weight when input rank …

c6c3039

…is unknown

qjia7 previously approved these changes Jun 24, 2026

View reviewed changes

Comment thread onnxruntime/test/optimizer/transpose_optimizer_test.cc

Refactor Conv/ConvTranspose rank recovery tests to use helper function

28cc225

fanchenkong1 dismissed qjia7’s stale review via 28cc225 June 25, 2026 02:01

qjia7 approved these changes Jun 25, 2026

View reviewed changes

qjia7 requested a review from skottmckay June 25, 2026 08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Recover Conv/ConvTranspose rank from weight when input shape is unknown#29149

Recover Conv/ConvTranspose rank from weight when input shape is unknown#29149
fanchenkong1 wants to merge 6 commits into
microsoft:mainfrom
fanchenkong1:rank-recover

fanchenkong1 commented Jun 18, 2026

Uh oh!

fanchenkong1 commented Jun 18, 2026

Uh oh!

qjia7 left a comment

Uh oh!

fanchenkong1 commented Jun 24, 2026

Correctness

Simplicity

Security

Testing

Verdict

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

qjia7 left a comment

Uh oh!

Uh oh!

fanchenkong1 commented Jun 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

fanchenkong1 commented Jun 18, 2026

Description

Performance Impact

Uh oh!

fanchenkong1 commented Jun 18, 2026

Uh oh!

qjia7 left a comment

Choose a reason for hiding this comment

Correctness

Simplicity

Security

Testing

Verdict

Uh oh!

fanchenkong1 commented Jun 24, 2026

Correctness

Simplicity

Security

Testing

Verdict

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

qjia7 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fanchenkong1 commented Jun 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants