Which component requires the feature?
CuTe DSL
Feature Request
Is your feature request related to a problem? Please describe.
It would be useful for cute.Tensor to support permute, transpose, unsqueeze, expand. Calling them in host-side launchers instead of torch would reduce CPU-side overheads.
Describe the solution you'd like
Expand the cute.Tensor class to include the methods suggested.
Describe alternatives you've considered
Defining helper methods
Additional context
N/A
Which component requires the feature?
CuTe DSL
Feature Request
Is your feature request related to a problem? Please describe.
It would be useful for
cute.Tensorto supportpermute,transpose,unsqueeze,expand. Calling them in host-side launchers instead of torch would reduce CPU-side overheads.Describe the solution you'd like
Expand the
cute.Tensorclass to include the methods suggested.Describe alternatives you've considered
Defining helper methods
Additional context
N/A