Skip to content

feat: Optimize Select op#115

Merged
tngchien merged 8 commits intoMooreThreads:mainfrom
awexxxx:main
Mar 24, 2026
Merged

feat: Optimize Select op#115
tngchien merged 8 commits intoMooreThreads:mainfrom
awexxxx:main

Conversation

@awexxxx
Copy link
Copy Markdown
Contributor

@awexxxx awexxxx commented Mar 19, 2026

Shared input and output memory space, saving gmem space.

Count: 144

origin:
Total Time: 5ms
Avg Time: 0.035ms

after:
Total Time: 2.289ms
Avg Time: 0.016ms

@awexxxx
Copy link
Copy Markdown
Contributor Author

awexxxx commented Mar 20, 2026

Add Multiply optimization
Count: 201

origin:
Total Time: 4.43ms
Avg Time: 0.022ms

after:
Total Time: 2.678ms
Avg Time: 0.013ms

@tngchien tngchien merged commit e94c3ed into MooreThreads:main Mar 24, 2026
0 of 2 checks passed
EternityTang pushed a commit to EternityTang/tensorflow_musa_extension that referenced this pull request Apr 4, 2026
* feat: Optimize Select op

---------

Co-authored-by: hxc <>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants