[BUILD] Modularize device runtime into per-backend DSOs by tqchen · Pull Request #19593 · apache/tvm

tqchen · 2026-05-21T14:30:23Z

Summary

Modularize libtvm_runtime into per-backend shared libraries (libtvm_runtime_cuda, libtvm_runtime_vulkan, etc.) and libtvm_runtime_extra for contrib/disco modules.

Each backend can be built independently (USE_CUDA=ON produces only libtvm_runtime.so + libtvm_runtime_cuda.so)
During Python import, available backend DSOs are discovered and loaded automatically, with missing backends silently skipped
libtvm_runtime_extra is always produced if it has sources (disco, contrib modules)
No user-visible config flag changes — existing USE_CUDA, USE_VULKAN, etc. continue to work

Test plan

Build with USE_CUDA=ON — verify libtvm_runtime.so, libtvm_runtime_cuda.so, libtvm_runtime_extra.so produced
python -c "import tvm; print(tvm.cuda(0).exist)" prints True
Runtime GPU tests pass
Moving libtvm_runtime_cuda.so out of lib/ — CUDA unavailable, no crash

gemini-code-assist

Code Review

This pull request refactors the TVM runtime build system to split the monolithic runtime into a core library, per-backend dynamic shared objects (DSOs) for CUDA, Vulkan, OpenCL, Metal, ROCm, and Hexagon, and a separate libtvm_runtime_extra library for contrib and disco sources. Correspondingly, the Python loading logic was updated to dynamically discover and load these backend DSOs. Review feedback identified a missing compile definition for the cuDNN frontend and a package naming inconsistency in the library loading utility that could lead to resolution failures.

tqchen · 2026-05-21T15:11:24Z

/gemini review

gemini-code-assist

Code Review

This pull request refactors the TVM runtime build system to split backend-specific code (CUDA, Vulkan, OpenCL, Metal, ROCm, Hexagon) and various contrib modules into separate dynamic shared libraries, specifically introducing libtvm_runtime_extra. This change allows for a more modular runtime and dynamic loading of backends in Python. Feedback includes concerns regarding regressions for monolithic Hexagon DSP builds where Disco sources might be missing, the need to gate the creation of the extra runtime library for Hexagon, and improving the robustness of the CMake logic when no extra sources are present.

gemini-code-assist · 2026-05-21T15:13:18Z

 include(cmake/modules/contrib/vllm.cmake)
 include(cmake/modules/Git.cmake)

+# ---- libtvm_runtime_extra assembly ----


The libtvm_runtime_extra assembly block should be gated to avoid creating a shared library when building for the Hexagon DSP (BUILD_FOR_HEXAGON). Hexagon DSP builds typically use static linking and a monolithic runtime, and the standard shared library mechanism used here may not be appropriate for the DSP environment.

# ---- libtvm_runtime_extra assembly ---- if(NOT BUILD_FOR_HEXAGON)

Modularize libtvm_runtime into per-backend shared libraries (libtvm_runtime_cuda, libtvm_runtime_vulkan, etc.) and libtvm_runtime_extra for contrib/disco modules. Each backend can be built independently. During Python import, available backend DSOs are discovered and loaded automatically, with missing backends silently skipped.

tqchen · 2026-05-22T04:17:03Z

Reopening from upstream branch to pick up Jenkins groovy changes.

gemini-code-assist Bot reviewed May 21, 2026

View reviewed changes

Comment thread cmake/modules/CUDA.cmake Outdated

Comment thread python/tvm/libinfo.py Outdated

tqchen force-pushed the runtime-modularize branch from 9c78db7 to 0a12a3c Compare May 21, 2026 14:49

gemini-code-assist Bot reviewed May 21, 2026

View reviewed changes

tqchen force-pushed the runtime-modularize branch 9 times, most recently from 029ad2c to 31200ba Compare May 21, 2026 19:01

tlopex approved these changes May 21, 2026

View reviewed changes

tqchen force-pushed the runtime-modularize branch from 31200ba to 3e89882 Compare May 22, 2026 02:21

tqchen closed this May 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUILD] Modularize device runtime into per-backend DSOs#19593

[BUILD] Modularize device runtime into per-backend DSOs#19593
tqchen wants to merge 1 commit into
apache:mainfrom
tqchen:runtime-modularize

tqchen commented May 21, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

tqchen commented May 21, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

Uh oh!

tqchen commented May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tqchen commented May 21, 2026

Summary

Test plan

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

tqchen commented May 21, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tqchen commented May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants