Skip to content

CUDA: support for lazy init#758

Open
Sergei-Lebedev wants to merge 3 commits intoopenucx:masterfrom
Sergei-Lebedev:topic/cuda_lazy_init
Open

CUDA: support for lazy init#758
Sergei-Lebedev wants to merge 3 commits intoopenucx:masterfrom
Sergei-Lebedev:topic/cuda_lazy_init

Conversation

@Sergei-Lebedev
Copy link
Copy Markdown
Contributor

What

Lazily initialize TL NCCL and TL CUDA on first CUDA collective.

Why ?

Both NCCL and CUDA require CUDA devices to be set before team create. In MPI workloads it's not always possible since MPI_Init creates UCC team and to set device we need to know rank and local rank.

@swx-jenkins3
Copy link
Copy Markdown

Can one of the admins verify this patch?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants