Skip to content

UCT/ZE/TEST: expand ZE RMA coverage and fix ze_copy rkey_compare path#11331

Open
yafshar wants to merge 6 commits intoopenucx:masterfrom
intel-staging:test/ze-uct-rma
Open

UCT/ZE/TEST: expand ZE RMA coverage and fix ze_copy rkey_compare path#11331
yafshar wants to merge 6 commits intoopenucx:masterfrom
intel-staging:test/ze-uct-rma

Conversation

@yafshar
Copy link
Copy Markdown
Contributor

@yafshar yafshar commented Apr 7, 2026

What?

This PR expands ZE test coverage in UCT gtests and fixes a ZE MD callback issue
that caused a crash in existing tests.

  • Expand ZE RMA test coverage for the ze_copy transport in test_ze.cc
  • Extend ZE-related MD test coverage and expectations in test_md.cc
  • Register ZE gtest sources in Makefile.am
  • Update ZE test helpers in test_p2p_rma.cc and uct_test.h
  • Fix ze_copy component rkey compare callback and mem_dereg parameter handling
    in ze_copy_md.c
  • Fix ze_copy_md.c aligned with UCX style guidelines

Why?

ZE paths lacked sufficient automated coverage for ze_copy transport and MD
behavior, and an existing ZE MD test exposed a real failure:

  • The ze_copy MD test path crashed during rkey comparison due to missing component callback wiring.
  • Additional coverage is required to consistently validate ZE host, device, and managed memory cases across RMA and MD operations.

yafshar added 3 commits April 6, 2026 14:07
- Add UCT RMA tests for ze_copy covering put/get zcopy operations
  on ZE device and managed memory.
- Enable runtime skip logic based on ZE memory type availability
  instead of compile-time gtest filtering.
- Include ze_copy in p2p_rma source memory-type selection alongside
  cuda_ipc and rocm_copy.
- Introduce UCT_INSTANTIATE_ZE_TEST_CASE macro in uct_test.h to
  standardize ZE test instantiation.
- Update gtest Makefile.am to build ZE transport tests.
- Wire ze_copy component rkey_compare to uct_base_rkey_compare to
  prevent crashes in ze_copy/test_md.rkey_compare.
- Validate ze_copy mem_dereg parameters (memh and invalidate flag)
  to match MD bad-argument behavior when parameter checking is enabled.
- Fix the style
- Add ZE host, device, and managed memory RMA tests for ze_copy
  transport in test_ze.cc.
- Extend MD tests to cover ZE mem_query behavior and add ZE dmabuf
  query tests in test_md.cc.
@yafshar yafshar marked this pull request as ready for review April 7, 2026 21:36
Comment thread test/gtest/uct/test_md.cc Outdated
yafshar added 2 commits April 13, 2026 11:05
Run dmabuf mem_query tests via the generic MD fixture (test_md_dmabuf)
instead of a ZE-only fixture, validating the generic dmabuf interface
across MDs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants