Remove code paths that depend on RMM_STATIC_CUDART #1667

robertmaynard · 2024-09-04T19:43:38Z

Description

We can remove the optimizations around CUDA_STATIC_RUNTIME and instead see if the function is already in the process space so that RMM doesn't need to have any build context to run properly

Fixes #1679

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

harrism · 2024-09-04T23:18:48Z

Are existing tests sufficient to cover this?

robertmaynard · 2024-09-05T13:43:20Z

Are existing tests sufficient to cover this?

I will add explicit tests that control the type of cudart we are using to verify these chages

robertmaynard · 2024-09-06T11:57:56Z

Did more testing today and realize that more work is needed. With this PR we are getting all the symbols required at link time, but the dlsym with RTLD_DEFAULT is failing for a couple of the use cases. The tests still pass in CI since a libcudart.so exists on the machine.

Need more time to figure out what is going wrong.

vyasr · 2024-09-20T21:52:27Z

In what contexts is this failing? I assume it's a statically linked case, so the symbols cannot have been (dynamically) loaded with RTLD_LOCAL in a way that would obscure them from this scope.

robertmaynard · 2024-10-25T18:19:41Z

In what contexts is this failing? I assume it's a statically linked case, so the symbols cannot have been (dynamically) loaded with RTLD_LOCAL in a way that would obscure them from this scope.

That is correct it is the static linking use case the is failing. The tests have the textual entries for the symbol, but the dlsym returns null.

No longer needs RMM_STATIC_CUDART to be set for static cudart usages

include/rmm/detail/dynamic_load_runtime.hpp

tests/CMakeLists.txt

include/rmm/detail/dynamic_load_runtime.hpp

…_CUDART

bdice · 2024-10-31T14:21:26Z

This introduces a bit of complexity that we may be able to avoid. RMM declares its minimum supported CUDA version is 11.4 (this has been true since November 2023). We have required a minimum of at least 11.2, usually 11.4, everywhere I can think of for a long time. I think we can remove the shims for CUDA < 11.2 in this PR.

robertmaynard requested a review from a team as a code owner September 4, 2024 19:43

robertmaynard requested review from wence- and jrhemstad September 4, 2024 19:43

github-actions bot added the cpp Pertains to C++ code label Sep 4, 2024

robertmaynard requested a review from a team as a code owner September 5, 2024 13:42

github-actions bot added the CMake label Sep 5, 2024

robertmaynard force-pushed the remove_code_dependency_on_RMM_STATIC_CUDART branch 2 times, most recently from 326fd58 to 4866138 Compare September 5, 2024 14:34

harrism approved these changes Sep 6, 2024

View reviewed changes

robertmaynard added the 5 - DO NOT MERGE Hold off on merging; see PR for details label Sep 6, 2024

vyasr mentioned this pull request Sep 20, 2024

[FEA] rmm should stop prescribing cudart linking #1320

Open

robertmaynard changed the base branch from branch-24.10 to branch-24.12 October 28, 2024 18:48

robertmaynard added 3 commits October 28, 2024 15:45

rmm dynamic_load_runtime can now detect static linking of cudart.

38da65b

No longer needs RMM_STATIC_CUDART to be set for static cudart usages

Correct nvcc 12.4 compile errors with tests

8099966

Correct style issues found by CI

6dc3ee3

robertmaynard force-pushed the remove_code_dependency_on_RMM_STATIC_CUDART branch from 91e325d to 6dc3ee3 Compare October 28, 2024 19:46

robertmaynard added feature request New feature or request non-breaking Non-breaking change and removed 5 - DO NOT MERGE Hold off on merging; see PR for details labels Oct 28, 2024

wence- reviewed Oct 30, 2024

View reviewed changes

robertmaynard added 2 commits October 30, 2024 11:41

Update coding with changes from review

72987cf

Update coding with changes from review

7ff3c60

wence- reviewed Oct 30, 2024

View reviewed changes

include/rmm/detail/dynamic_load_runtime.hpp Outdated Show resolved Hide resolved

robertmaynard added 2 commits October 30, 2024 13:05

Correct incorrect function forward declare name

c303ec2

Move forward declares out of namespaces so the work properly

0c16395

robertmaynard added 3 commits October 30, 2024 13:35

Correct style issues found by CI

06f9817

Correct style issues found by CI

c3fd262

Merge branch 'branch-24.12' into remove_code_dependency_on_RMM_STATIC…

baf6257

…_CUDART

robertmaynard requested a review from a team as a code owner October 31, 2024 17:43

github-actions bot added the Python Related to RMM Python API label Oct 31, 2024

Since we require CUDART 11.2+ remove all conditional usages

22d2b7f

robertmaynard force-pushed the remove_code_dependency_on_RMM_STATIC_CUDART branch from 5f17e74 to 22d2b7f Compare October 31, 2024 19:43

robertmaynard added 3 commits October 31, 2024 16:00

Correct style issues found by CI

991653a

Correct python style issues found by CI

8531288

Update docs so that return types are documented

79ba908

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove code paths that depend on RMM_STATIC_CUDART #1667

Remove code paths that depend on RMM_STATIC_CUDART #1667

robertmaynard commented Sep 4, 2024 •

edited by harrism

Loading

harrism commented Sep 4, 2024

robertmaynard commented Sep 5, 2024

robertmaynard commented Sep 6, 2024

vyasr commented Sep 20, 2024

robertmaynard commented Oct 25, 2024

bdice commented Oct 31, 2024 •

edited

Loading

Remove code paths that depend on RMM_STATIC_CUDART #1667

Are you sure you want to change the base?

Remove code paths that depend on RMM_STATIC_CUDART #1667

Conversation

robertmaynard commented Sep 4, 2024 • edited by harrism Loading

Description

Checklist

harrism commented Sep 4, 2024

robertmaynard commented Sep 5, 2024

robertmaynard commented Sep 6, 2024

vyasr commented Sep 20, 2024

robertmaynard commented Oct 25, 2024

bdice commented Oct 31, 2024 • edited Loading

robertmaynard commented Sep 4, 2024 •

edited by harrism

Loading

bdice commented Oct 31, 2024 •

edited

Loading