Sequencing of Kernels / Graphs? #458
Unanswered
apowers313
asked this question in
Q&A
Replies: 3 comments 1 reply
-
Hey sorry for the stalled progress on that PR. If you have some bandwidth, I can guide you to have something working for your use case. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Thanks for the quick reply. Yea, that'd be great. |
Beta Was this translation helpful? Give feedback.
0 replies
-
If some can show me how PyCUDA does it's C++ bindings I think I can put together a PR for building out some graph functionality. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hey, just learning CUDA and I'm loving PyCUDA.
I have a number of small kernels that pass data to each other and I don't want to use the host / CPU to coordinate all the kernel launching and data passing because of the overhead. I think that graphs are the solution that I need, but it looks like that PR is still in progress.
I started looking at the CUDA Python Graph Management API, but CUDA Python is like writing in assembly and it's going to take forever to get that up and running. (Again, thanks for PyCUDA)
Is there some workaround for sequencing kernels and memory operations in PyCUDA? A way of calling CUDA C++ APIs directly or something?
Or is there some way having a kernel block on an event from another kernel, so that it can block until a kernel is done running?
Thanks for any insights. :)
Beta Was this translation helpful? Give feedback.
All reactions