We have these for CUDA, but we should also have these for ROCm. The APIs are likely nearly identical, so this should be easy.
We have these for CUDA, but we should also have these for ROCm. The APIs are likely nearly identical, so this should be easy.