Conversation

@windreamer
Collaborator

Motivation

NVIDIA has deprecated versioned wheel packages since CUDA 13, causing CUDA 13+ installations to fail on deprecated package names like nvidia-cublas-cu13.

Modification

Remove the unconditional return to allow the version check to execute.
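The shape of the fix can be sketched as follows. This is illustrative only: the function and package names below (`select_nvidia_deps`, the cublas wheels) are assumptions standing in for lmdeploy's actual code, not the real diff.

```python
def select_nvidia_deps(cuda_major: int) -> list[str]:
    """Pick NVIDIA runtime wheel names for the given CUDA major version."""
    # Before the fix, an unconditional return of the versioned name sat
    # here, so the branch below never executed and CUDA 13 installs
    # requested deprecated wheels such as nvidia-cublas-cu13.
    if cuda_major >= 13:
        # CUDA 13+ drops versioned wheel names; use the unversioned package.
        return ["nvidia-cublas"]
    return [f"nvidia-cublas-cu{cuda_major}"]
```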

@windreamer windreamer force-pushed the fix_nv_dep branch 5 times, most recently from 4ccb1e2 to f6e326c Compare November 13, 2025 07:01
@windreamer windreamer marked this pull request as ready for review November 13, 2025 07:52
@lvhan028
Collaborator

In the runtime_cuda.txt file, the version of torch is restricted to torch<=2.8.0 and >=2.0.0. However, CUDA 13 requires a minimum PyTorch version of 2.9.
We need to upgrade and test it.
cc @zhulinJulia24
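One illustrative way to split the constraint is per-CUDA requirements files. The file name `runtime_cuda13.txt` and the exact bounds below are assumptions for discussion, not the PR's actual change:

```
# runtime_cuda.txt (CUDA 12 and earlier), as currently pinned
torch<=2.8.0,>=2.0.0

# hypothetical runtime_cuda13.txt for CUDA 13 builds
torch>=2.9.0
```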

@lvhan028
Collaborator

Should we also upgrade triton?

@lvhan028
Collaborator

In the runtime_cuda.txt file, the version of torch is restricted to torch<=2.8.0 and >=2.0.0. However, CUDA 13 requires a minimum PyTorch version of 2.9. We need to upgrade and test it. cc @zhulinJulia24

Since flash-attention doesn't have a CUDA 13 build yet, we need to be more careful with the lmdeploy CUDA 13 release due to potential compatibility issues.

@windreamer
Collaborator Author

In the runtime_cuda.txt file, the version of torch is restricted to torch<=2.8.0 and >=2.0.0. However, CUDA 13 requires a minimum PyTorch version of 2.9. We need to upgrade and test it. cc @zhulinJulia24

Since flash-attention doesn't have a CUDA 13 build yet, we need to be more careful with the lmdeploy CUDA 13 release due to potential compatibility issues.

I think for now we can make our code CUDA 13 ready but not ship the CUDA 13 wheels and images until testing and the relevant dependencies are ready. Anyone who wants to use LMDeploy with CUDA 13 can build from source.

@lvhan028
Collaborator

lvhan028 commented Nov 14, 2025

I've built the docker image by

docker build . -f docker/Dockerfile -t openmmlab/lmdeploy:test-cu13 --build-arg CUDA_VERSION=cu13

Then, in the container, I tried serving a model using the turbomind backend but got a failure

>>> from lmdeploy import turbomind
/opt/py3/lib/python3.10/site-packages/torch/cuda/__init__.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
  import pynvml  # type: ignore[import]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/py3/lib/python3.10/site-packages/lmdeploy/turbomind/__init__.py", line 24, in <module>
    from .turbomind import TurboMind, update_parallel_config  # noqa: E402
  File "/opt/py3/lib/python3.10/site-packages/lmdeploy/turbomind/turbomind.py", line 35, in <module>
    import _turbomind as _tm  # noqa: E402
ImportError: libcublas.so.13: cannot open shared object file: No such file or directory

There is no "libcublas.so" in /usr/local/cuda

@lvhan028
Collaborator

The pytorch engine doesn't work either

@lvhan028
Collaborator

lvhan028 commented Nov 14, 2025

In the runtime_cuda.txt file, the version of torch is restricted to torch<=2.8.0 and >=2.0.0. However, CUDA 13 requires a minimum PyTorch version of 2.9. We need to upgrade and test it. cc @zhulinJulia24

Since flash-attention doesn't have a CUDA 13 build yet, we need to be more careful with the lmdeploy CUDA 13 release due to potential compatibility issues.

I think for now we can make our code CUDA 13 ready but not ship the CUDA 13 wheels and images until testing and the relevant dependencies are ready. Anyone who wants to use LMDeploy with CUDA 13 can build from source.

But neither inference engine works in a cu13 env, even if users build lmdeploy from source.

@lvhan028
Collaborator

I've built the docker image by

docker build . -f docker/Dockerfile -t openmmlab/lmdeploy:test-cu13 --build-arg CUDA_VERSION=cu13

Then, in the container, I tried serving a model using the turbomind backend but got a failure

>>> from lmdeploy import turbomind
/opt/py3/lib/python3.10/site-packages/torch/cuda/__init__.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
  import pynvml  # type: ignore[import]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/py3/lib/python3.10/site-packages/lmdeploy/turbomind/__init__.py", line 24, in <module>
    from .turbomind import TurboMind, update_parallel_config  # noqa: E402
  File "/opt/py3/lib/python3.10/site-packages/lmdeploy/turbomind/turbomind.py", line 35, in <module>
    import _turbomind as _tm  # noqa: E402
ImportError: libcublas.so.13: cannot open shared object file: No such file or directory

There is no "libcublas.so" in /usr/local/cuda

After setting export LD_LIBRARY_PATH=/opt/py3/lib/python3.10/site-packages/nvidia/cu13/lib/:$LD_LIBRARY_PATH, the turbomind engine works
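For anyone reproducing this, the workaround can be written so it also handles an unset LD_LIBRARY_PATH (the path is the one reported in this thread; the exact site-packages prefix depends on the image's Python version):

```shell
# Prepend the nvidia cu13 lib dir; ${VAR:+:$VAR} appends the old value
# only if it was set, avoiding a trailing/leading empty path entry.
export LD_LIBRARY_PATH=/opt/py3/lib/python3.10/site-packages/nvidia/cu13/lib/${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}
```

In a Dockerfile the same path could be baked in with an ENV instruction instead of an export.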

@lvhan028
Collaborator

The pytorch engine doesn't work either

After upgrading triton to its latest version, the pytorch engine works too.

I agree we should defer the release until verification is complete. In the meantime, I recommend adding the LD_LIBRARY_PATH configuration to this PR to ensure at least one engine is functional.

@windreamer windreamer marked this pull request as draft November 14, 2025 10:34
@windreamer windreamer marked this pull request as ready for review November 17, 2025 04:22
@windreamer windreamer requested a review from lvhan028 November 17, 2025 08:55
Successfully merging this pull request may close these issues.

[Bug] 'nvidia-cublas-cu13' is deprecated causes the installation of lmdeploy from source using uv to fail.

2 participants