未验证 提交 d414678d 编写于 作者: J john li 提交者: GitHub

Small tweak on cuda version mismatch documentation (#3706)

* Small tweak on cuda version mismatch documentation

* clarify minor versions should also match

---------
Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com>
上级 fb2b4ab1
...@@ -155,15 +155,20 @@ If you're getting the following error: ...@@ -155,15 +155,20 @@ If you're getting the following error:
Exception: >- DeepSpeed Op Builder: Installed CUDA version {VERSION} does not match the version torch was compiled with {VERSION}, unable to compile cuda/cpp extensions without a matching cuda version. Exception: >- DeepSpeed Op Builder: Installed CUDA version {VERSION} does not match the version torch was compiled with {VERSION}, unable to compile cuda/cpp extensions without a matching cuda version.
``` ```
You have a misaligned version of CUDA installed compared to the version of CUDA You have a misaligned version of CUDA installed compared to the version of CUDA
used to compile torch. We only require that major version match (e.g., 11.1 and used to compile torch. A mismatch in the major version is likely to result in
11.8 are OK). However a mismatch in the major version may result in unexpected errors or unexpected behavior.
behavior and errors.
The easiest fix for this error is changing the CUDA version installed (check The easiest fix for this error is changing the CUDA version installed (check
with `nvcc --version`) or updating the torch version to match the installed with `nvcc --version`) or updating the torch version to match the installed
CUDA version (check with `python3 -c "import torch; print(torch.__version__)"`). CUDA version (check with `python3 -c "import torch; print(torch.__version__)"`).
If you want to skip this check and proceed with the mismatched CUDA versions, use the following environment variable: We only require that the major version matches (e.g., 11.1 and 11.8). However,
note that even a mismatch in the minor version _may still_ result in unexpected
behavior and errors, so it's recommended to match both major and minor versions.
When there's a minor version mismatch, DeepSpeed will log a warning.
If you want to skip this check and proceed with the mismatched CUDA versions,
use the following environment variable, but beware of unexpected behavior:
```bash ```bash
DS_SKIP_CUDA_CHECK=1 DS_SKIP_CUDA_CHECK=1
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册