Error: RuntimeError: CUDA error: invalid device ordinal #3

Closed
opened 2024-06-15 18:53:08 +00:00 by gnh1201 · 1 comment
Owner
## Summary Error: RuntimeError: CUDA error: invalid device ordinal ## Related links * https://velog.io/@pswo1021/CUDA-ERROR-%ED%95%B4%EA%B2%B0 * https://heygeronimo.tistory.com/14 * https://discuss.huggingface.co/t/solved-accelerate-accelerator-cuda-error-invalid-device-ordinal/21509/6
gnh1201 changed title from RuntimeError: CUDA error: invalid device ordinal to Error: RuntimeError: CUDA error: invalid device ordinal 2024-06-15 18:53:25 +00:00
Author
Owner
  1. Set the environment variable" CUDA_DEVICE_ORDER=PCI_BUS_ID and CUDA_VISIBLE_DEVICES=0
  2. Add option when training: like accelerate launch --num_processes=$NUM_GPUS
1. Set the environment variable" `CUDA_DEVICE_ORDER=PCI_BUS_ID` and `CUDA_VISIBLE_DEVICES=0` 2. Add option when training: like `accelerate launch --num_processes=$NUM_GPUS`
Sign in to join this conversation.
No Label
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: gnh1201/Open-Llama#3
No description provided.