I have an 8xA100 GPU machine. I noticed that one Mixtral instance requires at least 2 GPUs. However, when I attempt to run two Mixtral instances on the machine, each allocated 2 GPUs, the second one hangs at "Started a local Ray instance." Additionally, the terminal reports "fork: retry: Resource temporarily unavailable". It appears that the two Ray instances are causing a conflict. Is there any solution to resolve this issue?
Thanks for your help. I have already specified CUDA_VISIBLE_DEVICES for each instance. Besides, running multiple vLLM processes works when each model needs only one GPU, in which case Ray is not used. So I believe the issue is in the Ray setup.
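For anyone reproducing this, the setup described above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not a confirmed fix: the model id, the ports, and the helper names (`instance_env`, `instance_command`, `process_limit`) are illustrative and do not come from the original report, and the script only builds the launch specs rather than starting anything. It also prints the per-user process limit, because the shell prints "fork: retry: Resource temporarily unavailable" when fork() fails with EAGAIN, which commonly means a process or thread limit has been reached. Whether that is the cause here is an assumption.

```python
import os
import resource  # Unix-only standard library module


def instance_env(gpu_ids, base_env=None):
    """Return a copy of the environment that exposes only the given GPUs."""
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_VISIBLE_DEVICES"] = ",".join(str(g) for g in gpu_ids)
    return env


def instance_command(model, port, tensor_parallel_size):
    """Build the argv for one vLLM API server instance."""
    return [
        "python", "-m", "vllm.entrypoints.api_server",
        "--model", model,
        "--tensor-parallel-size", str(tensor_parallel_size),
        "--port", str(port),
    ]


def process_limit():
    """Return the (soft, hard) per-user process limit, as `ulimit -u` reports."""
    return resource.getrlimit(resource.RLIMIT_NPROC)


if __name__ == "__main__":
    # Assumption: placeholder model id; the report only says "Mixtral".
    model = "mistralai/Mixtral-8x7B-Instruct-v0.1"
    # Two instances on disjoint GPU pairs and distinct ports (ports are assumed).
    specs = [((0, 1), 8000), ((2, 3), 8001)]
    for gpus, port in specs:
        env = instance_env(gpus)
        cmd = instance_command(model, port, tensor_parallel_size=len(gpus))
        # To actually launch: subprocess.Popen(cmd, env=env)
        print(env["CUDA_VISIBLE_DEVICES"], " ".join(cmd))

    soft, hard = process_limit()
    print("process limit (soft, hard):", soft, hard)
```

If the soft limit printed at the end is low relative to the number of worker processes that two Ray instances start on a machine with many CPU cores, raising it (for example with `ulimit -u` in the launching shell) may be worth trying before digging further into Ray itself.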