Set is_distributed false by default in vllm #266

asesorov · 2024-09-20T08:56:37Z

Fixes #251 (vLLM AMD GPU parallelism timeout error).

It seems `is_distributed` was being set to True during vLLM parallel inference, which caused a deadlock and a memory leak. Adding a condition that prevents it from being set to True for vLLM resolved the issue.
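For readers skimming the thread, here is a minimal sketch of the kind of guard described above. It is not the code from this PR: the helper name `resolve_is_distributed` and its parameters `backend_name` and `process_group_initialized` are hypothetical, and it assumes the vLLM backend handles its own multi-GPU workers so that the surrounding harness does not need to treat the run as distributed.

```python
def resolve_is_distributed(backend_name: str, process_group_initialized: bool) -> bool:
    """Hypothetical helper: decide whether the harness treats a run as distributed.

    Assumption (not taken from the PR diff): vLLM coordinates its own
    parallel workers, so the outer harness should not layer its own
    distributed synchronisation on top of it.
    """
    # Extra condition: never flag vLLM runs as distributed.
    if backend_name == "vllm":
        return False
    # Other backends keep following the process-group state.
    return process_group_initialized


if __name__ == "__main__":
    # vLLM stays non-distributed even if a process group happens to exist.
    print(resolve_is_distributed("vllm", True))
    # Another backend still follows the process-group state.
    print(resolve_is_distributed("pytorch", True))
```

The point of the sketch is only the shape of the check: the vLLM case is excluded before any process-group state is consulted, so the flag defaults to False for that backend.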

Logs (2xMI210, Llama 3 8B): vllm_2gpus.log

IlyasMoutawwakil

Thanks!

Set is_distributed false by default in vllm

1ee0882

IlyasMoutawwakil approved these changes Sep 20, 2024


IlyasMoutawwakil closed this Sep 20, 2024

IlyasMoutawwakil reopened this Sep 20, 2024

IlyasMoutawwakil merged commit 99a90c9 into huggingface:main Sep 20, 2024
22 of 54 checks passed
