From c2414e48e6ffc82fe6319c6c7c5453f72399770b Mon Sep 17 00:00:00 2001
From: Ev Lacey
Date: Tue, 2 Dec 2025 16:34:35 -0800
Subject: [PATCH] Clarify cluster status verification steps

Updated instructions for verifying cluster status and finding the vLLM container name.
---
 nvidia/vllm/README.md | 10 ++++++----
 1 file changed, 6 insertions(+), 4 deletions(-)

diff --git a/nvidia/vllm/README.md b/nvidia/vllm/README.md
index 390d2d6..658519f 100644
--- a/nvidia/vllm/README.md
+++ b/nvidia/vllm/README.md
@@ -213,12 +213,14 @@ bash run_cluster.sh $VLLM_IMAGE $HEAD_NODE_IP --worker ~/.cache/huggingface \
 > **Note:** Replace `` with the actual IP address from Node 1. If using automatic link-local addressing, this will be something like `169.254.x.x`. If using manual static IPs, it will be `192.168.100.10`.
 
 ## Step 6. Verify cluster status
-
 Confirm both nodes are recognized and available in the Ray cluster.
-
 ```bash
-## On Node 1 (head node)
-docker exec node ray status
+# On Node 1 (head node)
+# Find the vLLM container name (it will be node-)
+export VLLM_CONTAINER=$(docker ps --format '{{.Names}}' | grep -E '^node-[0-9]+$')
+echo "Found container: $VLLM_CONTAINER"
+
+docker exec $VLLM_CONTAINER ray status
 ```
 
 Expected output shows 2 nodes with available GPU resources.