Clarify cluster status verification steps

Updated instructions for verifying cluster status and finding the vLLM container name.
This commit is contained in:
Ev Lacey 2025-12-02 16:34:35 -08:00 committed by GitHub
parent 3b53b7aeca
commit c2414e48e6
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -213,12 +213,14 @@ bash run_cluster.sh $VLLM_IMAGE $HEAD_NODE_IP --worker ~/.cache/huggingface \
> **Note:** Replace `<NODE_1_IP_ADDRESS>` with the actual IP address from Node 1. If using automatic link-local addressing, this will be something like `169.254.x.x`. If using manual static IPs, it will be `192.168.100.10`.
## Step 6. Verify cluster status
Confirm both nodes are recognized and available in the Ray cluster.
```bash
## On Node 1 (head node)
docker exec node ray status
# On Node 1 (head node)
# Find the vLLM container name (it will be node-<random_number>)
export VLLM_CONTAINER=$(docker ps --format '{{.Names}}' | grep -E '^node-[0-9]+$')
echo "Found container: $VLLM_CONTAINER"
docker exec $VLLM_CONTAINER ray status
```
Expected output shows 2 nodes with available GPU resources.