chore: Regenerate all playbooks

2026-06-22 06:09:31 +00:00 · 2025-12-18 04:06:55 +00:00 · 2025-12-18 04:06:55 +00:00 · 5228253a7d
commit 5228253a7d
parent 51a5e4de2b
1 changed files with 3 additions and 3 deletions
--- a/nvidia/vllm/README.md
+++ b/nvidia/vllm/README.md
@ -235,9 +235,9 @@ Expected output shows 2 nodes with available GPU resources.
 Authenticate with Hugging Face and download the recommended production-ready model.
 ```bash
-## On Node 1, authenticate and download
+## From within the same container where `ray serve` ran, run the following
-huggingface-cli login
+hf auth login
-huggingface-cli download meta-llama/Llama-3.3-70B-Instruct
+hf download meta-llama/Llama-3.3-70B-Instruct
 ```
 ## Step 8. Launch inference server for Llama 3.3 70B