chore: Regenerate all playbooks

2026-06-18 04:22:21 +00:00 · 2025-12-18 04:06:55 +00:00 · 5228253a7d
commit 5228253a7d
parent 51a5e4de2b
1 changed file with 3 additions and 3 deletions
--- a/nvidia/vllm/README.md
+++ b/nvidia/vllm/README.md
@@ -235,9 +235,9 @@ Expected output shows 2 nodes with available GPU resources.
 Authenticate with Hugging Face and download the recommended production-ready model.

 ```bash
-## On Node 1, authenticate and download
-huggingface-cli login
-huggingface-cli download meta-llama/Llama-3.3-70B-Instruct
+## From within the same container where `ray serve` ran, run the following
+hf auth login
+hf download meta-llama/Llama-3.3-70B-Instruct
 ```

 ## Step 8. Launch inference server for Llama 3.3 70B