mirror of
https://github.com/NVIDIA/dgx-spark-playbooks.git
synced 2026-04-22 01:53:53 +00:00
chore: Regenerate all playbooks
This commit is contained in:
parent
51a5e4de2b
commit
5228253a7d
@ -235,9 +235,9 @@ Expected output shows 2 nodes with available GPU resources.
|
||||
Authenticate with Hugging Face and download the recommended production-ready model.
|
||||
|
||||
```bash
|
||||
## On Node 1, authenticate and download
|
||||
huggingface-cli login
|
||||
huggingface-cli download meta-llama/Llama-3.3-70B-Instruct
|
||||
## From within the same container where `ray serve` ran, run the following
|
||||
hf auth login
|
||||
hf download meta-llama/Llama-3.3-70B-Instruct
|
||||
```
|
||||
|
||||
## Step 8. Launch inference server for Llama 3.3 70B
|
||||
|
||||
Loading…
Reference in New Issue
Block a user