mirror of
https://github.com/NVIDIA/dgx-spark-playbooks.git
synced 2026-04-24 10:53:52 +00:00
chore: Regenerate all playbooks
This commit is contained in:
parent
51a5e4de2b
commit
5228253a7d
@ -235,9 +235,9 @@ Expected output shows 2 nodes with available GPU resources.
|
|||||||
Authenticate with Hugging Face and download the recommended production-ready model.
|
Authenticate with Hugging Face and download the recommended production-ready model.
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
## On Node 1, authenticate and download
|
## From within the same container where `ray serve` ran, run the following
|
||||||
huggingface-cli login
|
hf auth login
|
||||||
huggingface-cli download meta-llama/Llama-3.3-70B-Instruct
|
hf download meta-llama/Llama-3.3-70B-Instruct
|
||||||
```
|
```
|
||||||
|
|
||||||
## Step 8. Launch inference server for Llama 3.3 70B
|
## Step 8. Launch inference server for Llama 3.3 70B
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user