Mirror of https://github.com/NVIDIA/dgx-spark-playbooks.git (synced 2026-04-25 03:13:53 +00:00)
chore: Regenerate all playbooks
Parent: b754b14ff7
Commit: 35995f8384
@@ -65,6 +65,8 @@ df -h .
 * Quantization process is memory-intensive and may fail on systems with insufficient GPU memory
 * Output files are large (several GB) and require adequate storage space
 * **Rollback**: Remove the output directory and any pulled Docker images to restore original state.
+* **Last Updated**: 12/05/2025
+* Fix broken client CURL request in Step 8
 
 ## Instructions
 
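The storage note in the hunk above (output files of "several GB", next to the `df -h .` check) can be turned into a small pre-flight test. The following is a minimal sketch, not part of the playbook: the helper name `has_free_space` and the 10 GB default threshold are assumptions chosen for illustration, since the source only says "several GB".

```python
import shutil


def has_free_space(path=".", required_gb=10.0):
    """Return (ok, free_gb) for the filesystem that contains `path`.

    `required_gb` is a hypothetical threshold; the playbook only states
    that the output files are "several GB".
    """
    usage = shutil.disk_usage(path)      # named tuple: total, used, free (bytes)
    free_gb = usage.free / 1024 ** 3     # convert bytes to GiB
    return free_gb >= required_gb, free_gb


if __name__ == "__main__":
    ok, free_gb = has_free_space(".")
    status = "enough" if ok else "NOT enough"
    print(f"{free_gb:.1f} GiB free in the current directory: {status} for the assumed threshold")
```

Running this in the intended output directory before starting gives roughly the same information as `df -h .`, reduced to a pass/fail answer.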
@@ -213,7 +215,7 @@ curl -X POST http://localhost:8000/v1/chat/completions \
 -H "Content-Type: application/json" \
 -d '{
 "model": "deepseek-ai/DeepSeek-R1-Distill-Llama-8B",
-"prompt": "What is artificial intelligence?",
+"messages": [{"role": "user", "content": "What is artificial intelligence?"}],
 "max_tokens": 100,
 "temperature": 0.7,
 "stream": false
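The fix in this hunk swaps the `"prompt"` field for a `"messages"` list, which is the shape that OpenAI-style `/v1/chat/completions` endpoints expect. As a rough illustration, the corrected request body can be built in Python with only the standard library. The helper names `build_chat_payload` and `build_request` are hypothetical, and the URL is copied from the hunk header. This is a sketch that assumes a server is listening at that address, so the actual send is left commented out.

```python
import json
import urllib.request

# Endpoint taken from the curl command in the hunk header.
URL = "http://localhost:8000/v1/chat/completions"


def build_chat_payload(question, model="deepseek-ai/DeepSeek-R1-Distill-Llama-8B"):
    """Build a chat-completions body that uses "messages" instead of "prompt"."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": question}],
        "max_tokens": 100,
        "temperature": 0.7,
        "stream": False,
    }


def build_request(payload, url=URL):
    """Wrap the payload in a POST request with a JSON content type."""
    data = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    payload = build_chat_payload("What is artificial intelligence?")
    print(json.dumps(payload, indent=2))
    # Sending needs a running server at URL, so it is left commented out:
    # with urllib.request.urlopen(build_request(payload)) as resp:
    #     print(resp.read().decode("utf-8"))
```

Printing the payload first makes it easy to compare against the `-d` body of the curl command before sending anything.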