mirror of
https://github.com/NVIDIA/dgx-spark-playbooks.git
synced 2026-04-22 01:53:53 +00:00
chore: Regenerate all playbooks
This commit is contained in:
parent
b754b14ff7
commit
35995f8384
@ -65,6 +65,8 @@ df -h .
|
||||
* Quantization process is memory-intensive and may fail on systems with insufficient GPU memory
|
||||
* Output files are large (several GB) and require adequate storage space
|
||||
* **Rollback**: Remove the output directory and any pulled Docker images to restore original state.
|
||||
* **Last Updated**: 12/05/2025
|
||||
* Fix broken client CURL request in Step 8
|
||||
|
||||
## Instructions
|
||||
|
||||
@ -213,7 +215,7 @@ curl -X POST http://localhost:8000/v1/chat/completions \
|
||||
-H "Content-Type: application/json" \
|
||||
-d '{
|
||||
"model": "deepseek-ai/DeepSeek-R1-Distill-Llama-8B",
|
||||
"prompt": "What is artificial intelligence?",
|
||||
"messages": [{"role": "user", "content": "What is artificial intelligence?"}],
|
||||
"max_tokens": 100,
|
||||
"temperature": 0.7,
|
||||
"stream": false
|
||||
|
||||
Loading…
Reference in New Issue
Block a user