Merge a8f475d35f into 599cf838a0

Update SKILL.md
2026-06-22 14:19:30 +00:00 · 2026-05-02 09:03:18 +01:00 · 2026-05-02 09:03:14 +01:00
1 changed files with 1 additions and 1 deletions
--- a/skills/dgx-spark-vllm/SKILL.md
+++ b/skills/dgx-spark-vllm/SKILL.md
@ -10,7 +10,7 @@ description: Install and run vLLM for high-throughput LLM inference on NVIDIA DG

 vLLM is an inference engine designed to run large language models efficiently. The key idea is **maximizing throughput and minimizing memory waste** when serving LLMs.

- It uses a memory-efficient attention algoritm called **PagedAttention** to handle long sequences without running out of GPU memory.
+- It uses a memory-efficient attention algo called **PagedAttention** to handle long sequences without running out of GPU memory.
 - New requests can be added to a batch already in process through **continuous batching** to keep GPUs fully utilized.
 - It has an **OpenAI-compatible API** so applications built for the OpenAI API can switch to a vLLM backend with little or no modification.
Author	SHA1	Message	Date
Jason Kneen	652142252d	Merge `a8f475d35f` into `599cf838a0`	2026-05-02 09:03:18 +01:00
Jason Kneen	a8f475d35f	Update SKILL.md	2026-05-02 09:03:14 +01:00