Merge f75d5817aa into 3ba4d58f1e

2026-04-22 01:53:53 +00:00 · 2026-04-19 18:07:12 +09:00 · 2026-04-19 18:07:12 +09:00 · a48b522d43
commit a48b522d43
parent 3ba4d58f1e f75d5817aa
1 changed files with 1 additions and 1 deletions
--- a/nvidia/vllm/README.md
+++ b/nvidia/vllm/README.md
@ -82,7 +82,7 @@ The following models are supported with vLLM on Spark. All listed models are ava
 | **Nemotron3-Nano** | FP8 | ✅ | [`nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8`](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8) |

 > [!NOTE]
-> The Phi-4-multimodal-instruct models require `--trust-remote-code` when launching vLLM.
+> The Phi-4-multimodal-instruct and Nemotron3-Nano models require `--trust-remote-code` when launching vLLM.

 > [!NOTE]
 > You can use the NVFP4 Quantization documentation to generate your own NVFP4-quantized checkpoints for your favorite models. This enables you to take advantage of the performance and memory benefits of NVFP4 quantization even for models not already published by NVIDIA.