Compare commits

...

2 Commits

Author SHA1 Message Date
Ago
b9f41c4d73
Merge f75d5817aa into cfbe0f9631 2026-04-01 22:54:08 +00:00
agolajko
f75d5817aa nemotron reqs --trust-remote-code for vllm setup 2026-01-26 08:16:23 -08:00

View File

@ -77,7 +77,7 @@ The following models are supported with vLLM on Spark. All listed models are ava
| **Nemotron3-Nano** | FP8 | ✅ | [`nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8`](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8) |
> [!NOTE]
> The Phi-4-multimodal-instruct models require `--trust-remote-code` when launching vLLM.
> The Phi-4-multimodal-instruct and Nemotron3-Nano models require `--trust-remote-code` when launching vLLM.
> [!NOTE]
> You can use the NVFP4 Quantization documentation to generate your own NVFP4-quantized checkpoints for your favorite models. This enables you to take advantage of the performance and memory benefits of NVFP4 quantization even for models not already published by NVIDIA.