From c1d144d4b44ac9eddb02344ca431c87100dfac0d Mon Sep 17 00:00:00 2001
From: GitLab CI
Date: Mon, 13 Oct 2025 15:43:09 +0000
Subject: [PATCH] chore: Regenerate all playbooks

---
 nvidia/trt-llm/README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/nvidia/trt-llm/README.md b/nvidia/trt-llm/README.md
index a17ed2c..d2de4ba 100644
--- a/nvidia/trt-llm/README.md
+++ b/nvidia/trt-llm/README.md
@@ -97,6 +97,7 @@ The following models are supported with TensorRT-LLM on Spark. All listed models
 | **Qwen3-8B** | NVFP4 | ✅ | `nvidia/Qwen3-8B-FP4` |
 | **Qwen3-14B** | FP8 | ✅ | `nvidia/Qwen3-14B-FP8` |
 | **Qwen3-14B** | NVFP4 | ✅ | `nvidia/Qwen3-14B-FP4` |
+| **Qwen3-32B** | NVFP4 | ✅ | `nvidia/Qwen3-32B-FP4` |
 | **Phi-4-multimodal-instruct** | FP8 | ✅ | `nvidia/Phi-4-multimodal-instruct-FP8` |
 | **Phi-4-multimodal-instruct** | NVFP4 | ✅ | `nvidia/Phi-4-multimodal-instruct-FP4` |
 | **Phi-4-reasoning-plus** | FP8 | ✅ | `nvidia/Phi-4-reasoning-plus-FP8` |