Merge 050f799875 into 9414a5141f

chore: Regenerate all playbooks
2026-06-24 15:19:30 +00:00 · 2026-04-07 11:39:24 -07:00 · 2026-04-07 04:13:30 +00:00 · 2026-04-06 19:32:24 +00:00
3 changed files with 5 additions and 5 deletions
--- a/nvidia/nemo-fine-tune/README.md
+++ b/nvidia/nemo-fine-tune/README.md
@@ -47,8 +47,8 @@ All necessary files for the playbook can be found [here on GitHub](https://githu
 * **Duration:** 45-90 minutes for complete setup and initial model fine-tuning
 * **Risks:** Model downloads can be large (several GB), ARM64 package compatibility issues may require troubleshooting, distributed training setup complexity increases with multi-node configurations
 * **Rollback:** Virtual environments can be completely removed; no system-level changes are made to the host system beyond package installations.
-* **Last Updated:** 01/15/2026
-  * Fix qLoRA fine-tuning workflow
+* **Last Updated:** 03/04/2026
+  * Recommend running Nemo finetune workflow via Docker

 ## Instructions

--- a/nvidia/txt2kg/assets/deploy/compose/docker-compose.yml
+++ b/nvidia/txt2kg/assets/deploy/compose/docker-compose.yml
@@ -27,8 +27,8 @@ services:
      # Ollama configuration
      - OLLAMA_BASE_URL=http://ollama:11434/v1
      - OLLAMA_MODEL=llama3.1:8b
-      # Disable vLLM
-      - VLLM_BASE_URL=http://localhost:8001/v1
+      # vLLM disabled in default Ollama mode
+      # - VLLM_BASE_URL=http://localhost:8001/v1
      - VLLM_MODEL=disabled
      # Vector DB configuration
      - QDRANT_URL=http://qdrant:6333
--- a/nvidia/txt2kg/assets/frontend/lib/text-processor.ts
+++ b/nvidia/txt2kg/assets/frontend/lib/text-processor.ts
@@ -108,7 +108,7 @@ export class TextProcessor {
    
    // Determine which LLM provider to use based on configuration
    // Priority: vLLM > NVIDIA > Ollama
-    if (process.env.VLLM_BASE_URL) {
+    if (process.env.VLLM_BASE_URL && process.env.VLLM_MODEL && process.env.VLLM_MODEL !== 'disabled') {
      this.selectedLLMProvider = 'vllm';
    } else if (process.env.NVIDIA_API_KEY) {
      this.selectedLLMProvider = 'nvidia';
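
The provider check changed in `text-processor.ts` can be read as a small pure function, which makes the effect of `VLLM_MODEL=disabled` easier to see. The following is a minimal sketch, not the repository's code: `selectLLMProvider`, `LLMProvider`, and `Env` are hypothetical names, and the final `'ollama'` fallback is an assumption based on the "Priority: vLLM > NVIDIA > Ollama" comment, since the hunk ends before that branch.

```typescript
// Hypothetical types and helper for illustration; not part of the repository.
type LLMProvider = 'vllm' | 'nvidia' | 'ollama';
type Env = Record<string, string | undefined>;

function selectLLMProvider(env: Env): LLMProvider {
  // vLLM counts as configured only when a base URL is set AND the model
  // is set to something other than the sentinel value 'disabled'.
  const vllmEnabled =
    Boolean(env.VLLM_BASE_URL) &&
    Boolean(env.VLLM_MODEL) &&
    env.VLLM_MODEL !== 'disabled';

  if (vllmEnabled) {
    return 'vllm';
  }
  if (env.NVIDIA_API_KEY) {
    return 'nvidia';
  }
  // Assumed fallback: the diff does not show this branch.
  return 'ollama';
}

// With the old check (VLLM_BASE_URL alone), this environment would have
// selected 'vllm' even though the model is marked as disabled.
const legacyComposeEnv: Env = {
  VLLM_BASE_URL: 'http://localhost:8001/v1',
  VLLM_MODEL: 'disabled',
};
console.log(selectLLMProvider(legacyComposeEnv));
```

Under these assumptions, the updated compose file (with `VLLM_BASE_URL` commented out and `VLLM_MODEL=disabled`) and the previous one both resolve to the same provider, so the stricter check and the compose change each guard against the other being reverted.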
Author	SHA1	Message	Date
Ramzey Ghanaim	1b35d6074f	Merge `050f799875` into `9414a5141f`	2026-04-07 11:39:24 -07:00
GitLab CI	9414a5141f	chore: Regenerate all playbooks	2026-04-07 04:13:30 +00:00
GitLab CI	911ca6db8b	chore: Regenerate all playbooks	2026-04-06 19:32:24 +00:00