diff --git a/nvidia/trt-llm/README.md b/nvidia/trt-llm/README.md index d7c9002..b8908a2 100644 --- a/nvidia/trt-llm/README.md +++ b/nvidia/trt-llm/README.md @@ -12,9 +12,7 @@ - [Step 4. Validate TensorRT-LLM installation](#step-4-validate-tensorrt-llm-installation) - [Step 5. Create cache directory](#step-5-create-cache-directory) - [Step 6. Validate setup with quickstart_advanced](#step-6-validate-setup-with-quickstartadvanced) - - [LLM quickstart example](#llm-quickstart-example) - [Step 7. Validate setup with quickstart_multimodal](#step-7-validate-setup-with-quickstartmultimodal) - - [VLM quickstart example](#vlm-quickstart-example) - [Step 8. Serve LLM with OpenAI-compatible API](#step-8-serve-llm-with-openai-compatible-api) - [Step 9. Troubleshooting](#step-9-troubleshooting) - [Step 10. Cleanup and rollback](#step-10-cleanup-and-rollback) @@ -39,6 +37,15 @@ ## Overview +## Basic idea + +**NVIDIA TensorRT-LLM (TRT-LLM)** is an open-source library for optimizing and accelerating large language model (LLM) inference on NVIDIA GPUs. + +It provides highly efficient kernels, memory management, and parallelism strategies—like tensor, pipeline, and sequence parallelism—so developers can serve LLMs with lower latency and higher throughput. + +TRT-LLM integrates with frameworks like Hugging Face and PyTorch, making it easier to deploy state-of-the-art models at scale. + + ## What you'll accomplish You'll set up TensorRT-LLM to optimize and deploy large language models on NVIDIA Spark with @@ -89,13 +96,17 @@ The following models are supported with TensorRT-LLM on Spark. All listed models | **Llama-4-Scout-17B-16E-Instruct** | NVFP4 | ✅ | `nvidia/Llama-4-Scout-17B-16E-Instruct-FP4` | | **Qwen3-235B-A22B (two Sparks only)** | NVFP4 | ✅ | `nvidia/Qwen3-235B-A22B-FP4` | -**Note:** You can use the NVFP4 Quantization documentation to generate your own NVFP4-quantized checkpoints for your favorite models. 
This enables you to take advantage of the performance and memory benefits of NVFP4 quantization even for models not already published by NVIDIA. Note: Not all model architectures are supported for NVFP4 quantization. +**Note:** You can use the NVFP4 Quantization documentation to generate your own NVFP4-quantized checkpoints for your favorite models. This enables you to take advantage of the performance and memory benefits of NVFP4 quantization even for models not already published by NVIDIA. + +Reminder: not all model architectures are supported for NVFP4 quantization. ## Time & risk **Duration**: 45-60 minutes for setup and API server deployment + **Risk level**: Medium - container pulls and model downloads may fail due to network issues -**Rollback**: Stop inference servers and remove downloaded models to free resources + +**Rollback**: Stop inference servers and remove downloaded models to free resources. ## Single Spark @@ -170,7 +181,7 @@ mkdir -p $HOME/.cache/huggingface/ This quickstart validates your TensorRT-LLM setup end-to-end by testing model loading, inference engine initialization, and GPU execution with real text generation. It's the fastest way to confirm everything works before starting the inference API server. -### LLM quickstart example +**LLM quickstart example** #### Llama 3.1 8B Instruct ```bash @@ -241,7 +252,7 @@ docker run \ ``` ### Step 7. Validate setup with quickstart_multimodal -### VLM quickstart example +**VLM quickstart example** This demonstrates vision-language model capabilities by running inference with image understanding. The example uses multimodal inputs to validate both text and vision processing pipelines. @@ -405,9 +416,7 @@ docker rmi nvcr.io/nvidia/tensorrt-llm/release:spark-single-gpu-dev ### Step 1. 
Review Spark clustering documentation -Go to the official DGX Spark clustering documentation to understand the networking requirements and setup procedures: - -[DGX Spark Clustering Documentation](https://docs.nvidia.com/dgx/dgx-spark/spark-clustering.html) +Go to the official DGX Spark clustering documentation to understand the networking requirements and setup procedures:[DGX Spark Clustering Documentation](https://docs.nvidia.com/dgx/dgx-spark/spark-clustering.html) Review the networking configuration options and choose the appropriate setup method for your environment. diff --git a/nvidia/txt2kg/assets/.cursor/rules/nextjs.mdc b/nvidia/txt2kg/assets/.cursor/rules/nextjs.mdc new file mode 100644 index 0000000..53dc982 --- /dev/null +++ b/nvidia/txt2kg/assets/.cursor/rules/nextjs.mdc @@ -0,0 +1,5 @@ +--- +Use pnpm as the main package manager +description: nextjs projects +alwaysApply: false +--- diff --git a/nvidia/txt2kg/assets/.dockerignore b/nvidia/txt2kg/assets/.dockerignore new file mode 100644 index 0000000..e3638a5 --- /dev/null +++ b/nvidia/txt2kg/assets/.dockerignore @@ -0,0 +1,3 @@ +node_modules +.next +.git \ No newline at end of file diff --git a/nvidia/txt2kg/assets/.gitignore b/nvidia/txt2kg/assets/.gitignore new file mode 100644 index 0000000..99ca64e --- /dev/null +++ b/nvidia/txt2kg/assets/.gitignore @@ -0,0 +1,59 @@ +# See https://help.github.com/articles/ignoring-files/ for more about ignoring files. 
+ +# dependencies +/node_modules + +# next.js +/.next/ +/out/ + +# production +/build + +# debug +npm-debug.log* +yarn-debug.log* +yarn-error.log* +.pnpm-debug.log* + +# env files +.env* + +# vercel +.vercel + +# typescript +*.tsbuildinfo +next-env.d.ts +volumes/ +frontend/data/ +scripts/*.pt +frontend/node_modules +frontend/.next +frontend/.env +frontend/.env.local +frontend/pnpm-lock.yaml +frontend/pnpm-workspace.yaml/volumes/etcd/ +/frontend/node_modules/ +/frontend/.next/ +/node_modules/ +/volumes/etcd/ +.DS_Store +*.log +/volumes/ +video-demo.md + +.trae/ +.vscode/ + +biorxiv_creative_commons/ + +# benchmark results +benchmark_results/ + +# Python +__pycache__/ +*.py[cod] +*$py.class +*.so +.Python diff --git a/nvidia/txt2kg/assets/LICENSE b/nvidia/txt2kg/assets/LICENSE new file mode 100644 index 0000000..303c30b --- /dev/null +++ b/nvidia/txt2kg/assets/LICENSE @@ -0,0 +1,21 @@ +MIT License + +Copyright (c) 2024 NVIDIA Corporation + +Permission is hereby granted, free of charge, to any person obtaining a copy +of this software and associated documentation files (the "Software"), to deal +in the Software without restriction, including without limitation the rights +to use, copy, modify, merge, publish, distribute, sublicense, and/or sell +copies of the Software, and to permit persons to whom the Software is +furnished to do so, subject to the following conditions: + +The above copyright notice and this permission notice shall be included in all +copies or substantial portions of the Software. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, +OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE +SOFTWARE. 
\ No newline at end of file diff --git a/nvidia/txt2kg/assets/README.md b/nvidia/txt2kg/assets/README.md new file mode 100644 index 0000000..dbec322 --- /dev/null +++ b/nvidia/txt2kg/assets/README.md @@ -0,0 +1,307 @@ +# NVIDIA txt2kg + +Use the following documentation to learn about NVIDIA txt2kg. +- [Overview](#overview) +- [Key Features](#key-features) +- [Target Audience](#target-audience) +- [Software Components](#software-components) +- [Technical Diagram](#technical-diagram) +- [GPU-Accelerated Visualization](#gpu-accelerated-visualization) +- [Minimum System Requirements](#minimum-system-requirements) + - [OS Requirements](#os-requirements) + - [Deployment Options](#deployment-options) + - [Driver Versions](#driver-versions) + - [Hardware Requirements](#hardware-requirements) +- [Next Steps](#next-steps) +- [Deployment Guide](#deployment-guide) + - [Standard Deployment](#standard-deployment) + - [PyGraphistry GPU-Accelerated Deployment](#pygraphistry-gpu-accelerated-deployment) +- [Available Customizations](#available-customizations) +- [License](#license) + +## Overview + +This blueprint serves as a reference solution for knowledge graph extraction and querying with Retrieval Augmented Generation (RAG). This txt2kg blueprint extracts knowledge triples from text and constructs a knowledge graph for visualization and querying, creating a more structured form of information retrieval compared to traditional RAG approaches. By leveraging graph databases and entity relationships, this blueprint delivers more contextually rich answers that better represent complex relationships in your data. + +By default, this blueprint leverages **Ollama** for local LLM inference, providing a fully self-contained solution that runs entirely on your own hardware. You can optionally use NVIDIA-hosted models available in the [NVIDIA API Catalog](https://build.nvidia.com) or vLLM for advanced GPU-accelerated inference. 
+ +## Key Features + +![Screenshot](/frontend/public/txt2kg.png) + +[Watch the demo video](https://drive.google.com/file/d/1a0VG67zx_pGT4WyPTPH2ynefhfy2I0Py/view?usp=sharing) + +- Knowledge triple extraction from text documents +- Knowledge graph construction and visualization +- **Local-first architecture** with Ollama for LLM inference +- Graph-based RAG for more contextual answers +- Graph database integration with ArangoDB +- Local vector embeddings with Pinecone-compatible storage +- GPU-accelerated LLM inference with Ollama and optional vLLM +- Sentence Transformers for efficient embedding generation +- Interactive knowledge graph visualization with Three.js WebGPU +- Optional NVIDIA API integration for cloud-based models +- Fully containerized deployment with Docker Compose +- Decomposable and customizable + +## Target Audience + +This blueprint is for: + +- **Developers**: Developers who want to quickly set up a local-first Graph-based RAG solution +- **Data Scientists**: Data scientists who want to extract structured knowledge from unstructured text +- **Enterprise Architects**: Architects seeking to combine knowledge graph and RAG solutions for their organization +- **Privacy-Conscious Users**: Organizations requiring fully local, air-gapped deployments +- **GPU Researchers**: Researchers wanting to leverage GPU acceleration for LLM inference and graph visualization + +## Software Components + +The following are the default components included in this blueprint: + +* **LLM Inference** + * **Ollama** (default): Local LLM inference with GPU acceleration + * Default model: `llama3.1:8b` + * Supports any Ollama-compatible model + * **vLLM** (optional): Advanced GPU-accelerated inference with quantization + * Default model: `meta-llama/Llama-3.2-3B-Instruct` + * **NVIDIA API** (optional): Cloud-based models via NVIDIA API Catalog +* **Vector Database & Embedding** + * **SentenceTransformer**: Local embedding generation + * Model: `all-MiniLM-L6-v2` + * 
**Pinecone (Local)**: Self-hosted vector storage and similarity search + * No cloud API key required + * Compatible with Pinecone client libraries +* **Knowledge Graph Database** + * **ArangoDB**: Graph database for storing knowledge triples (entities and relationships) + * Web interface on port 8529 + * No authentication required (configurable) +* **Graph Visualization** + * **Three.js WebGPU**: Client-side GPU-accelerated graph rendering + * Optional remote WebGPU clustering for large graphs +* **Frontend & API** + * **Next.js**: Modern React framework with API routes + +## Technical Diagram + +The architecture follows this workflow: +1. User uploads documents through the txt2kg web UI +2. Documents are processed and chunked for analysis +3. **Ollama** extracts knowledge triples (subject-predicate-object) from the text using local LLM inference +4. Triples are stored in **ArangoDB** graph database +5. **SentenceTransformer** generates entity embeddings +6. Embeddings are stored in local **Pinecone** vector database +7. User queries are processed through graph-based RAG: + - KNN search identifies relevant entities in the vector database + - Graph traversal enhances context with entity relationships from ArangoDB + - Ollama generates responses using the enriched context +8. 
Results are visualized with **Three.js WebGPU** rendering in the browser + +## GPU-Accelerated LLM Inference + +This blueprint includes **GPU-accelerated LLM inference** with Ollama: + +### Ollama Features +- **Fully local inference**: No cloud dependencies or API keys required +- **GPU acceleration**: Automatic CUDA support with NVIDIA GPUs +- **Multiple model support**: Use any Ollama-compatible model +- **Optimized performance**: Flash attention, KV cache optimization, and quantization +- **Easy model management**: Pull and switch models with simple commands +- **Privacy-first**: All data processing happens on your hardware + +### Default Configuration +- Model: `llama3.1:8b` +- GPU memory fraction: 0.9 (90% of available VRAM) +- Flash attention enabled +- Q8_0 KV cache for memory efficiency + +### Using Different Models +```bash +# Pull a different model +docker exec ollama-compose ollama pull llama3.1:70b + +# Update environment variable in docker-compose.yml +OLLAMA_MODEL=llama3.1:70b +``` + +## Minimum System Requirements + +### OS Requirements + +Ubuntu 22.04 or later + +### Deployment Options + +- [Standard Docker Compose](./deploy/compose/docker-compose.yml) (Default - Ollama + ArangoDB + Pinecone) +- [vLLM Docker Compose](./deploy/compose/docker-compose.vllm.yml) (Advanced - vLLM for FP8 and NVFP4 quantization) +- [Complete Docker Compose](./deploy/compose/docker-compose.complete.yml) (Full stack with MinIO S3) + +### Driver Versions + +- GPU Driver - 530.30.02+ +- CUDA version - 12.0+ +### Hardware Requirements + +- **For Ollama LLM inference**: + - NVIDIA GPU with CUDA support (GTX 1060 or newer, RTX series recommended) + - VRAM requirements depend on model size: + - 8B models: 6-8GB VRAM + - 70B models: 48GB+ VRAM (or use quantized versions) + - System RAM: 16GB+ recommended +- **For vLLM (optional)**: + - NVIDIA GPU with Ampere architecture or newer (RTX 30xx+, A100, H100) + - Support for FP8 quantization for optimal performance + - Similar VRAM 
requirements as Ollama + +## Next Steps + +- Clone the repository +- Install Docker and NVIDIA Container Toolkit +- Deploy with Docker Compose (no API keys required!) +- Pull your preferred Ollama model +- Upload documents and explore the knowledge graph +- Customize for your specific use case + +## Deployment Guide + +### Environment Variables + +**No API keys required for default deployment!** All services run locally. + +The default configuration uses: +- Local Ollama (no API key needed) +- Local Pinecone (no API key needed) +- Local ArangoDB (no authentication by default) +- Local SentenceTransformer embeddings + +#### Optional Environment Variables + +```bash +# Ollama configuration (optional - defaults are set) +OLLAMA_BASE_URL=http://ollama:11434/v1 +OLLAMA_MODEL=llama3.1:8b + +# NVIDIA API (optional - for cloud models) +NVIDIA_API_KEY=your-nvidia-api-key + +# vLLM configuration (optional) +VLLM_BASE_URL=http://vllm:8001/v1 +VLLM_MODEL=meta-llama/Llama-3.2-3B-Instruct +``` + +### Standard Deployment + +1. **Clone the repository:** +```bash +git clone +cd txt2kg +``` + +2. **Start the application:** +```bash +./start.sh +``` + +That's it! No configuration needed. The script will: +- Start all required services with Docker Compose +- Set up ArangoDB database +- Initialize local Pinecone vector storage +- Launch Ollama with GPU acceleration +- Start the Next.js frontend + +3. **Pull an Ollama model (first time only):** +```bash +docker exec ollama-compose ollama pull llama3.1:8b +``` + +4. 
**Access the application:** +- **Web UI**: http://localhost:3001 +- **ArangoDB**: http://localhost:8529 (no authentication required) +- **Ollama API**: http://localhost:11434 + +### Advanced Deployment Options + +#### Using vLLM for FP8 Quantization + +vLLM provides advanced GPU acceleration with FP8 quantization for smaller memory footprint: + +```bash +# Use vLLM compose file +docker compose -f deploy/compose/docker-compose.vllm.yml up -d +``` + +vLLM is recommended for: +- Newer NVIDIA GPUs (Ampere architecture or later) +- Production deployments requiring maximum throughput +- Memory-constrained environments (FP8 uses less VRAM) + +#### GPU Setup Prerequisites + +1. **Install NVIDIA Container Toolkit**: + ```bash + # Ubuntu/Debian + distribution=$(. /etc/os-release;echo $ID$VERSION_ID) + curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - + curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list + + sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit + sudo systemctl restart docker + ``` + +2. 
**Verify GPU Access**: + ```bash + docker run --rm --gpus all nvidia/cuda:12.0-base-ubuntu22.04 nvidia-smi + ``` + +#### Troubleshooting + +**Check Service Logs**: +```bash +# View all service logs +docker compose logs -f + +# View Ollama logs +docker compose logs -f ollama + +# View vLLM logs (if using vLLM) +docker compose -f deploy/compose/docker-compose.vllm.yml logs -f vllm +``` + +**GPU Issues**: +```bash +# Check GPU availability +nvidia-smi + +# Verify Docker GPU access +docker run --rm --gpus all nvidia/cuda:12.0-base nvidia-smi +``` + +**Ollama Model Management**: +```bash +# List available models +docker exec ollama-compose ollama list + +# Pull a different model +docker exec ollama-compose ollama pull mistral + +# Remove a model to free space +docker exec ollama-compose ollama rm llama3.1:8b +``` + +## Available Customizations + +The following are some of the customizations you can make: + +- **Switch Ollama models**: Use any model from Ollama's library (Llama, Qwen, etc.) +- **Modify extraction prompts**: Customize how triples are extracted from text +- **Adjust embedding parameters**: Change the SentenceTransformer model +- **Implement custom entity relationships**: Define domain-specific relationship types +- **Add domain-specific knowledge sources**: Integrate external ontologies or taxonomies +- **Configure GPU settings**: Optimize VRAM usage and performance for your hardware +- **Switch to vLLM**: Use vLLM for advanced quantization and higher throughput +- **Use NVIDIA API**: Connect to cloud models for specific use cases + +## License + +[MIT](LICENSE) + +This is licensed under the MIT License. This project will download and install additional third-party open source software projects and containers. 
diff --git a/nvidia/txt2kg/assets/deploy/README.md b/nvidia/txt2kg/assets/deploy/README.md new file mode 100644 index 0000000..5e7fd75 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/README.md @@ -0,0 +1,38 @@ +# Deployment Configuration + +This directory contains all deployment-related configuration for the txt2kg project. + +## Structure + +- **compose/**: Docker Compose files for local development and testing + - `docker-compose.yml`: Main Docker Compose configuration + - `docker-compose.gnn.yml`: Docker Compose configuration for GNN components + - `docker-compose.neo4j.yml`: Docker Compose configuration for Neo4j + +- **docker/**: Docker-related files + - Dockerfile + - Initialization scripts for services + +- **services/**: Containerized services + - **gnn_model/**: Graph Neural Network model service + - **sentence-transformers/**: Sentence transformer service for embeddings + +## Usage + +To start the default services: + +```bash +docker-compose -f deploy/compose/docker-compose.yml up +``` + +To include GNN components: + +```bash +docker-compose -f deploy/compose/docker-compose.yml -f deploy/compose/docker-compose.gnn.yml up +``` + +To include Neo4j: + +```bash +docker-compose -f deploy/compose/docker-compose.yml -f deploy/compose/docker-compose.neo4j.yml up +``` \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/app/Dockerfile b/nvidia/txt2kg/assets/deploy/app/Dockerfile new file mode 100644 index 0000000..8e0f786 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/app/Dockerfile @@ -0,0 +1,48 @@ +# Use the official Node.js image from the Docker Hub +FROM node:18-slim + +# Set environment variables to avoid interactive prompts +ENV DEBIAN_FRONTEND=noninteractive +ENV NPM_CONFIG_YES=true +ENV PNPM_HOME=/pnpm +ENV PATH="$PNPM_HOME:$PATH" + +# Set the working directory +WORKDIR /app + +# Install pnpm globally with --force and yes flags +RUN npm install -g pnpm --force --yes + +# Copy package files ONLY (for better Docker layer caching) +# Copy 
package.json (required) and pnpm-lock.yaml (optional) +COPY ./frontend/package.json ./ +COPY ./frontend/pnpm-lock.yaml* ./ + +# Copy the scripts directory (needed for setup-pinecone) +COPY ./scripts/ /scripts/ + +# Update the setup-pinecone.js path in package.json +RUN sed -i 's|"setup-pinecone": "node ../scripts/setup-pinecone.js"|"setup-pinecone": "node /scripts/setup-pinecone.js"|g' package.json + +# Install project dependencies (this layer will be cached if package files don't change) +# Use --no-frozen-lockfile as fallback if lockfile is missing or out of sync +RUN pnpm config set auto-install-peers true && \ + if [ -f pnpm-lock.yaml ]; then \ + echo "Lock file found, installing with frozen lockfile..." && \ + (pnpm install --no-optional --frozen-lockfile || pnpm install --no-optional --no-frozen-lockfile); \ + else \ + echo "No lock file found, installing without frozen lockfile..." && \ + pnpm install --no-optional --no-frozen-lockfile; \ + fi + +# Copy the rest of the frontend files +COPY ./frontend/ ./ + +# Build the application +RUN pnpm build + +# Expose the port the app runs on +EXPOSE 3000 + +# Start the application +CMD ["pnpm", "start"] \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/app/Dockerfile.remote-webgpu b/nvidia/txt2kg/assets/deploy/app/Dockerfile.remote-webgpu new file mode 100644 index 0000000..da3a56b --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/app/Dockerfile.remote-webgpu @@ -0,0 +1,50 @@ +# Remote WebGPU Clustering Service Dockerfile +# Based on NVIDIA PyTorch Geometric container which includes cuGraph/RAPIDS for GPU acceleration + +FROM nvcr.io/nvidia/pyg:25.05-py3 + +# Set working directory +WORKDIR /app + +# Install system dependencies +RUN apt-get update && apt-get install -y \ + curl \ + wget \ + git \ + build-essential \ + && rm -rf /var/lib/apt/lists/* + +# Install Python dependencies for remote WebGPU service +COPY requirements-remote-webgpu.txt . 
+RUN pip install --no-cache-dir -r requirements-remote-webgpu.txt + +# Install additional dependencies for WebRTC streaming +RUN pip install --no-cache-dir \ + opencv-python-headless \ + plotly \ + kaleido \ + Pillow \ + redis + +# Copy service files +COPY remote_webgpu_clustering_service.py . +COPY unified_gpu_service.py . +COPY local_gpu_viz_service.py . +COPY simple_webgpu_test.py . + +# Create directories for temporary files +RUN mkdir -p /tmp/webrtc_frames + +# Set environment variables +ENV PYTHONPATH=/app +ENV CUDA_VISIBLE_DEVICES=0 + +# Expose port +EXPOSE 8083 + +# Health check +HEALTHCHECK --interval=30s --timeout=10s --start-period=5s --retries=3 \ + CMD curl -f http://localhost:8083/health || exit 1 + +# Start the main remote WebGPU clustering service +CMD ["python", "remote_webgpu_clustering_service.py"] diff --git a/nvidia/txt2kg/assets/deploy/app/arangodb-init/create-database.js b/nvidia/txt2kg/assets/deploy/app/arangodb-init/create-database.js new file mode 100644 index 0000000..57c0812 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/app/arangodb-init/create-database.js @@ -0,0 +1,22 @@ +// ArangoDB initialization script to create the txt2kg database +// This script is executed automatically when the ArangoDB container starts + +db._createDatabase("txt2kg"); +console.log("Database 'txt2kg' created successfully!"); + +// Optional: Create collections needed by your application +// Replace with actual collections you need +/* +const db = require("@arangodb").db; +db._useDatabase("txt2kg"); + +if (!db._collection("entities")) { + db._createDocumentCollection("entities"); + console.log("Collection 'entities' created"); +} + +if (!db._collection("relationships")) { + db._createEdgeCollection("relationships"); + console.log("Collection 'relationships' created"); +} +*/ \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/app/arangodb-init/init.sh b/nvidia/txt2kg/assets/deploy/app/arangodb-init/init.sh new file mode 100755 index 
0000000..196fd19 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/app/arangodb-init/init.sh @@ -0,0 +1,19 @@ +#!/bin/bash +set -e + +# Wait for ArangoDB to be ready +echo "Waiting for ArangoDB to start..." +until curl --silent --fail http://localhost:8529/_api/version > /dev/null; do + echo "ArangoDB is unavailable - sleeping" + sleep 1 +done + +echo "ArangoDB is up - executing initialization script" + +# Run the database creation script +arangosh \ + --server.endpoint tcp://127.0.0.1:8529 \ + --server.authentication false \ + --javascript.execute /docker-entrypoint-initdb.d/create-database.js + +echo "Initialization completed" \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/app/pinecone-init.sh b/nvidia/txt2kg/assets/deploy/app/pinecone-init.sh new file mode 100755 index 0000000..86eb49c --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/app/pinecone-init.sh @@ -0,0 +1,47 @@ +#!/bin/sh + +# Script to initialize Pinecone index at container startup +echo "Initializing Pinecone index..." + +# Wait for the Pinecone service to become available +echo "Waiting for Pinecone service to start..." +max_attempts=30 +attempt=1 + +while [ $attempt -le $max_attempts ]; do + if curl -s --head http://pinecone:5080 > /dev/null; then + echo "Pinecone service is up!" + break + fi + echo "Waiting for Pinecone service (attempt $attempt/$max_attempts)..." + attempt=$((attempt + 1)) + sleep 2 +done + +if [ $attempt -gt $max_attempts ]; then + echo "Timed out waiting for Pinecone service" + exit 1 +fi + +# Create the index directly +echo "Creating index 'entity-embeddings'..." +curl -v -X POST "http://pinecone:5080/create_index" \ + -H "Content-Type: application/json" \ + -d '{ + "name": "entity-embeddings", + "dimension": 384, + "metric": "cosine" + }' + +# Also try alternate endpoint as fallback +echo "Trying alternate endpoint..." 
+curl -v -X POST "http://pinecone:5080/indexes" \ + -H "Content-Type: application/json" \ + -H "Api-Key: pclocal" \ + -d '{ + "name": "entity-embeddings", + "dimension": 384, + "metric": "cosine" + }' + +echo "Pinecone initialization complete" \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/compose/docker-compose.complete.yml b/nvidia/txt2kg/assets/deploy/compose/docker-compose.complete.yml new file mode 100644 index 0000000..02e18f5 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/compose/docker-compose.complete.yml @@ -0,0 +1,140 @@ +version: '3.8' + +services: + app: + build: + context: ../.. + dockerfile: deploy/app/Dockerfile + ports: + - '3001:3000' + environment: + - ARANGODB_URL=http://arangodb:8529 + - ARANGODB_DB=txt2kg + - PINECONE_HOST=entity-embeddings + - PINECONE_PORT=5081 + - PINECONE_API_KEY=pclocal + - PINECONE_ENVIRONMENT=local + - LANGCHAIN_TRACING_V2=true + - SENTENCE_TRANSFORMER_URL=http://sentence-transformers:80 + - MODEL_NAME=all-MiniLM-L6-v2 + - GRPC_SSL_CIPHER_SUITES=HIGH+ECDSA:HIGH+aRSA + - NODE_TLS_REJECT_UNAUTHORIZED=0 + # - XAI_API_KEY=${XAI_API_KEY} # xAI integration removed + - OLLAMA_BASE_URL=http://ollama:11434/v1 + - OLLAMA_MODEL=qwen3:1.7b + - S3_ENDPOINT=http://minio:9000 + - S3_REGION=us-east-1 + - S3_BUCKET=txt2kg + - S3_ACCESS_KEY=minioadmin + - S3_SECRET_KEY=minioadmin + networks: + - pinecone-net + - s3-net + - default + depends_on: + - arangodb + - entity-embeddings + - sentence-transformers + - minio + + arangodb: + image: arangodb:latest + ports: + - '8529:8529' + environment: + - ARANGO_NO_AUTH=1 + volumes: + - arangodb_data:/var/lib/arangodb3 + - arangodb_apps_data:/var/lib/arangodb3-apps + + arangodb-init: + image: arangodb:latest + depends_on: + arangodb: + condition: service_started + restart: on-failure + entrypoint: > + sh -c " + echo 'Waiting for ArangoDB to start...' && + sleep 10 && + echo 'Creating txt2kg database...' 
&& + arangosh --server.endpoint tcp://arangodb:8529 --server.authentication false --javascript.execute-string 'try { db._createDatabase(\"txt2kg\"); console.log(\"Database txt2kg created successfully!\"); } catch(e) { if(e.message.includes(\"duplicate\")) { console.log(\"Database txt2kg already exists\"); } else { throw e; } }' + " + + entity-embeddings: + image: ghcr.io/pinecone-io/pinecone-index:latest + container_name: entity-embeddings + environment: + PORT: 5081 + INDEX_TYPE: serverless + VECTOR_TYPE: dense + DIMENSION: 384 + METRIC: cosine + INDEX_NAME: entity-embeddings + ports: + - "5081:5081" + platform: linux/amd64 + networks: + - pinecone-net + restart: unless-stopped + + sentence-transformers: + build: + context: ../../deploy/services/sentence-transformers + dockerfile: Dockerfile + ports: + - '8000:80' + environment: + - MODEL_NAME=all-MiniLM-L6-v2 + networks: + - default + + # MinIO S3-compatible storage + minio: + image: minio/minio:latest + container_name: txt2kg-minio + ports: + - "9000:9000" # API endpoint + - "9001:9001" # Web console + environment: + - MINIO_ROOT_USER=minioadmin + - MINIO_ROOT_PASSWORD=minioadmin + volumes: + - minio_data:/data + command: server /data --console-address ":9001" + healthcheck: + test: ["CMD", "curl", "-f", "http://localhost:9000/minio/health/live"] + interval: 30s + timeout: 20s + retries: 3 + networks: + - s3-net + - default + + createbucket: + image: minio/mc + depends_on: + - minio + entrypoint: > + /bin/sh -c " + sleep 5; + /usr/bin/mc config host add myminio http://minio:9000 minioadmin minioadmin; + /usr/bin/mc mb myminio/txt2kg; + /usr/bin/mc policy set public myminio/txt2kg; + exit 0; + " + networks: + - s3-net + +volumes: + arangodb_data: + arangodb_apps_data: + minio_data: + +networks: + pinecone-net: + name: pinecone + s3-net: + name: s3-network + default: + driver: bridge \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/compose/docker-compose.vllm.yml 
b/nvidia/txt2kg/assets/deploy/compose/docker-compose.vllm.yml new file mode 100644 index 0000000..817695c --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/compose/docker-compose.vllm.yml @@ -0,0 +1,137 @@ +services: + app: + build: + context: ../.. + dockerfile: deploy/app/Dockerfile + ports: + - '3001:3000' + environment: + - ARANGODB_URL=http://arangodb:8529 + - ARANGODB_DB=txt2kg + - PINECONE_HOST=entity-embeddings + - PINECONE_PORT=5081 + - PINECONE_API_KEY=pclocal + - PINECONE_ENVIRONMENT=local + - LANGCHAIN_TRACING_V2=true + - SENTENCE_TRANSFORMER_URL=http://sentence-transformers:80 + - MODEL_NAME=all-MiniLM-L6-v2 + - GRPC_SSL_CIPHER_SUITES=HIGH+ECDSA:HIGH+aRSA + - NODE_TLS_REJECT_UNAUTHORIZED=0 + - OLLAMA_BASE_URL=http://ollama:11434/v1 + - OLLAMA_MODEL=qwen3:1.7b + - VLLM_BASE_URL=http://vllm:8001/v1 + - VLLM_MODEL=meta-llama/Llama-3.2-3B-Instruct + - REMOTE_WEBGPU_SERVICE_URL=http://txt2kg-remote-webgpu:8083 + networks: + - pinecone-net + - default + - txt2kg-network + depends_on: + - arangodb + - entity-embeddings + - sentence-transformers + - vllm + arangodb: + image: arangodb:latest + ports: + - '8529:8529' + environment: + - ARANGO_NO_AUTH=1 + volumes: + - arangodb_data:/var/lib/arangodb3 + - arangodb_apps_data:/var/lib/arangodb3-apps + arangodb-init: + image: arangodb:latest + depends_on: + arangodb: + condition: service_started + restart: on-failure + entrypoint: > + sh -c " + echo 'Waiting for ArangoDB to start...' && + sleep 10 && + echo 'Creating txt2kg database...' 
&& + arangosh --server.endpoint tcp://arangodb:8529 --server.authentication false --javascript.execute-string 'try { db._createDatabase(\"txt2kg\"); console.log(\"Database txt2kg created successfully!\"); } catch(e) { if(e.message.includes(\"duplicate\")) { console.log(\"Database txt2kg already exists\"); } else { throw e; } }' + " + entity-embeddings: + image: ghcr.io/pinecone-io/pinecone-index:latest + container_name: entity-embeddings + environment: + PORT: 5081 + INDEX_TYPE: serverless + VECTOR_TYPE: dense + DIMENSION: 384 + METRIC: cosine + INDEX_NAME: entity-embeddings + ports: + - "5081:5081" + platform: linux/amd64 + networks: + - pinecone-net + restart: unless-stopped + sentence-transformers: + build: + context: ../../deploy/services/sentence-transformers + dockerfile: Dockerfile + ports: + - '8000:80' + environment: + - MODEL_NAME=all-MiniLM-L6-v2 + networks: + - default + vllm: + build: + context: ../../deploy/services/vllm + dockerfile: Dockerfile + container_name: vllm-service + ports: + - '8001:8001' + environment: + # Model configuration + - VLLM_MODEL=meta-llama/Llama-3.2-3B-Instruct + - VLLM_TENSOR_PARALLEL_SIZE=1 + - VLLM_MAX_MODEL_LEN=4096 + - VLLM_GPU_MEMORY_UTILIZATION=0.9 + # NVfp4 quantization settings + - VLLM_QUANTIZATION=fp8 + - VLLM_KV_CACHE_DTYPE=fp8 + # Service configuration + - VLLM_PORT=8001 + - VLLM_HOST=0.0.0.0 + # Performance tuning + - CUDA_VISIBLE_DEVICES=0 + - NCCL_DEBUG=INFO + volumes: + - vllm_models:/app/models + - /tmp:/tmp + # Mount model cache for faster startup + - ~/.cache/huggingface:/root/.cache/huggingface + networks: + - default + restart: unless-stopped + deploy: + resources: + reservations: + devices: + - driver: nvidia + count: 1 + capabilities: [gpu] + healthcheck: + test: ["CMD", "curl", "-f", "http://localhost:8001/v1/models"] + interval: 30s + timeout: 10s + retries: 5 + start_period: 120s # Longer start period for model loading + +volumes: + arangodb_data: + arangodb_apps_data: + vllm_models: + +networks: + 
pinecone-net: + name: pinecone + default: + driver: bridge + txt2kg-network: + driver: bridge diff --git a/nvidia/txt2kg/assets/deploy/compose/docker-compose.yml b/nvidia/txt2kg/assets/deploy/compose/docker-compose.yml new file mode 100644 index 0000000..3107cb3 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/compose/docker-compose.yml @@ -0,0 +1,168 @@ +services: + app: + build: + context: ../.. + dockerfile: deploy/app/Dockerfile + ports: + - '3001:3000' + environment: + - ARANGODB_URL=http://arangodb:8529 + - ARANGODB_DB=txt2kg + - PINECONE_HOST=entity-embeddings + - PINECONE_PORT=5081 + - PINECONE_API_KEY=pclocal + - PINECONE_ENVIRONMENT=local + - LANGCHAIN_TRACING_V2=true + - SENTENCE_TRANSFORMER_URL=http://sentence-transformers:80 + - MODEL_NAME=all-MiniLM-L6-v2 + - GRPC_SSL_CIPHER_SUITES=HIGH+ECDSA:HIGH+aRSA + - NODE_TLS_REJECT_UNAUTHORIZED=0 + # - XAI_API_KEY=${XAI_API_KEY} # xAI integration removed + - OLLAMA_BASE_URL=http://ollama:11434/v1 + - OLLAMA_MODEL=llama3.1:8b + - VLLM_BASE_URL=http://vllm:8001/v1 + - VLLM_MODEL=meta-llama/Llama-3.2-3B-Instruct + - REMOTE_WEBGPU_SERVICE_URL=http://txt2kg-remote-webgpu:8083 + # Node.js timeout configurations for large model processing + - NODE_OPTIONS=--max-http-header-size=80000 + - UV_THREADPOOL_SIZE=128 + - HTTP_TIMEOUT=1800000 + - REQUEST_TIMEOUT=1800000 + networks: + - pinecone-net + - default + - txt2kg-network + arangodb: + image: arangodb:latest + ports: + - '8529:8529' + environment: + - ARANGO_NO_AUTH=1 + volumes: + - arangodb_data:/var/lib/arangodb3 + - arangodb_apps_data:/var/lib/arangodb3-apps + arangodb-init: + image: arangodb:latest + depends_on: + arangodb: + condition: service_started + restart: on-failure + entrypoint: > + sh -c " + echo 'Waiting for ArangoDB to start...' && + sleep 10 && + echo 'Creating txt2kg database...' 
&& + arangosh --server.endpoint tcp://arangodb:8529 --server.authentication false --javascript.execute-string 'try { db._createDatabase(\"txt2kg\"); console.log(\"Database txt2kg created successfully!\"); } catch(e) { if(e.message.includes(\"duplicate\")) { console.log(\"Database txt2kg already exists\"); } else { throw e; } }' + " + entity-embeddings: + image: ghcr.io/pinecone-io/pinecone-index:latest + container_name: entity-embeddings + environment: + PORT: 5081 + INDEX_TYPE: serverless + VECTOR_TYPE: dense + DIMENSION: 384 + METRIC: cosine + INDEX_NAME: entity-embeddings + ports: + - "5081:5081" + platform: linux/amd64 + networks: + - pinecone-net + restart: unless-stopped + sentence-transformers: + build: + context: ../../deploy/services/sentence-transformers + dockerfile: Dockerfile + ports: + - '8000:80' + environment: + - MODEL_NAME=all-MiniLM-L6-v2 + networks: + - default + ollama: + build: + context: ../services/ollama + dockerfile: Dockerfile + image: ollama-custom:latest + container_name: ollama-compose + ports: + - '11434:11434' + volumes: + - ollama_data:/root/.ollama + environment: + - OLLAMA_FLASH_ATTENTION=1 # Enable flash attention for better performance + - OLLAMA_KEEP_ALIVE=30m # Keep models loaded for 30 minutes + - OLLAMA_CUDA=1 # Enable CUDA acceleration + - OLLAMA_LLM_LIBRARY=cuda # Use CUDA library for LLM operations + - OLLAMA_NUM_PARALLEL=1 # Process one request at a time for 70B models + - OLLAMA_MAX_LOADED_MODELS=1 # Load only one model at a time to avoid VRAM contention + - OLLAMA_KV_CACHE_TYPE=q8_0 # Reduce KV cache VRAM usage with minimal performance impact + - OLLAMA_GPU_LAYERS=999 # Use maximum GPU layers + - OLLAMA_GPU_MEMORY_FRACTION=0.9 # Use 90% of GPU memory + - CUDA_VISIBLE_DEVICES=0 # Use GPU 0 (change to 'all' for multi-GPU) + networks: + - default + restart: unless-stopped + deploy: + resources: + reservations: + devices: + - driver: nvidia + count: all + capabilities: [gpu] + healthcheck: + test: ["CMD", "curl", "-f", 
"http://localhost:11434/api/tags"] + interval: 30s + timeout: 10s + retries: 3 + start_period: 60s + vllm: + build: + context: ../../deploy/services/vllm + dockerfile: Dockerfile + container_name: vllm-service + ports: + - '8001:8001' + environment: + - VLLM_MODEL=meta-llama/Llama-3.2-3B-Instruct + - VLLM_TENSOR_PARALLEL_SIZE=1 + - VLLM_MAX_MODEL_LEN=4096 + - VLLM_GPU_MEMORY_UTILIZATION=0.9 + - VLLM_QUANTIZATION=fp8 + - VLLM_KV_CACHE_DTYPE=fp8 + - VLLM_PORT=8001 + - VLLM_HOST=0.0.0.0 + volumes: + - vllm_models:/app/models + - /tmp:/tmp + networks: + - default + restart: unless-stopped + deploy: + resources: + reservations: + devices: + - driver: nvidia + count: all + capabilities: [gpu] + healthcheck: + test: ["CMD", "curl", "-f", "http://localhost:8001/v1/models"] + interval: 30s + timeout: 10s + retries: 3 + start_period: 60s + +volumes: + arangodb_data: + arangodb_apps_data: + ollama_data: + vllm_models: + +networks: + pinecone-net: + name: pinecone + default: + driver: bridge + txt2kg-network: + driver: bridge \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gnn_model/Dockerfile b/nvidia/txt2kg/assets/deploy/services/gnn_model/Dockerfile new file mode 100644 index 0000000..a1558d3 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gnn_model/Dockerfile @@ -0,0 +1,26 @@ +FROM nvcr.io/nvidia/pyg:25.03-py3 + +WORKDIR /app + +# Install Flask and other required packages +RUN pip install --no-cache-dir \ + flask==2.0.1 \ + gunicorn==20.1.0 \ + tqdm + +# Create model directory +RUN mkdir -p /app/models + +# Copy application code +COPY services/gnn_model/app.py /app/ + +# Set environment variables +ENV MODEL_PATH=/app/models/tech-qa-model.pt +ENV PYTHONUNBUFFERED=1 +ENV FLASK_APP=app.py + +# Expose the port +EXPOSE 5000 + +# Run the service with gunicorn +CMD ["gunicorn", "--bind", "0.0.0.0:5000", "app:app"] \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gnn_model/README.md 
b/nvidia/txt2kg/assets/deploy/services/gnn_model/README.md new file mode 100644 index 0000000..c6ff392 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gnn_model/README.md @@ -0,0 +1,95 @@ +# GNN Model Service + +This service provides a REST API for serving predictions from a Graph Neural Network (GNN) model trained to enhance RAG (Retrieval Augmented Generation) performance. It allows comparing GNN-based knowledge graph retrieval with traditional RAG approaches. + +## Overview + +The service exposes a simple API to: +- Load a pre-trained GNN model that combines graph structures with language models +- Process queries by incorporating graph-structured knowledge +- Return predictions that leverage both text and graph relationships + +## Getting Started + +### Prerequisites + +- Docker and Docker Compose +- The trained model file (created using `train_export.py`) + +### Running the Service + +The service is included in the main docker-compose configuration. Simply run: + +```bash +docker-compose up -d +``` + +This will start the GNN model service along with other services in the system. + +## Training the Model + +Before using the service, you need to train the GNN model: + +```bash +# Create the models directory if it doesn't exist +mkdir -p models + +# Run the training script +python deploy/services/gnn_model/train_export.py --output_dir models +``` + +This will create the `tech-qa-model.pt` file in the models directory, which the service will load. + +## API Endpoints + +### Health Check + +``` +GET /health +``` + +Returns the health status of the service. 
+ +### Prediction + +``` +POST /predict +``` + +Request body: +```json +{ + "question": "Your question here", + "context": "Retrieved context information" +} +``` + +Response: +```json +{ + "question": "Your question here", + "answer": "The generated answer" +} +``` + +## Using the Client Example + +A simple client script is provided to test the service: + +```bash +python deploy/services/gnn_model/client_example.py --question "What is the capital of France?" --context "France is a country in Western Europe. Its capital is Paris, which is known for the Eiffel Tower." +``` + +This script also includes a placeholder for comparing the GNN-based approach with a traditional RAG approach. + +## Architecture + +The GNN model service uses: +- A Graph Attention Network (GAT) to process graph structured data +- A Language Model (LLM) to generate answers +- A combined architecture (GRetriever) that leverages both components + +## Limitations + +- The current implementation requires graph construction to be handled separately +- The `create_graph_from_text` function in the service is a placeholder that needs implementation based on your specific graph construction approach \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gnn_model/app.py b/nvidia/txt2kg/assets/deploy/services/gnn_model/app.py new file mode 100644 index 0000000..fc89eef --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gnn_model/app.py @@ -0,0 +1,114 @@ +#!/usr/bin/env python3 + +import os +import torch +from flask import Flask, request, jsonify +import torch_geometric +from torch_geometric.nn import GAT, LLM, GRetriever + +app = Flask(__name__) + +# Constants +MODEL_PATH = os.environ.get('MODEL_PATH', '/app/models/tech-qa-model.pt') +LLM_GENERATOR_NAME = os.environ.get('LLM_GENERATOR_NAME', 'meta-llama/Meta-Llama-3.1-8B-Instruct') +GNN_HID_CHANNELS = int(os.environ.get('GNN_HID_CHANNELS', '1024')) +GNN_LAYERS = int(os.environ.get('GNN_LAYERS', '4')) + +# Prompt template for 
questions +prompt_template = """Answer this question based on retrieved contexts. Just give the answer without explanation. +[QUESTION] {question} [END_QUESTION] +[RETRIEVED_CONTEXTS] {context} [END_RETRIEVED_CONTEXTS] +Answer: """ + +# Load the model +def load_model(): + print(f"Loading model from {MODEL_PATH}") + + # Create the GNN component + gnn = GAT(in_channels=768, hidden_channels=GNN_HID_CHANNELS, + out_channels=1024, num_layers=GNN_LAYERS, heads=4) + + # Create the LLM component + llm = LLM(model_name=LLM_GENERATOR_NAME) + + # Create the GRetriever model + model = GRetriever(llm=llm, gnn=gnn) + + # Load trained weights + if os.path.exists(MODEL_PATH): + state_dict = torch.load(MODEL_PATH, weights_only=True) + model.load_state_dict(state_dict) + model.eval() + print("Model loaded successfully") + else: + print(f"WARNING: Model file not found at {MODEL_PATH}. Using untrained model.") + + return model + +# Initialize model +model = None + +@app.before_first_request +def initialize(): + global model + model = load_model() + +@app.route('/health', methods=['GET']) +def health_check(): + return jsonify({"status": "healthy"}) + +@app.route('/predict', methods=['POST']) +def predict(): + if not request.is_json: + return jsonify({"error": "Request must be JSON"}), 400 + + data = request.get_json() + + if 'question' not in data: + return jsonify({"error": "Question is required"}), 400 + + if 'context' not in data: + return jsonify({"error": "Context is required"}), 400 + + question = data['question'] + context = data['context'] + + # Format the question with context using the prompt template + formatted_question = prompt_template.format(question=question, context=context) + + # Prepare input for the model + # Note: In a real implementation, you'd need to convert text to graph structure + # Here we're assuming a simplified interface for demonstration + try: + # Create a PyTorch Geometric Data object + # This is simplified and would need to be adapted to your actual 
graph structure + graph_data = create_graph_from_text(context) + + # Generate prediction + with torch.no_grad(): + prediction = model.generate( + input_question=[formatted_question], + input_graph=graph_data + )[0] # Get first prediction since we're processing one sample + + return jsonify({ + "question": question, + "answer": prediction + }) + + except Exception as e: + return jsonify({"error": str(e)}), 500 + +def create_graph_from_text(text): + """ + Convert text to a graph structure for the GNN. + This is a placeholder - you'll need to implement the actual conversion + based on your specific graph construction approach. + """ + # This would need to be implemented based on how your graphs are constructed + # For now, return a dummy graph + raise NotImplementedError("Graph creation from text needs to be implemented") + +if __name__ == '__main__': + port = int(os.environ.get('PORT', 5000)) + app.run(host='0.0.0.0', port=port) \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gnn_model/client_example.py b/nvidia/txt2kg/assets/deploy/services/gnn_model/client_example.py new file mode 100644 index 0000000..18659a7 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gnn_model/client_example.py @@ -0,0 +1,83 @@ +#!/usr/bin/env python3 + +import requests +import json +import argparse + +def parse_args(): + parser = argparse.ArgumentParser(description='Client for GNN Model Service') + parser.add_argument('--url', type=str, default='http://localhost:5000', + help='URL of the GNN model service') + parser.add_argument('--question', type=str, required=True, + help='Question to ask') + parser.add_argument('--context', type=str, required=True, + help='Context information to provide') + + return parser.parse_args() + +def query_gnn_model(url, question, context): + """ + Query the GNN model service with a question and context + """ + endpoint = f"{url}/predict" + + payload = { + "question": question, + "context": context + } + + headers = { + 
"Content-Type": "application/json" + } + + try: + response = requests.post(endpoint, json=payload, headers=headers) + if response.status_code == 200: + return response.json() + else: + print(f"Error: {response.status_code}") + print(response.text) + return None + except Exception as e: + print(f"Error connecting to GNN service: {e}") + return None + +def query_rag_model(question, context): + """ + Simple Pure RAG approach for comparison + This is a placeholder - in a real implementation, you would have a separate RAG service + or use a local LLM with context insertion + """ + # This would typically call an API or use a local LLM + print("Note: This is a placeholder for a Pure RAG implementation") + return { + "question": question, + "answer": "Placeholder RAG answer. Implement real RAG for comparison." + } + +def compare_approaches(gnn_result, rag_result): + """ + Compare the results from GNN and Pure RAG approaches + """ + print("\n----- COMPARISON -----") + print(f"Question: {gnn_result['question']}") + print(f"GNN Answer: {gnn_result['answer']}") + print(f"RAG Answer: {rag_result['answer']}") + print("----------------------\n") + +if __name__ == "__main__": + args = parse_args() + + print(f"Querying GNN model at {args.url}...") + gnn_result = query_gnn_model(args.url, args.question, args.context) + + if gnn_result: + print("GNN Query successful!") + + # Get RAG result for comparison + rag_result = query_rag_model(args.question, args.context) + + # Compare the approaches + compare_approaches(gnn_result, rag_result) + else: + print("Failed to get response from GNN model service.") \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gnn_model/train_export.py b/nvidia/txt2kg/assets/deploy/services/gnn_model/train_export.py new file mode 100644 index 0000000..fb76c49 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gnn_model/train_export.py @@ -0,0 +1,164 @@ +#!/usr/bin/env python3 + +import os +import argparse +import torch +from 
torch_geometric import seed_everything +from torch_geometric.loader import DataLoader +from torch_geometric.nn import GAT, LLM, GRetriever + +def parse_args(): + parser = argparse.ArgumentParser(description='Train and export GNN model for service') + parser.add_argument('--dataset_file', type=str, default='tech_qa.pt', help='Path to load dataset') + parser.add_argument('--output_dir', type=str, default='models', help='Directory to save model') + parser.add_argument('--model_save_path', type=str, default='tech-qa-model.pt', help='Model file name') + parser.add_argument('--gnn_hidden_channels', type=int, default=1024, help='Hidden channels for GNN') + parser.add_argument('--num_gnn_layers', type=int, default=4, help='Number of GNN layers') + parser.add_argument('--llm_generator_name', type=str, default='meta-llama/Meta-Llama-3.1-8B-Instruct', + help='LLM to use for generation') + parser.add_argument('--epochs', type=int, default=2, help='Number of training epochs') + parser.add_argument('--batch_size', type=int, default=1, help='Training batch size') + parser.add_argument('--eval_batch_size', type=int, default=2, help='Evaluation batch size') + parser.add_argument('--lr', type=float, default=1e-5, help='Learning rate') + + return parser.parse_args() + +def load_dataset(dataset_path): + """ + Load preprocessed dataset from file + """ + if not os.path.exists(dataset_path): + raise FileNotFoundError(f"Dataset file not found at {dataset_path}. 
Please run preprocess_data.py first.") + + print(f"Loading dataset from {dataset_path}...") + data_lists = torch.load(dataset_path, weights_only=False) + print("Dataset loaded successfully!") + print(f"Train set size: {len(data_lists['train'])}") + print(f"Validation set size: {len(data_lists['validation'])}") + print(f"Test set size: {len(data_lists['test'])}") + + return data_lists + +def train_model(args, data_lists): + """ + Train the GNN model + """ + batch_size = args.batch_size + eval_batch_size = args.eval_batch_size + hidden_channels = args.gnn_hidden_channels + num_gnn_layers = args.num_gnn_layers + + train_loader = DataLoader(data_lists["train"], batch_size=batch_size, + drop_last=True, pin_memory=True, shuffle=True) + val_loader = DataLoader(data_lists["validation"], batch_size=eval_batch_size, + drop_last=False, pin_memory=True, shuffle=False) + + # Create GNN model + gnn = GAT(in_channels=768, hidden_channels=hidden_channels, + out_channels=1024, num_layers=num_gnn_layers, heads=4) + + # Create LLM model + llm = LLM(model_name=args.llm_generator_name) + + # Create the combined GRetriever model + model = GRetriever(llm=llm, gnn=gnn) + + # Training setup + params = [p for _, p in model.named_parameters() if p.requires_grad] + optimizer = torch.optim.AdamW([{ + 'params': params, 'lr': args.lr, 'weight_decay': 0.05 + }], betas=(0.9, 0.95)) + + # Prompt template for questions + prompt_template = """Answer this question based on retrieved contexts. Just give the answer without explanation. 
+[QUESTION] {question} [END_QUESTION] +[RETRIEVED_CONTEXTS] {context} [END_RETRIEVED_CONTEXTS] +Answer: """ + + # Training loop + for epoch in range(args.epochs): + model.train() + epoch_loss = 0 + print(f'Epoch: {epoch + 1}/{args.epochs}') + + for batch in train_loader: + new_qs = [] + for i, q in enumerate(batch["question"]): + # insert context + new_qs.append( + prompt_template.format(question=q, context=batch.text_context[i])) + batch.question = new_qs + + optimizer.zero_grad() + loss = model( + input_question=batch.question, + input_graph=batch, + output_labels=batch.label + ) + loss.backward() + torch.nn.utils.clip_grad_norm_(optimizer.param_groups[0]['params'], 0.1) + optimizer.step() + + epoch_loss += float(loss) + + avg_train_loss = epoch_loss / len(train_loader) + print(f'Train Loss: {avg_train_loss:.4f}') + + # Validation + model.eval() + val_loss = 0 + with torch.no_grad(): + for batch in val_loader: + new_qs = [] + for i, q in enumerate(batch["question"]): + # insert context + new_qs.append( + prompt_template.format(question=q, context=batch.text_context[i])) + batch.question = new_qs + + loss = model( + input_question=batch.question, + input_graph=batch, + output_labels=batch.label + ) + val_loss += float(loss) + + avg_val_loss = val_loss / len(val_loader) + print(f'Validation Loss: {avg_val_loss:.4f}') + + return model + +def save_model(model, save_path): + """ + Save the trained model + """ + directory = os.path.dirname(save_path) + if not os.path.exists(directory): + os.makedirs(directory) + + print(f"Saving model to {save_path}") + torch.save(model.state_dict(), save_path) + print("Model saved successfully!") + +if __name__ == '__main__': + import math + + # Set seed for reproducibility + seed_everything(50) + + # Parse arguments + args = parse_args() + + # Load dataset + dataset_path = os.path.join(args.output_dir, args.dataset_file) + data_lists = load_dataset(dataset_path) + + # Train model + model = train_model(args, data_lists) + + # Save 
model + model_path = os.path.join(args.output_dir, args.model_save_path) + save_model(model, model_path) + + print(f"Model has been trained and saved to {model_path}") + print("This model can now be used by the GNN service.") \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/Dockerfile b/nvidia/txt2kg/assets/deploy/services/gpu-viz/Dockerfile new file mode 100644 index 0000000..5f2ac1c --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/Dockerfile @@ -0,0 +1,40 @@ +# Use latest NVIDIA PyG container which includes cuGraph and graph-related packages +FROM nvcr.io/nvidia/pyg:25.08-py3 + +# Ensure we're running as root for system package installation +USER root + +# Set working directory +WORKDIR /app + +# Install system dependencies +RUN apt-get update && apt-get install -y \ + curl \ + wget \ + git \ + build-essential \ + && rm -rf /var/lib/apt/lists/* + +# Copy requirements first to leverage Docker cache +COPY requirements.txt . + +# Install Python dependencies +RUN pip install --no-cache-dir -r requirements.txt + +# Copy the service code +COPY unified_gpu_service.py . +COPY pygraphistry_service.py . 
+ +# Create a non-root user for security (using a different UID to avoid conflicts) +RUN useradd -m -u 1001 appuser && chown -R appuser:appuser /app +USER appuser + +# Expose unified service port +EXPOSE 8080 + +# Health check for unified service +HEALTHCHECK --interval=30s --timeout=10s --start-period=5s --retries=3 \ + CMD curl -f http://localhost:8080/api/health || exit 1 + +# Start unified service +CMD ["python", "unified_gpu_service.py"] \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/GPU_Rendering_Library_Options.md b/nvidia/txt2kg/assets/deploy/services/gpu-viz/GPU_Rendering_Library_Options.md new file mode 100644 index 0000000..204265c --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/GPU_Rendering_Library_Options.md @@ -0,0 +1,305 @@ +# GPU Rendering Library Options for Remote Visualization + +## 🎯 **Yes! Three.js is Perfect for Adding GPU Rendering** + +Your existing **Three.js v0.176.0** stack is ideal for adding true GPU-accelerated WebGL rendering to the remote service. Here's a comprehensive comparison of options: + +## 🚀 **Option 1: Three.js (Recommended)** + +### **Why Three.js is Perfect** +- ✅ **Already in your stack** - Three.js v0.176.0 in package.json +- ✅ **Mature WebGL abstraction** - Handles GPU complexity +- ✅ **InstancedMesh for performance** - Single draw call for millions of nodes +- ✅ **Built-in optimizations** - Frustum culling, LOD, memory management +- ✅ **Easy development** - High-level API, good documentation + +### **Three.js GPU Features for Graph Rendering** + +#### **1. 
InstancedMesh for Mass Node Rendering** +```javascript +// Single GPU draw call for 100k+ nodes +const geometry = new THREE.CircleGeometry(1, 8); +const material = new THREE.MeshBasicMaterial({ vertexColors: true }); +const instancedMesh = new THREE.InstancedMesh(geometry, material, nodeCount); + +// Set position, scale, color for each instance +const matrix = new THREE.Matrix4(); +const color = new THREE.Color(); + +nodes.forEach((node, i) => { + matrix.makeScale(node.size, node.size, 1); + matrix.setPosition(node.x, node.y, 0); + instancedMesh.setMatrixAt(i, matrix); + + color.setHex(node.clusterColor); + instancedMesh.setColorAt(i, color); +}); + +// GPU renders all nodes in one call +scene.add(instancedMesh); +``` + +#### **2. BufferGeometry for Edge Performance** +```javascript +// GPU-optimized edge rendering +const positions = new Float32Array(edgeCount * 6); +const colors = new Float32Array(edgeCount * 6); + +edges.forEach((edge, i) => { + const idx = i * 6; + // Source vertex + positions[idx] = edge.source.x; + positions[idx + 1] = edge.source.y; + // Target vertex + positions[idx + 3] = edge.target.x; + positions[idx + 4] = edge.target.y; +}); + +const geometry = new THREE.BufferGeometry(); +geometry.setAttribute('position', new THREE.BufferAttribute(positions, 3)); +geometry.setAttribute('color', new THREE.BufferAttribute(colors, 3)); + +const lineSegments = new THREE.LineSegments(geometry, material); +``` + +#### **3. 
Built-in Performance Optimizations** +```javascript +// Three.js GPU optimizations +renderer.sortObjects = false; // Disable expensive sorting +renderer.setPixelRatio(Math.min(devicePixelRatio, 2)); // Limit pixel density + +// Frustum culling (automatic) +// Level-of-detail (LOD) support +// Automatic geometry merging +// GPU texture atlasing +``` + +### **Performance Comparison** + +| Approach | 10k Nodes | 100k Nodes | 1M Nodes | FPS | +|----------|-----------|------------|----------|-----| +| **D3.js SVG** | ✅ Good | ❌ Slow | ❌ Unusable | 15fps | +| **Three.js Standard** | ✅ Excellent | ✅ Good | ❌ Slow | 45fps | +| **Three.js Instanced** | ✅ Excellent | ✅ Excellent | ✅ Good | 60fps | + +## 🔧 **Option 2: deck.gl (For Data-Heavy Visualizations)** + +### **Pros** +- ✅ **Built for large datasets** - Optimized for millions of points +- ✅ **WebGL2 compute shaders** - True GPU computation +- ✅ **Built-in graph layouts** - Force-directed on GPU +- ✅ **Excellent performance** - 1M+ nodes at 60fps + +### **Cons** +- ❌ **Large bundle size** - Adds ~500KB +- ❌ **Complex API** - Steeper learning curve +- ❌ **React-focused** - Less suitable for iframe embedding + +```javascript +// deck.gl GPU-accelerated approach +import { ScatterplotLayer, LineLayer } from '@deck.gl/layers'; + +const nodeLayer = new ScatterplotLayer({ + data: nodes, + getPosition: d => [d.x, d.y], + getRadius: d => d.size, + getFillColor: d => d.color, + radiusUnits: 'pixels', + // GPU instancing automatically enabled +}); + +const edgeLayer = new LineLayer({ + data: edges, + getSourcePosition: d => [d.source.x, d.source.y], + getTargetPosition: d => [d.target.x, d.target.y], + getColor: [100, 100, 100], + getWidth: 1 +}); +``` + +## ⚡ **Option 3: regl (Raw WebGL Performance)** + +### **Pros** +- ✅ **Maximum performance** - Direct WebGL access +- ✅ **Small bundle** - ~50KB +- ✅ **Full control** - Custom shaders, compute pipelines +- ✅ **Functional API** - Clean, predictable + +### **Cons** +- ❌ **Low-level 
complexity** - Manual memory management +- ❌ **Shader development** - GLSL programming required +- ❌ **More development time** - Everything custom + +```javascript +// regl direct WebGL approach +const drawNodes = regl({ + vert: ` + attribute vec2 position; + attribute float size; + attribute vec3 color; + varying vec3 vColor; + + void main() { + gl_Position = vec4(position, 0, 1); + gl_PointSize = size; + vColor = color; + } + `, + + frag: ` + precision mediump float; + varying vec3 vColor; + + void main() { + gl_FragColor = vec4(vColor, 1); + } + `, + + attributes: { + position: nodePositions, + size: nodeSizes, + color: nodeColors + }, + + count: nodeCount, + primitive: 'points' +}); +``` + +## 🎮 **Option 4: WebGPU (Future-Proof)** + +### **Pros** +- ✅ **Next-generation API** - Successor to WebGL +- ✅ **Compute shaders** - True parallel processing +- ✅ **Better performance** - Lower overhead +- ✅ **Multi-threading** - Parallel command buffers + +### **Cons** +- ❌ **Limited browser support** - Chrome/Edge only (2024) +- ❌ **New API** - Rapidly changing specification +- ❌ **Complex setup** - More verbose than WebGL + +```javascript +// WebGPU approach (future) +const adapter = await navigator.gpu.requestAdapter(); +const device = await adapter.requestDevice(); + +const computePipeline = device.createComputePipeline({ + compute: { + module: device.createShaderModule({ + code: ` + @compute @workgroup_size(64) + fn main(@builtin(global_invocation_id) global_id : vec3) { + let index = global_id.x; + if (index >= arrayLength(&positions)) { return; } + + // GPU-parallel force calculation + var force = vec2(0.0, 0.0); + for (var i = 0u; i < arrayLength(&positions); i++) { + if (i != index) { + let diff = positions[index] - positions[i]; + let dist = length(diff); + force += normalize(diff) * (1.0 / (dist * dist)); + } + } + + velocities[index] += force * 0.01; + positions[index] += velocities[index] * 0.1; + } + ` + }), + entryPoint: 'main' + } +}); +``` + +## 🏆 
**Recommendation: Three.js Integration** + +### **For Your Use Case, Three.js is Optimal Because:** + +1. **Already Available** - No new dependencies +2. **Proven Performance** - Handles 100k+ nodes smoothly +3. **Easy Integration** - Replace D3.js rendering with Three.js +4. **Maintenance** - Well-documented, stable API +5. **Development Speed** - Rapid implementation + +### **Implementation Strategy** + +#### **Phase 1: Basic Three.js WebGL (Week 1)** +```python +# Enhanced remote service with Three.js +def _generate_threejs_html(self, session_data, config): + return f""" + + + """ +``` + +#### **Phase 2: GPU Optimization (Week 2)** +- Add InstancedMesh for node rendering +- Implement BufferGeometry for edges +- Enable frustum culling and LOD + +#### **Phase 3: Advanced Features (Week 3)** +- GPU-based interaction (raycasting) +- Smooth camera controls +- Real-time layout animation + +### **Expected Performance Improvements** + +| Feature | D3.js SVG | Three.js WebGL | Improvement | +|---------|-----------|----------------|-------------| +| **50k nodes** | 5 FPS | 60 FPS | **12x faster** | +| **Animation** | Choppy | Smooth | **Fluid motion** | +| **Memory usage** | 200MB DOM | 50MB GPU | **4x less memory** | +| **Interaction** | Laggy | Responsive | **Real-time** | + +## 💡 **Implementation Roadmap** + +### **Step 1: Replace HTML Template** +```python +# In remote_gpu_rendering_service.py +def _generate_interactive_html(self, session_data, config): + if config.get('use_webgl', True): + return self._generate_threejs_webgl_html(session_data, config) + else: + return self._generate_d3_svg_html(session_data, config) # Fallback +``` + +### **Step 2: Add WebGL Configuration** +```typescript +// In RemoteGPUViewer component +const processWithWebGLOptimization = async () => { + const config = { + use_webgl: nodeCount > 5000, + instanced_rendering: nodeCount > 10000, + lod_enabled: nodeCount > 25000, + render_quality: 'high' + }; + // Process with enhanced GPU service +}; 
+``` + +### **Step 3: Performance Monitoring** +```javascript +// Built-in Three.js performance monitoring +console.log('Render Info:', { + triangles: renderer.info.render.triangles, + calls: renderer.info.render.calls, + geometries: renderer.info.memory.geometries, + textures: renderer.info.memory.textures +}); +``` + +**Result**: Your remote GPU service will provide **true GPU-accelerated rendering** with minimal development effort by leveraging your existing Three.js stack. \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/JavaScript_Library_Integration.md b/nvidia/txt2kg/assets/deploy/services/gpu-viz/JavaScript_Library_Integration.md new file mode 100644 index 0000000..fc074aa --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/JavaScript_Library_Integration.md @@ -0,0 +1,264 @@ +# JavaScript Library Stack Integration with Remote GPU Rendering + +## 🚀 **Library Architecture Overview** + +Your project leverages a sophisticated JavaScript stack optimized for graph visualization performance: + +### **Core Visualization Libraries** +```json +{ + "3d-force-graph": "^1.77.0", // WebGL 3D graph rendering + "three": "^0.176.0", // WebGL/WebGPU 3D engine + "d3": "^7.9.0", // Data binding & force simulation + "@types/d3": "^7.4.3", // TypeScript definitions + "@types/three": "^0.175.0" // Three.js TypeScript support +} +``` + +### **Frontend Framework** +```json +{ + "next": "15.1.0", // React framework with SSR + "react": "^19", // Component architecture + "tailwindcss": "^3.4.17" // Utility-first CSS +} +``` + +## 🎯 **Performance Optimization Strategies** + +### **1. 
Dynamic Import Strategy** + +**Problem:** Large visualization libraries increase initial bundle size +**Solution:** Conditional loading based on graph complexity + +```typescript +// ForceGraphWrapper.tsx - Dynamic loading pattern +const ForceGraph3D = (await import('3d-force-graph')).default; + +// Benefits: +// - Reduces initial bundle by ~2MB +// - Enables GPU capability detection +// - Prevents SSR WebGL conflicts +``` + +### **2. GPU Capability Detection** + +**Enhanced detection based on your library capabilities:** + +```typescript +const shouldUseRemoteRendering = (nodeCount: number) => { + const maxWebGLNodes = window.WebGL2RenderingContext ? 50000 : 10000; + const maxWebGPUNodes = 'gpu' in navigator ? 100000 : 25000; + + // Three.js geometry memory limits + const estimatedMemoryMB = (nodeCount * 64) / (1024 * 1024); + const maxClientMemory = hasWebGPU ? 512 : 256; // MB + + return nodeCount > maxWebGLNodes || estimatedMemoryMB > maxClientMemory; +}; +``` + +### **3. Library-Specific Optimizations** + +#### **Three.js Renderer Settings** +```typescript +const optimizeForThreeJS = (nodeCount: number) => ({ + // Instanced rendering for large graphs + instance_rendering: nodeCount > 10000, + + // Texture optimization + texture_atlasing: nodeCount > 5000, + max_texture_size: nodeCount > 25000 ? 2048 : 1024, + + // Performance culling + frustum_culling: nodeCount > 15000, + occlusion_culling: nodeCount > 25000, + + // Level-of-detail for distant nodes + enable_lod: nodeCount > 25000 +}); +``` + +#### **D3.js Force Simulation Tuning** +```typescript +const optimizeForD3 = (nodeCount: number) => ({ + // Reduced iterations for large graphs + physics_iterations: nodeCount > 50000 ? 100 : 300, + + // Faster convergence + alpha_decay: nodeCount > 50000 ? 0.05 : 0.02, + + // More damping for stability + velocity_decay: nodeCount > 50000 ? 
0.6 : 0.4 +}); +``` + +## 🔧 **Remote GPU Service Integration** + +### **Enhanced HTML Template Generation** + +The remote GPU service now generates HTML compatible with your frontend: + +```python +def _generate_interactive_html(self, session_data: dict, config: dict) -> str: + html_template = f""" + + + + + """ +``` + +### **Frontend Component Integration** + +```typescript +// RemoteGPUViewer.tsx - Library-aware processing +const processGraphWithLibraryOptimization = async () => { + const optimizedConfig = { + // Frontend library compatibility + d3_version: "7.9.0", + threejs_version: "0.176.0", + force_graph_version: "1.77.0", + + // WebGL optimization features + webgl_features: { + instance_rendering: nodeCount > 10000, + texture_atlasing: nodeCount > 5000, + frustum_culling: nodeCount > 15000 + }, + + // Performance tuning + progressive_loading: nodeCount > 25000, + gpu_memory_management: true + }; + + const response = await fetch('/api/render', { + method: 'POST', + body: JSON.stringify({ graph_data, config: optimizedConfig }) + }); +}; +``` + +## 📊 **Performance Benchmarks by Library Stack** + +### **Client-Side Rendering Limits** + +| Library Stack | Max Nodes | Memory Usage | Performance | +|---------------|-----------|--------------|-------------| +| **D3.js + SVG** | 5,000 | ~50MB | Good interaction | +| **Three.js + WebGL** | 50,000 | ~256MB | Smooth 60fps | +| **Three.js + WebGPU** | 100,000 | ~512MB | GPU-accelerated | +| **Remote GPU** | 1M+ | ~100KB transfer | Server-rendered | + +### **Rendering Strategy Decision Tree** + +```typescript +const selectRenderingStrategy = (nodeCount: number) => { + if (nodeCount < 5000) { + return "local_svg"; // D3.js + SVG DOM + } else if (nodeCount < 25000) { + return "local_webgl"; // Three.js + WebGL + } else if (nodeCount < 100000 && hasWebGPU) { + return "local_webgpu"; // Three.js + WebGPU + } else { + return "remote_gpu"; // Remote cuGraph + GPU + } +}; +``` + +## 🚀 **Advanced Integration Features** + +### 
**1. Progressive Loading** +```typescript +// For graphs >25k nodes, enable progressive loading +if (nodeCount > 25000) { + config.progressive_loading = true; + config.initial_load_size = 10000; // Load first 10k nodes + config.batch_size = 5000; // Load 5k at a time +} +``` + +### **2. WebSocket Real-time Updates** +```typescript +// Real-time parameter updates via WebSocket +const updateLayoutAlgorithm = (algorithm: string) => { + if (wsRef.current?.readyState === WebSocket.OPEN) { + wsRef.current.send(JSON.stringify({ + type: "update_params", + layout_algorithm: algorithm + })); + } +}; +``` + +### **3. Memory-Aware Quality Settings** +```typescript +const adjustQuality = (availableMemory: number, nodeCount: number) => { + if (availableMemory < 256) return "low"; // Mobile devices + if (availableMemory < 512) return "medium"; // Standard devices + if (nodeCount > 100000) return "high"; // Large graphs + return "ultra"; // High-end systems +}; +``` + +## 💡 **Best Practices for Your Stack** + +### **1. Bundle Optimization** +- Use dynamic imports for 3D libraries +- Lazy load based on graph size detection +- Implement service worker caching for repeated visualizations + +### **2. Memory Management** +```typescript +// Cleanup Three.js resources +const cleanup = () => { + if (graphRef.current) { + graphRef.current.scene?.traverse((object) => { + if (object.geometry) object.geometry.dispose(); + if (object.material) object.material.dispose(); + }); + graphRef.current.renderer?.dispose(); + } +}; +``` + +### **3. Responsive Rendering** +```typescript +// Adjust complexity based on device capabilities +const getDeviceCapabilities = () => ({ + memory: (navigator as any).deviceMemory || 4, // GB + cores: navigator.hardwareConcurrency || 4, + gpu: 'gpu' in navigator ? 
'webgpu' : 'webgl' +}); +``` + +## 🎯 **Integration Results** + +✅ **Seamless fallback** between local and remote rendering +✅ **Library version consistency** across client and server +✅ **Memory-aware quality adjustment** based on device capabilities +✅ **Progressive enhancement** from SVG → WebGL → WebGPU → Remote GPU +✅ **Real-time parameter updates** via WebSocket +✅ **Zero-config optimization** based on graph complexity + +This integration provides the best of both worlds: the interactivity of your existing Three.js/D3.js stack for smaller graphs, and the scalability of remote GPU processing for large-scale visualizations. \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/README.md b/nvidia/txt2kg/assets/deploy/services/gpu-viz/README.md new file mode 100644 index 0000000..24c480a --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/README.md @@ -0,0 +1,221 @@ +# Unified GPU Graph Visualization Service + +## 🚀 Overview + +The unified service combines **PyGraphistry Cloud** and **Local GPU (cuGraph)** processing into a single FastAPI service, giving you maximum flexibility for graph visualization. + +## ⚡ Processing Modes + +| Mode | Description | Requirements | +|------|-------------|--------------| +| **PyGraphistry Cloud** | Interactive GPU embeds in browser | API credentials | +| **Local GPU (cuGraph)** | Full GPU processing on your hardware | NVIDIA GPU + cuGraph | +| **Local CPU** | NetworkX fallback processing | None | + +## 🛠️ Quick Setup + +### 1. Set Environment Variables (Optional) +```bash +# For PyGraphistry Cloud features +export GRAPHISTRY_PERSONAL_KEY="your_personal_key" +export GRAPHISTRY_SECRET_KEY="your_secret_key" +``` + +### 2. 
Run the Service + +#### Option A: Direct Python +```bash +cd services +python unified_gpu_service.py +``` + +#### Option B: Using Startup Script +```bash +cd services +./start_gpu_services.sh +``` + +#### Option C: Docker (NVIDIA PyG Container) +```bash +cd services +docker build -t unified-gpu-viz . +docker run --gpus all -p 8080:8080 \ + -e GRAPHISTRY_PERSONAL_KEY="your_key" \ + -e GRAPHISTRY_SECRET_KEY="your_secret" \ + unified-gpu-viz +``` + +## 📡 API Usage + +### Process Graph with Mode Selection + +```bash +curl -X POST http://localhost:8080/api/visualize \ + -H "Content-Type: application/json" \ + -d '{ + "graph_data": { + "nodes": [{"id": "1", "name": "Node 1"}, {"id": "2", "name": "Node 2"}], + "links": [{"source": "1", "target": "2", "name": "edge_1_2"}] + }, + "processing_mode": "local_gpu", + "layout_algorithm": "force_atlas2", + "clustering_algorithm": "leiden", + "compute_centrality": true + }' +``` + +### Check Available Capabilities + +```bash +curl http://localhost:8080/api/capabilities +``` + +Response: +```json +{ + "processing_modes": { + "pygraphistry_cloud": {"available": true, "description": "..."}, + "local_gpu": {"available": true, "description": "..."}, + "local_cpu": {"available": true, "description": "..."} + }, + "has_rapids": true, + "gpu_available": true +} +``` + +## 🎯 Frontend Integration + +### React Component Usage + +```tsx +import { UnifiedGPUViewer } from '@/components/unified-gpu-viewer' + +function MyApp() { + const graphData = { + nodes: [...], + links: [...] 
+ } + + return ( + console.error(error)} + /> + ) +} +``` + +### Mode-Specific Processing + +```javascript +// PyGraphistry Cloud mode +const response = await fetch('/api/unified-gpu/visualize', { + method: 'POST', + headers: { 'Content-Type': 'application/json' }, + body: JSON.stringify({ + graph_data: { nodes, links }, + processing_mode: 'pygraphistry_cloud', + layout_type: 'force', + clustering: true, + gpu_acceleration: true + }) +}) + +// Local GPU mode +const response = await fetch('/api/unified-gpu/visualize', { + method: 'POST', + headers: { 'Content-Type': 'application/json' }, + body: JSON.stringify({ + graph_data: { nodes, links }, + processing_mode: 'local_gpu', + layout_algorithm: 'force_atlas2', + clustering_algorithm: 'leiden', + compute_centrality: true + }) +}) +``` + +## 🔧 Configuration Options + +### PyGraphistry Cloud Mode +- `layout_type`: "force", "circular", "hierarchical" +- `gpu_acceleration`: true/false +- `clustering`: true/false + +### Local GPU Mode +- `layout_algorithm`: "force_atlas2", "spectral", "fruchterman_reingold" +- `clustering_algorithm`: "leiden", "louvain", "spectral" +- `compute_centrality`: true/false + +### Local CPU Mode +- Basic processing with NetworkX fallback +- No additional configuration needed + +## 📊 Response Format + +```json +{ + "processed_nodes": [...], + "processed_edges": [...], + "processing_mode": "local_gpu", + "embed_url": "https://hub.graphistry.com/...", // Only for cloud mode + "layout_positions": {...}, // Only for local GPU mode + "clusters": {...}, + "centrality": {...}, + "stats": { + "node_count": 1000, + "edge_count": 5000, + "gpu_accelerated": true, + "layout_computed": true, + "clusters_computed": true + }, + "timestamp": "2024-01-01T12:00:00Z" +} +``` + +## 🚀 Benefits of Unified Approach + +### ✅ Advantages +- **Single service** - One port, one deployment +- **Mode switching** - Choose best processing per graph +- **Fallback handling** - Graceful degradation if GPU unavailable +- 
**Consistent API** - Same interface for all modes +- **Better testing** - Easy comparison between modes + +### 🎯 Use Cases +- **PyGraphistry Cloud**: Sharing visualizations, demos, production embeds +- **Local GPU**: Private data, large-scale processing, custom algorithms +- **Local CPU**: Development, testing, small graphs + +## 🐛 Troubleshooting + +### GPU Not Detected +```bash +# Check GPU availability +nvidia-smi + +# Check RAPIDS installation +python -c "import cudf, cugraph; print('RAPIDS OK')" +``` + +### PyGraphistry Credentials +```bash +# Verify credentials are set +echo $GRAPHISTRY_PERSONAL_KEY +echo $GRAPHISTRY_SECRET_KEY + +# Test connection +python -c "import graphistry; graphistry.register(personal_key_id='$GRAPHISTRY_PERSONAL_KEY', personal_key_secret='$GRAPHISTRY_SECRET_KEY'); print('PyGraphistry OK')" +``` + +### Service Health +```bash +curl http://localhost:8080/api/health +``` + +## 📈 Performance Tips + +1. **Large graphs (>100k nodes)**: Use `local_gpu` mode +2. **Sharing/demos**: Use `pygraphistry_cloud` mode +3. **Development**: Use `local_cpu` mode for speed +4. 
**Mixed workloads**: Switch modes dynamically based on graph size \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/local_gpu_viz_service.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/local_gpu_viz_service.py new file mode 100644 index 0000000..34197fe --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/local_gpu_viz_service.py @@ -0,0 +1,443 @@ +import os +import json +import numpy as np +import pandas as pd +from typing import Dict, List, Any, Optional, Tuple +import asyncio +import logging +from datetime import datetime +from fastapi import FastAPI, HTTPException, WebSocket, WebSocketDisconnect +from fastapi.staticfiles import StaticFiles +from fastapi.responses import HTMLResponse +from pydantic import BaseModel +import uvicorn + +# GPU-accelerated imports (available in NVIDIA PyG container) +try: + import cudf + import cugraph + import cupy as cp + from cuml import UMAP + HAS_RAPIDS = True + print("✓ RAPIDS cuGraph/cuDF/cuML available") +except ImportError: + HAS_RAPIDS = False + print("⚠ RAPIDS not available, falling back to CPU") + import networkx as nx + +try: + import torch + import torch_geometric + HAS_TORCH_GEOMETRIC = True + print("✓ PyTorch Geometric available") +except ImportError: + HAS_TORCH_GEOMETRIC = False + print("⚠ PyTorch Geometric not available") + +# Configure logging +logging.basicConfig(level=logging.INFO) +logger = logging.getLogger(__name__) + +class GraphData(BaseModel): + nodes: List[Dict[str, Any]] + links: List[Dict[str, Any]] + +class VisualizationRequest(BaseModel): + graph_data: GraphData + layout_algorithm: str = "force_atlas2" # force_atlas2, fruchterman_reingold, spectral + clustering_algorithm: str = "leiden" # leiden, louvain, spectral + gpu_acceleration: bool = True + compute_centrality: bool = True + +class GPUGraphProcessor: + """GPU-accelerated graph processing using cuGraph""" + + def __init__(self): + self.use_gpu = HAS_RAPIDS + logger.info(f"GPU Graph Processor 
initialized (GPU: {self.use_gpu})") + + def create_cugraph_from_data(self, nodes: List[Dict], edges: List[Dict]) -> 'cugraph.Graph': + """Create cuGraph from node/edge data""" + if not self.use_gpu: + raise RuntimeError("GPU libraries not available") + + # Create edge dataframe + edge_data = [] + for edge in edges: + edge_data.append({ + 'src': edge['source'], + 'dst': edge['target'], + 'weight': edge.get('weight', 1.0) + }) + + # Convert to cuDF + edges_df = cudf.DataFrame(edge_data) + + # Create cuGraph + G = cugraph.Graph() + G.from_cudf_edgelist(edges_df, source='src', destination='dst', edge_attr='weight') + + return G, edges_df + + def compute_gpu_layout(self, G, algorithm: str = "force_atlas2") -> Dict[str, Tuple[float, float]]: + """Compute GPU-accelerated graph layout""" + try: + if algorithm == "force_atlas2": + layout_df = cugraph.force_atlas2(G) + elif algorithm == "fruchterman_reingold": + # Use spectral as fallback since FR might not be available + layout_df = cugraph.spectral_layout(G, dim=2) + else: # spectral + layout_df = cugraph.spectral_layout(G, dim=2) + + # Convert to dictionary + positions = {} + for _, row in layout_df.iterrows(): + node_id = str(row['vertex']) + positions[node_id] = (float(row['x']), float(row['y'])) + + logger.info(f"Computed {algorithm} layout for {len(positions)} nodes on GPU") + return positions + + except Exception as e: + logger.error(f"GPU layout computation failed: {e}") + return {} + + def compute_gpu_clustering(self, G, algorithm: str = "leiden") -> Dict[str, int]: + """Compute GPU-accelerated community detection""" + try: + if algorithm == "leiden": + clusters_df, modularity = cugraph.leiden(G) + elif algorithm == "louvain": + clusters_df, modularity = cugraph.louvain(G) + else: # spectral clustering + clusters_df = cugraph.spectral_clustering(G, n_clusters=10) + modularity = 0.0 + + # Convert to dictionary + clusters = {} + for _, row in clusters_df.iterrows(): + node_id = str(row['vertex']) + clusters[node_id] 
= int(row['partition']) + + logger.info(f"Computed {algorithm} clustering on GPU (modularity: {modularity:.3f})") + return clusters + + except Exception as e: + logger.error(f"GPU clustering failed: {e}") + return {} + + def compute_gpu_centrality(self, G) -> Dict[str, Dict[str, float]]: + """Compute GPU-accelerated centrality measures""" + centrality_data = {} + + try: + # PageRank + pagerank_df = cugraph.pagerank(G) + pagerank = {} + for _, row in pagerank_df.iterrows(): + pagerank[str(row['vertex'])] = float(row['pagerank']) + centrality_data['pagerank'] = pagerank + + # Betweenness centrality (for smaller graphs) + if G.number_of_vertices() < 5000: + betweenness_df = cugraph.betweenness_centrality(G) + betweenness = {} + for _, row in betweenness_df.iterrows(): + betweenness[str(row['vertex'])] = float(row['betweenness_centrality']) + centrality_data['betweenness'] = betweenness + + logger.info(f"Computed centrality measures on GPU") + return centrality_data + + except Exception as e: + logger.error(f"GPU centrality computation failed: {e}") + return {} + +class LocalGPUVisualizationService: + """Local GPU-powered interactive graph visualization service""" + + def __init__(self): + self.gpu_processor = GPUGraphProcessor() + self.active_connections: List[WebSocket] = [] + + async def process_graph(self, request: VisualizationRequest) -> Dict[str, Any]: + """Process graph with GPU acceleration""" + try: + nodes = request.graph_data.nodes + edges = request.graph_data.links + + result = { + "nodes": nodes.copy(), + "edges": edges.copy(), + "gpu_processed": False, + "layout_positions": {}, + "clusters": {}, + "centrality": {}, + "stats": {}, + "timestamp": datetime.now().isoformat() + } + + if request.gpu_acceleration and self.gpu_processor.use_gpu: + logger.info("=== GPU PROCESSING START ===") + + # Create cuGraph + G, edges_df = self.gpu_processor.create_cugraph_from_data(nodes, edges) + + # Compute layout on GPU + positions = 
self.gpu_processor.compute_gpu_layout(G, request.layout_algorithm) + if positions: + result["layout_positions"] = positions + # Add positions to nodes + for node in result["nodes"]: + node_id = str(node["id"]) + if node_id in positions: + node["x"], node["y"] = positions[node_id] + + # Compute clustering on GPU + clusters = self.gpu_processor.compute_gpu_clustering(G, request.clustering_algorithm) + if clusters: + result["clusters"] = clusters + # Add cluster info to nodes + for node in result["nodes"]: + node_id = str(node["id"]) + if node_id in clusters: + node["cluster"] = clusters[node_id] + + # Compute centrality on GPU + if request.compute_centrality: + centrality = self.gpu_processor.compute_gpu_centrality(G) + result["centrality"] = centrality + # Add centrality to nodes + for node in result["nodes"]: + node_id = str(node["id"]) + for metric, values in centrality.items(): + if node_id in values: + node[metric] = values[node_id] + + result["gpu_processed"] = True + result["stats"] = { + "node_count": len(nodes), + "edge_count": len(edges), + "gpu_accelerated": True, + "layout_computed": len(positions) > 0, + "clusters_computed": len(clusters) > 0, + "centrality_computed": len(centrality) > 0 + } + + logger.info("=== GPU PROCESSING COMPLETE ===") + + return result + + except Exception as e: + logger.error(f"Graph processing failed: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + async def broadcast_update(self, data: Dict[str, Any]): + """Broadcast updates to all connected WebSocket clients""" + if self.active_connections: + message = json.dumps(data) + for connection in self.active_connections.copy(): + try: + await connection.send_text(message) + except WebSocketDisconnect: + self.active_connections.remove(connection) + +# FastAPI app +app = FastAPI(title="Local GPU Graph Visualization", version="1.0.0") +service = LocalGPUVisualizationService() + +@app.post("/api/process") +async def process_graph(request: VisualizationRequest): + 
"""Process graph with local GPU acceleration""" + result = await service.process_graph(request) + + # Broadcast to connected WebSocket clients + await service.broadcast_update({ + "type": "graph_processed", + "data": result + }) + + return result + +@app.websocket("/ws") +async def websocket_endpoint(websocket: WebSocket): + """WebSocket endpoint for real-time updates""" + await websocket.accept() + service.active_connections.append(websocket) + + try: + while True: + # Keep connection alive + await websocket.receive_text() + except WebSocketDisconnect: + service.active_connections.remove(websocket) + +@app.get("/api/capabilities") +async def get_capabilities(): + """Get GPU capabilities""" + return { + "has_rapids": HAS_RAPIDS, + "has_torch_geometric": HAS_TORCH_GEOMETRIC, + "gpu_available": HAS_RAPIDS, + "supported_layouts": ["force_atlas2", "spectral", "fruchterman_reingold"], + "supported_clustering": ["leiden", "louvain", "spectral"], + "gpu_memory": "N/A" # Could add GPU memory info here + } + +@app.get("/", response_class=HTMLResponse) +async def get_visualization_page(): + """Serve the interactive visualization page""" + return """ + + + + Local GPU Graph Visualization + + + + +
+

🚀 Local GPU Visualization

+ +
Ready
+
+
+ + + + + """ + +@app.get("/api/health") +async def health_check(): + """Health check endpoint""" + return { + "status": "healthy", + "gpu_available": HAS_RAPIDS, + "torch_geometric": HAS_TORCH_GEOMETRIC, + "timestamp": datetime.now().isoformat() + } + +if __name__ == "__main__": + uvicorn.run(app, host="0.0.0.0", port=8081) \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/pygraphistry_service.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/pygraphistry_service.py new file mode 100644 index 0000000..9be0fa7 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/pygraphistry_service.py @@ -0,0 +1,712 @@ +import graphistry +import pandas as pd +import numpy as np +from typing import Dict, List, Any, Optional +import asyncio +import json +from datetime import datetime +import logging +from fastapi import FastAPI, HTTPException, BackgroundTasks +from pydantic import BaseModel +import uvicorn +import os +import time +from concurrent.futures import ThreadPoolExecutor +import networkx as nx +from enum import Enum + +# Configure logging +logging.basicConfig(level=logging.INFO) +logger = logging.getLogger(__name__) + +# Initialize PyGraphistry +def init_graphistry(): + """Initialize PyGraphistry with GPU acceleration""" + try: + # Set up authentication - check for different credential types + api_key = os.getenv('GRAPHISTRY_API_KEY') + personal_key = os.getenv('GRAPHISTRY_PERSONAL_KEY') + secret_key = os.getenv('GRAPHISTRY_SECRET_KEY') + username = os.getenv('GRAPHISTRY_USERNAME') + password = os.getenv('GRAPHISTRY_PASSWORD') + + if personal_key and secret_key: + # Configure for cloud API with personal key and secret + graphistry.register( + api=3, + protocol="https", + server="hub.graphistry.com", + personal_key_id=personal_key, + personal_key_secret=secret_key + ) + logger.info("PyGraphistry initialized with personal key/secret for cloud GPU acceleration") + return True + elif api_key: + # Configure for cloud API with API 
key + graphistry.register(api=3, protocol="https", server="hub.graphistry.com", api_key=api_key) + logger.info("PyGraphistry initialized with API key for cloud GPU acceleration") + return True + elif username and password: + # Configure for cloud API with username/password + graphistry.register(api=3, protocol="https", server="hub.graphistry.com", + username=username, password=password) + logger.info("PyGraphistry initialized with username/password for cloud GPU acceleration") + return True + else: + # Configure for local mode + graphistry.register(api=3) + logger.info("PyGraphistry initialized in local CPU mode") + return True + + except Exception as e: + logger.error(f"Failed to initialize PyGraphistry: {e}") + return False + +class GraphPattern(str, Enum): + RANDOM = "random" + SCALE_FREE = "scale-free" + SMALL_WORLD = "small-world" + CLUSTERED = "clustered" + HIERARCHICAL = "hierarchical" + GRID = "grid" + +class GraphData(BaseModel): + nodes: List[Dict[str, Any]] + links: List[Dict[str, Any]] + +class GraphGenerationRequest(BaseModel): + num_nodes: int + pattern: GraphPattern = GraphPattern.SCALE_FREE + avg_degree: Optional[int] = 5 + num_clusters: Optional[int] = 100 + small_world_k: Optional[int] = 6 + small_world_p: Optional[float] = 0.1 + grid_dimensions: Optional[List[int]] = [100, 100] + seed: Optional[int] = None + +class VisualizationRequest(BaseModel): + graph_data: GraphData + layout_type: Optional[str] = "force" + gpu_acceleration: Optional[bool] = True + clustering: Optional[bool] = False + node_size_attribute: Optional[str] = None + node_color_attribute: Optional[str] = None + edge_weight_attribute: Optional[str] = None + +class GraphGenerationStatus(BaseModel): + task_id: str + status: str # "running", "completed", "failed" + progress: float + message: str + result: Optional[Dict[str, Any]] = None + error: Optional[str] = None + +class LargeGraphGenerator: + """Optimized graph generation using NetworkX and NumPy for performance""" + + 
@staticmethod + def generate_random_graph(num_nodes: int, avg_degree: int = 5, seed: Optional[int] = None) -> GraphData: + """Generate random graph using Erdős–Rényi model""" + if seed: + np.random.seed(seed) + + # Calculate edge probability for desired average degree + p = avg_degree / (num_nodes - 1) + + # Use NetworkX for efficient generation + G = nx.erdos_renyi_graph(num_nodes, p, seed=seed) + + return LargeGraphGenerator._networkx_to_graphdata(G) + + @staticmethod + def generate_scale_free_graph(num_nodes: int, m: int = 3, seed: Optional[int] = None) -> GraphData: + """Generate scale-free graph using Barabási–Albert model""" + G = nx.barabasi_albert_graph(num_nodes, m, seed=seed) + return LargeGraphGenerator._networkx_to_graphdata(G) + + @staticmethod + def generate_small_world_graph(num_nodes: int, k: int = 6, p: float = 0.1, seed: Optional[int] = None) -> GraphData: + """Generate small-world graph using Watts-Strogatz model""" + G = nx.watts_strogatz_graph(num_nodes, k, p, seed=seed) + return LargeGraphGenerator._networkx_to_graphdata(G) + + @staticmethod + def generate_clustered_graph(num_nodes: int, num_clusters: int = 100, seed: Optional[int] = None) -> GraphData: + """Generate clustered graph with intra and inter-cluster connections""" + if seed: + np.random.seed(seed) + + cluster_size = num_nodes // num_clusters + G = nx.Graph() + + # Add nodes with cluster information + for i in range(num_nodes): + cluster_id = i // cluster_size + G.add_node(i, cluster=cluster_id) + + # Generate intra-cluster edges + intra_prob = 0.1 + for cluster in range(num_clusters): + cluster_start = cluster * cluster_size + cluster_end = min(cluster_start + cluster_size, num_nodes) + cluster_nodes = list(range(cluster_start, cluster_end)) + + # Create subgraph for cluster + cluster_subgraph = nx.erdos_renyi_graph(len(cluster_nodes), intra_prob) + + # Add edges to main graph with proper node mapping + for edge in cluster_subgraph.edges(): + G.add_edge(cluster_nodes[edge[0]], 
cluster_nodes[edge[1]]) + + # Generate inter-cluster edges + inter_prob = 0.001 + for i in range(num_nodes): + for j in range(i + 1, num_nodes): + if G.nodes[i].get('cluster') != G.nodes[j].get('cluster'): + if np.random.random() < inter_prob: + G.add_edge(i, j) + + return LargeGraphGenerator._networkx_to_graphdata(G) + + @staticmethod + def generate_hierarchical_graph(num_nodes: int, branching_factor: int = 3, seed: Optional[int] = None) -> GraphData: + """Generate hierarchical (tree-like) graph""" + G = nx.random_tree(num_nodes, seed=seed) + + # Add some cross-links to make it more interesting + if seed: + np.random.seed(seed) + + # Add 10% additional edges for cross-connections + num_additional_edges = max(1, num_nodes // 10) + nodes = list(G.nodes()) + + for _ in range(num_additional_edges): + u, v = np.random.choice(nodes, 2, replace=False) + if not G.has_edge(u, v): + G.add_edge(u, v) + + return LargeGraphGenerator._networkx_to_graphdata(G) + + @staticmethod + def generate_grid_graph(dimensions: List[int], seed: Optional[int] = None) -> GraphData: + """Generate 2D or 3D grid graph""" + if len(dimensions) == 2: + G = nx.grid_2d_graph(dimensions[0], dimensions[1]) + elif len(dimensions) == 3: + G = nx.grid_graph(dimensions) + else: + raise ValueError("Grid dimensions must be 2D or 3D") + + # Convert coordinate tuples to integer node IDs + mapping = {node: i for i, node in enumerate(G.nodes())} + G = nx.relabel_nodes(G, mapping) + + return LargeGraphGenerator._networkx_to_graphdata(G) + + @staticmethod + def _networkx_to_graphdata(G: nx.Graph) -> GraphData: + """Convert NetworkX graph to GraphData format""" + nodes = [] + links = [] + + # Convert nodes + for node_id in G.nodes(): + node_data = G.nodes[node_id] + node = { + "id": f"n{node_id}", + "name": f"Node {node_id}", + "val": np.random.randint(1, 11), + "degree": G.degree(node_id) + } + + # Add cluster information if available + if 'cluster' in node_data: + node['group'] = f"cluster_{node_data['cluster']}" 
+ else: + node['group'] = f"group_{node_id % 10}" + + nodes.append(node) + + # Convert edges + for edge in G.edges(): + link = { + "source": f"n{edge[0]}", + "target": f"n{edge[1]}", + "name": f"link_{edge[0]}_{edge[1]}", + "weight": np.random.uniform(0.1, 5.0) + } + links.append(link) + + return GraphData(nodes=nodes, links=links) + +class PyGraphistryService: + def __init__(self): + self.initialized = init_graphistry() + self.generation_tasks = {} # Store background tasks + self.executor = ThreadPoolExecutor(max_workers=4) + + async def generate_graph_async(self, request: GraphGenerationRequest, task_id: str): + """Generate graph asynchronously""" + try: + self.generation_tasks[task_id] = GraphGenerationStatus( + task_id=task_id, + status="running", + progress=0.0, + message="Starting graph generation..." + ) + + start_time = time.time() + + # Update progress + self.generation_tasks[task_id].progress = 10.0 + self.generation_tasks[task_id].message = f"Generating {request.pattern.value} graph with {request.num_nodes} nodes..." 
+ + # Generate graph based on pattern + if request.pattern == GraphPattern.RANDOM: + graph_data = LargeGraphGenerator.generate_random_graph( + request.num_nodes, request.avg_degree, request.seed + ) + elif request.pattern == GraphPattern.SCALE_FREE: + m = min(request.avg_degree, request.num_nodes - 1) if request.avg_degree else 3 + graph_data = LargeGraphGenerator.generate_scale_free_graph( + request.num_nodes, m, request.seed + ) + elif request.pattern == GraphPattern.SMALL_WORLD: + graph_data = LargeGraphGenerator.generate_small_world_graph( + request.num_nodes, + request.small_world_k or 6, + request.small_world_p or 0.1, + request.seed + ) + elif request.pattern == GraphPattern.CLUSTERED: + graph_data = LargeGraphGenerator.generate_clustered_graph( + request.num_nodes, request.num_clusters or 100, request.seed + ) + elif request.pattern == GraphPattern.HIERARCHICAL: + graph_data = LargeGraphGenerator.generate_hierarchical_graph( + request.num_nodes, seed=request.seed + ) + elif request.pattern == GraphPattern.GRID: + # Calculate grid dimensions for given number of nodes + if request.grid_dimensions: + dimensions = request.grid_dimensions + else: + side_length = int(np.sqrt(request.num_nodes)) + dimensions = [side_length, side_length] + graph_data = LargeGraphGenerator.generate_grid_graph(dimensions, request.seed) + else: + raise ValueError(f"Unknown graph pattern: {request.pattern}") + + # Update progress + self.generation_tasks[task_id].progress = 80.0 + self.generation_tasks[task_id].message = "Computing graph statistics..." 
+ + # Calculate statistics + generation_time = time.time() - start_time + stats = { + "node_count": len(graph_data.nodes), + "edge_count": len(graph_data.links), + "generation_time": generation_time, + "density": len(graph_data.links) / (len(graph_data.nodes) * (len(graph_data.nodes) - 1) / 2) if len(graph_data.nodes) > 1 else 0, + "avg_degree": 2 * len(graph_data.links) / len(graph_data.nodes) if len(graph_data.nodes) > 0 else 0, + "pattern": request.pattern.value, + "parameters": request.model_dump() + } + + # Complete task + self.generation_tasks[task_id].status = "completed" + self.generation_tasks[task_id].progress = 100.0 + self.generation_tasks[task_id].message = f"Generated {stats['node_count']} nodes and {stats['edge_count']} edges in {generation_time:.2f}s" + self.generation_tasks[task_id].result = { + "graph_data": graph_data.model_dump(), + "stats": stats + } + + logger.info(f"Graph generation completed for task {task_id}: {stats}") + + except Exception as e: + logger.error(f"Graph generation failed for task {task_id}: {e}") + self.generation_tasks[task_id].status = "failed" + self.generation_tasks[task_id].error = str(e) + self.generation_tasks[task_id].message = f"Generation failed: {e}" + + async def start_graph_generation(self, request: GraphGenerationRequest) -> str: + """Start graph generation as background task""" + task_id = f"gen_{int(time.time() * 1000)}" + + # Run generation in thread pool to avoid blocking + loop = asyncio.get_event_loop() + loop.run_in_executor( + self.executor, + lambda: asyncio.run(self.generate_graph_async(request, task_id)) + ) + + return task_id + + def get_generation_status(self, task_id: str) -> Optional[GraphGenerationStatus]: + """Get status of graph generation task""" + return self.generation_tasks.get(task_id) + + async def process_graph_data(self, request: VisualizationRequest) -> Dict[str, Any]: + """Process graph data with PyGraphistry GPU acceleration""" + try: + if not self.initialized: + raise 
HTTPException(status_code=500, detail="PyGraphistry not initialized") + + # Convert to pandas DataFrames for PyGraphistry + nodes_df = pd.DataFrame(request.graph_data.nodes) + edges_df = pd.DataFrame(request.graph_data.links) + + # Ensure required columns exist + if 'id' not in nodes_df.columns: + nodes_df['id'] = nodes_df.index + if 'source' not in edges_df.columns or 'target' not in edges_df.columns: + raise HTTPException(status_code=400, detail="Links must have source and target columns") + + logger.info(f"Processing graph with {len(nodes_df)} nodes and {len(edges_df)} edges") + + # Create PyGraphistry graph object + try: + g = graphistry.edges(edges_df, 'source', 'target').nodes(nodes_df, 'id') + logger.info(f"Created PyGraphistry graph object") + except Exception as e: + logger.error(f"Failed to create PyGraphistry graph: {e}") + raise HTTPException(status_code=500, detail=f"Graph creation failed: {e}") + + # Apply GPU-accelerated processing + if request.gpu_acceleration: + g = await self._apply_gpu_acceleration(g, request) + + # Apply clustering if requested + if request.clustering: + g = await self._apply_clustering(g) + + # Generate layout + g = await self._generate_layout(g, request.layout_type) + + # Extract processed data + try: + processed_nodes = g._nodes.to_dict('records') if g._nodes is not None else nodes_df.to_dict('records') + processed_edges = g._edges.to_dict('records') if g._edges is not None else edges_df.to_dict('records') + logger.info(f"Extracted {len(processed_nodes)} nodes and {len(processed_edges)} edges") + except Exception as e: + logger.warning(f"Data extraction failed, using original data: {e}") + processed_nodes = nodes_df.to_dict('records') + processed_edges = edges_df.to_dict('records') + + # Generate embedding URL for interactive visualization + embed_url = None + local_viz_data = None + + try: + embed_url = g.plot(render=False) + logger.info(f"Generated PyGraphistry embed URL: {embed_url}") + except Exception as e: + 
logger.warning(f"Could not generate embed URL (likely running in local mode): {e}") + + # Create local visualization data as fallback + try: + local_viz_data = self._create_local_viz_data(g, processed_nodes, processed_edges) + logger.info("Generated local visualization data as fallback") + except Exception as viz_e: + logger.warning(f"Could not generate local visualization data: {viz_e}") + + return { + "processed_nodes": processed_nodes, + "processed_edges": processed_edges, + "embed_url": embed_url, + "local_viz_data": local_viz_data, + "stats": { + "node_count": len(processed_nodes), + "edge_count": len(processed_edges), + "gpu_accelerated": request.gpu_acceleration, + "clustered": request.clustering, + "layout_type": request.layout_type, + "has_embed_url": embed_url is not None, + "has_local_viz": local_viz_data is not None + }, + "timestamp": datetime.now().isoformat() + } + + except Exception as e: + logger.error(f"Error processing graph data: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + async def _apply_gpu_acceleration(self, g, request: VisualizationRequest): + """Apply GPU acceleration using PyGraphistry's vector processing""" + try: + if not request.gpu_acceleration: + logger.info("GPU acceleration disabled by request") + return g + + logger.info("=== GPU ACCELERATION ATTEMPT ===") + logger.info(f"PyGraphistry object type: {type(g)}") + logger.info(f"Available methods: {[method for method in dir(g) if not method.startswith('_')]}") + + # Check what GPU methods are actually available + has_compute_igraph = hasattr(g, 'compute_igraph') + has_umap = hasattr(g, 'umap') + logger.info(f"Has compute_igraph: {has_compute_igraph}") + logger.info(f"Has UMAP: {has_umap}") + + gpu_operations_successful = 0 + total_gpu_operations = 0 + + # Compute centrality measures if available + total_gpu_operations += 1 + try: + if has_compute_igraph and len(g._nodes) < 50000: # Limit for performance + logger.info("Attempting PageRank computation...") + g = 
g.compute_igraph('pagerank', out_col='pagerank') + logger.info("✓ SUCCESS: Computed PageRank centrality with GPU") + gpu_operations_successful += 1 + else: + reason = "too many nodes" if len(g._nodes) >= 50000 else "compute_igraph not available" + logger.warning(f"✗ SKIPPED: PageRank computation ({reason})") + except Exception as e: + logger.warning(f"✗ FAILED: PageRank computation failed: {e}") + + # Apply UMAP for node positioning if available and beneficial + total_gpu_operations += 1 + try: + if has_umap and len(g._nodes) > 100 and len(g._nodes) < 10000: + logger.info("Attempting UMAP for node positioning...") + g = g.umap() + logger.info("✓ SUCCESS: Applied UMAP for node positioning") + gpu_operations_successful += 1 + else: + reason = ("UMAP not available" if not has_umap else + "too few nodes" if len(g._nodes) <= 100 else "too many nodes") + logger.warning(f"✗ SKIPPED: UMAP processing ({reason})") + except Exception as e: + logger.warning(f"✗ FAILED: UMAP processing failed: {e}") + + logger.info(f"=== GPU ACCELERATION SUMMARY ===") + logger.info(f"GPU operations successful: {gpu_operations_successful}/{total_gpu_operations}") + logger.info(f"GPU utilization: {(gpu_operations_successful/total_gpu_operations)*100:.1f}%") + + return g + except Exception as e: + logger.warning(f"GPU acceleration failed completely, falling back to CPU: {e}") + return g + + async def _apply_clustering(self, g): + """Apply GPU-accelerated clustering""" + try: + logger.info("=== CLUSTERING ATTEMPT ===") + + # Use PyGraphistry's built-in clustering if available + if hasattr(g, 'compute_igraph') and len(g._nodes) < 20000: # Limit for performance + logger.info("Attempting Leiden community detection...") + try: + g = g.compute_igraph('community_leiden', out_col='cluster') + logger.info("✓ SUCCESS: Applied Leiden community detection") + return g + except Exception as e: + logger.warning(f"✗ FAILED: Leiden clustering failed: {e}") + logger.info("Attempting Louvain community detection as 
fallback...") + try: + g = g.compute_igraph('community_louvain', out_col='cluster') + logger.info("✓ SUCCESS: Applied Louvain community detection") + return g + except Exception as e2: + logger.warning(f"✗ FAILED: Louvain clustering also failed: {e2}") + else: + reason = "too many nodes" if len(g._nodes) >= 20000 else "compute_igraph not available" + logger.warning(f"✗ SKIPPED: Clustering ({reason})") + + logger.info("=== CLUSTERING SUMMARY: No clustering applied ===") + return g + except Exception as e: + logger.warning(f"Clustering failed completely: {e}") + return g + + async def _generate_layout(self, g, layout_type: str = "force"): + """Generate layout using PyGraphistry's algorithms""" + try: + logger.info(f"Generating {layout_type} layout") + + # Only apply layout computation for reasonable graph sizes + if len(g._nodes) > 50000: + logger.info("Skipping layout computation for very large graph") + return g + + if hasattr(g, 'compute_igraph'): + try: + if layout_type == "force": + g = g.compute_igraph('layout_fruchterman_reingold', out_cols=['x', 'y']) + logger.info("Applied Fruchterman-Reingold force layout") + elif layout_type == "circular": + g = g.compute_igraph('layout_circle', out_cols=['x', 'y']) + logger.info("Applied circular layout") + elif layout_type == "hierarchical": + g = g.compute_igraph('layout_sugiyama', out_cols=['x', 'y']) + logger.info("Applied hierarchical layout") + else: + # Default to force-directed + g = g.compute_igraph('layout_fruchterman_reingold', out_cols=['x', 'y']) + logger.info("Applied default force layout") + except Exception as e: + logger.warning(f"Layout computation failed: {e}") + else: + logger.info("Layout computation not available, using default positioning") + + return g + except Exception as e: + logger.warning(f"Layout generation failed: {e}") + return g + + def _create_local_viz_data(self, g, processed_nodes: List[Dict], processed_edges: List[Dict]) -> Dict[str, Any]: + """Create local visualization data when 
embed URL cannot be generated""" + try: + # Extract layout positions if available + positions = {} + if g._nodes is not None and 'x' in g._nodes.columns and 'y' in g._nodes.columns: + for _, row in g._nodes.iterrows(): + node_id = row.get('id', row.name) + positions[str(node_id)] = { + 'x': float(row['x']) if pd.notna(row['x']) else 0, + 'y': float(row['y']) if pd.notna(row['y']) else 0 + } + + # Add cluster information if available + clusters = {} + if g._nodes is not None and 'cluster' in g._nodes.columns: + for _, row in g._nodes.iterrows(): + node_id = row.get('id', row.name) + if pd.notna(row['cluster']): + clusters[str(node_id)] = int(row['cluster']) + + # Create enhanced nodes with layout and cluster info + enhanced_nodes = [] + for node in processed_nodes: + enhanced_node = node.copy() + node_id = str(node.get('id', '')) + + if node_id in positions: + enhanced_node.update(positions[node_id]) + + if node_id in clusters: + enhanced_node['cluster'] = clusters[node_id] + + enhanced_nodes.append(enhanced_node) + + return { + "nodes": enhanced_nodes, + "edges": processed_edges, + "positions": positions, + "clusters": clusters, + "layout_computed": len(positions) > 0, + "clusters_computed": len(clusters) > 0 + } + + except Exception as e: + logger.error(f"Failed to create local visualization data: {e}") + return { + "nodes": processed_nodes, + "edges": processed_edges, + "positions": {}, + "clusters": {}, + "layout_computed": False, + "clusters_computed": False + } + + async def get_graph_stats(self, graph_data: GraphData) -> Dict[str, Any]: + """Get GPU-accelerated graph statistics""" + try: + nodes_df = pd.DataFrame(graph_data.nodes) + edges_df = pd.DataFrame(graph_data.links) + + g = graphistry.edges(edges_df, 'source', 'target').nodes(nodes_df, 'id') + + # Compute various graph metrics using GPU acceleration + stats = { + "node_count": len(nodes_df), + "edge_count": len(edges_df), + "density": len(edges_df) / (len(nodes_df) * (len(nodes_df) - 1)) if 
len(nodes_df) > 1 else 0, + "timestamp": datetime.now().isoformat() + } + + # Add centrality measures if possible + try: + if len(nodes_df) < 10000 and hasattr(g, 'compute_igraph'): # Only for reasonably sized graphs + g_with_metrics = g.compute_igraph('pagerank', out_col='pagerank') + + if g_with_metrics._nodes is not None and 'pagerank' in g_with_metrics._nodes.columns: + pagerank_data = g_with_metrics._nodes['pagerank'].to_list() + stats.update({ + "avg_pagerank": float(np.mean(pagerank_data)), + "max_pagerank": float(np.max(pagerank_data)) + }) + logger.info("Computed PageRank statistics") + except Exception as e: + logger.warning(f"Could not compute centrality measures: {e}") + + return stats + + except Exception as e: + logger.error(f"Error computing graph stats: {e}") + raise HTTPException(status_code=500, detail=str(e)) + +# FastAPI app +app = FastAPI(title="PyGraphistry GPU Visualization Service", version="1.0.0") +service = PyGraphistryService() + +@app.post("/api/generate") +async def generate_graph(request: GraphGenerationRequest): + """Start graph generation as background task""" + if request.num_nodes > 1000000: + raise HTTPException(status_code=400, detail="Maximum 1 million nodes allowed") + + task_id = await service.start_graph_generation(request) + return {"task_id": task_id, "status": "started"} + +@app.get("/api/generate/{task_id}") +async def get_generation_status(task_id: str): + """Get status of graph generation task""" + status = service.get_generation_status(task_id) + if not status: + raise HTTPException(status_code=404, detail="Task not found") + return status + +@app.post("/api/visualize") +async def visualize_graph(request: VisualizationRequest): + """Process graph data with PyGraphistry GPU acceleration""" + return await service.process_graph_data(request) + +@app.post("/api/stats") +async def get_graph_statistics(graph_data: GraphData): + """Get GPU-accelerated graph statistics""" + return await service.get_graph_stats(graph_data) + 
+@app.get("/api/health") +async def health_check(): + """Health check endpoint""" + return { + "status": "healthy", + "pygraphistry_initialized": service.initialized, + "timestamp": datetime.now().isoformat() + } + +@app.get("/api/patterns") +async def get_available_patterns(): + """Get list of available graph generation patterns""" + return { + "patterns": [ + { + "name": pattern.value, + "description": { + GraphPattern.RANDOM: "Random graph using Erdős–Rényi model", + GraphPattern.SCALE_FREE: "Scale-free graph using Barabási–Albert model", + GraphPattern.SMALL_WORLD: "Small-world graph using Watts-Strogatz model", + GraphPattern.CLUSTERED: "Clustered graph with community structure", + GraphPattern.HIERARCHICAL: "Hierarchical tree-like graph with cross-links", + GraphPattern.GRID: "2D or 3D grid graph" + }[pattern] + } for pattern in GraphPattern + ] + } + +if __name__ == "__main__": + uvicorn.run(app, host="0.0.0.0", port=8080) \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_gpu_rendering_service.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_gpu_rendering_service.py new file mode 100644 index 0000000..7b820df --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_gpu_rendering_service.py @@ -0,0 +1,1579 @@ +#!/usr/bin/env python3 +""" +Remote GPU Rendering Service + +A standalone service that receives graph data, processes it with GPU acceleration, +renders interactive visualizations, and serves them via iframe embeds. +This provides an alternative to PyGraphistry cloud for large-scale visualization. 
+""" + +import os +import json +import uuid +import asyncio +import logging +from datetime import datetime, timedelta +from typing import Dict, List, Any, Optional, Tuple +from fastapi import FastAPI, HTTPException, WebSocket, WebSocketDisconnect, BackgroundTasks, Request +from fastapi.staticfiles import StaticFiles +from fastapi.responses import HTMLResponse, FileResponse +from fastapi.middleware.cors import CORSMiddleware +from pydantic import BaseModel +import uvicorn +import redis +from pathlib import Path + +# GPU-accelerated imports +try: + import cudf + import cugraph + import cupy as cp + from cuml import UMAP + HAS_RAPIDS = True + print("✓ RAPIDS cuGraph/cuDF/cuML available for remote rendering") +except ImportError: + HAS_RAPIDS = False + print("⚠ RAPIDS not available, falling back to CPU for remote rendering") + import networkx as nx + +# Configure logging +logging.basicConfig(level=logging.INFO) +logger = logging.getLogger(__name__) + +class GraphData(BaseModel): + nodes: List[Dict[str, Any]] + links: List[Dict[str, Any]] + +class RemoteRenderingRequest(BaseModel): + graph_data: GraphData + layout_algorithm: str = "force_atlas2" + clustering_algorithm: str = "leiden" + compute_centrality: bool = True + render_quality: str = "high" # low, medium, high, ultra + interactive_mode: bool = True + session_id: Optional[str] = None + + # Enhanced UX parameters inspired by Graphistry + animation_duration: int = 5000 # Layout animation time in ms + show_splash: bool = False # Show loading splash screen + auto_zoom: bool = True # Auto-fit graph to view + show_labels: bool = True # Show node labels + edge_bundling: bool = False # Enable edge bundling for dense graphs + background_color: str = "#0a0a0a" # Background color + quality_preset: str = "balanced" # fast, balanced, quality + +class RemoteGPUProcessor: + """GPU processing for remote rendering""" + + def __init__(self): + self.use_gpu = HAS_RAPIDS + logger.info(f"Remote GPU processor initialized (GPU: 
{self.use_gpu})") + + def create_cugraph_from_data(self, nodes: List[Dict], edges: List[Dict]) -> Tuple[Any, Any]: + """Create cuGraph from node and edge data""" + try: + if not self.use_gpu: + return None, None + + # Create edge dataframe + edge_data = [] + for edge in edges: + edge_data.append({ + 'src': str(edge.get('source', edge.get('src', ''))), + 'dst': str(edge.get('target', edge.get('dst', ''))), + 'weight': float(edge.get('weight', 1.0)) + }) + + edges_df = cudf.DataFrame(edge_data) + + # Create graph + G = cugraph.Graph() + G.from_cudf_edgelist(edges_df, source='src', destination='dst', edge_attr='weight') + + logger.info(f"Created cuGraph with {G.number_of_nodes()} nodes and {G.number_of_edges()} edges") + return G, edges_df + + except Exception as e: + logger.error(f"Error creating cuGraph: {e}") + return None, None + + def compute_gpu_layout(self, G, algorithm: str = "force_atlas2") -> Dict[str, Tuple[float, float]]: + """Compute GPU-accelerated graph layout""" + try: + if not self.use_gpu or G is None: + return {} + + if algorithm == "force_atlas2": + layout_df = cugraph.force_atlas2(G) + elif algorithm == "spectral": + layout_df = cugraph.spectral_layout(G, dim=2) + else: + layout_df = cugraph.spectral_layout(G, dim=2) + + # Convert to dictionary with normalized coordinates + positions = {} + if len(layout_df) > 0: + # Normalize coordinates to [0, 1000] range for consistent rendering + x_min, x_max = layout_df['x'].min(), layout_df['x'].max() + y_min, y_max = layout_df['y'].min(), layout_df['y'].max() + + x_range = x_max - x_min if x_max != x_min else 1 + y_range = y_max - y_min if y_max != y_min else 1 + + for _, row in layout_df.iterrows(): + node_id = str(row['vertex']) + x_norm = ((row['x'] - x_min) / x_range) * 1000 + y_norm = ((row['y'] - y_min) / y_range) * 1000 + positions[node_id] = (float(x_norm), float(y_norm)) + + logger.info(f"Computed {algorithm} layout for {len(positions)} nodes on GPU") + return positions + + except Exception as e: + 
logger.error(f"GPU layout computation failed: {e}") + return {} + + def compute_gpu_clustering(self, G, algorithm: str = "leiden") -> Dict[str, int]: + """Compute GPU-accelerated graph clustering""" + try: + if not self.use_gpu or G is None: + return {} + + if algorithm == "leiden": + clustering_df = cugraph.leiden(G) + elif algorithm == "louvain": + clustering_df = cugraph.louvain(G) + else: + clustering_df = cugraph.louvain(G) + + clusters = {} + for _, row in clustering_df.iterrows(): + node_id = str(row['vertex']) + clusters[node_id] = int(row['partition']) + + logger.info(f"Computed {algorithm} clustering for {len(clusters)} nodes") + return clusters + + except Exception as e: + logger.error(f"GPU clustering computation failed: {e}") + return {} + + def compute_gpu_centrality(self, G) -> Dict[str, Dict[str, float]]: + """Compute various centrality metrics on GPU""" + try: + if not self.use_gpu or G is None: + return {} + + centrality = {} + + # PageRank + try: + pagerank_df = cugraph.pagerank(G) + centrality["pagerank"] = {} + for _, row in pagerank_df.iterrows(): + node_id = str(row['vertex']) + centrality["pagerank"][node_id] = float(row['pagerank']) + except Exception as e: + logger.warning(f"PageRank computation failed: {e}") + + # Betweenness centrality (for smaller graphs) + if G.number_of_nodes() < 10000: # Limit for performance + try: + betweenness_df = cugraph.betweenness_centrality(G) + centrality["betweenness"] = {} + for _, row in betweenness_df.iterrows(): + node_id = str(row['vertex']) + centrality["betweenness"][node_id] = float(row['betweenness_centrality']) + except Exception as e: + logger.warning(f"Betweenness centrality computation failed: {e}") + + logger.info(f"Computed centrality metrics: {list(centrality.keys())}") + return centrality + + except Exception as e: + logger.error(f"GPU centrality computation failed: {e}") + return {} + +class RemoteRenderingService: + """Remote GPU-powered graph rendering service with iframe embedding""" + 
+ def __init__(self): + self.gpu_processor = RemoteGPUProcessor() + self.active_connections: Dict[str, WebSocket] = {} + self.redis_client = None + self.sessions = {} # In-memory session storage (use Redis in production) + self.datasets = {} # Cached preprocessed datasets + self.session_ttl = timedelta(hours=24) + self.dataset_ttl = timedelta(days=7) # Datasets live longer + + # Initialize Redis for session storage (optional) + try: + self.redis_client = redis.Redis( + host=os.getenv('REDIS_HOST', 'localhost'), + port=int(os.getenv('REDIS_PORT', 6379)), + decode_responses=True + ) + self.redis_client.ping() + logger.info("Redis connected for session storage") + except Exception as e: + logger.warning(f"Redis not available, using in-memory storage: {e}") + + async def process_and_store_graph(self, request: RemoteRenderingRequest) -> Dict[str, Any]: + """Process graph with GPU acceleration and store for rendering""" + session_id = request.session_id or str(uuid.uuid4()) + + try: + nodes = request.graph_data.nodes + edges = request.graph_data.links + + # Enhanced result structure for remote rendering + result = { + "session_id": session_id, + "nodes": nodes.copy(), + "edges": edges.copy(), + "gpu_processed": False, + "layout_positions": {}, + "clusters": {}, + "centrality": {}, + "render_config": { + "quality": request.render_quality, + "interactive": request.interactive_mode, + "layout_algorithm": request.layout_algorithm, + "clustering_algorithm": request.clustering_algorithm + }, + "stats": { + "node_count": len(nodes), + "edge_count": len(edges), + "processing_time": 0 + }, + "timestamp": datetime.now().isoformat(), + "embed_url": f"/embed/{session_id}" + } + + start_time = datetime.now() + + if self.gpu_processor.use_gpu: + logger.info("=== REMOTE GPU PROCESSING START ===") + + # Create cuGraph + G, edges_df = self.gpu_processor.create_cugraph_from_data(nodes, edges) + + if G is not None: + # Compute layout on GPU + positions = 
self.gpu_processor.compute_gpu_layout(G, request.layout_algorithm) + if positions: + result["layout_positions"] = positions + # Add positions to nodes + for node in result["nodes"]: + node_id = str(node["id"]) + if node_id in positions: + node["x"], node["y"] = positions[node_id] + + # Compute clustering on GPU + clusters = self.gpu_processor.compute_gpu_clustering(G, request.clustering_algorithm) + if clusters: + result["clusters"] = clusters + # Add cluster info to nodes + for node in result["nodes"]: + node_id = str(node["id"]) + if node_id in clusters: + node["cluster"] = clusters[node_id] + + # Compute centrality on GPU + if request.compute_centrality: + centrality = self.gpu_processor.compute_gpu_centrality(G) + result["centrality"] = centrality + # Add centrality to nodes + for node in result["nodes"]: + node_id = str(node["id"]) + for metric, values in centrality.items(): + if node_id in values: + node[metric] = values[node_id] + + result["gpu_processed"] = True + + # Update stats + result["stats"].update({ + "gpu_accelerated": True, + "layout_computed": len(positions) > 0, + "clusters_computed": len(clusters) > 0, + "centrality_computed": len(centrality) > 0 + }) + + logger.info("=== REMOTE GPU PROCESSING COMPLETE ===") + + # Calculate processing time + processing_time = (datetime.now() - start_time).total_seconds() + result["stats"]["processing_time"] = processing_time + + # Store session data + await self._store_session(session_id, result) + + return result + + except Exception as e: + logger.error(f"Remote graph processing failed: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + async def _store_session(self, session_id: str, data: Dict[str, Any]): + """Store session data for iframe rendering""" + expiry = datetime.now() + self.session_ttl + + if self.redis_client: + try: + # Store in Redis with TTL + self.redis_client.setex( + f"session:{session_id}", + int(self.session_ttl.total_seconds()), + json.dumps(data) + ) + except Exception as 
e: + logger.warning(f"Redis storage failed, using memory: {e}") + self.sessions[session_id] = {"data": data, "expiry": expiry} + else: + # In-memory storage + self.sessions[session_id] = {"data": data, "expiry": expiry} + + logger.info(f"Stored session {session_id} for iframe rendering") + + async def get_session_data(self, session_id: str) -> Optional[Dict[str, Any]]: + """Retrieve session data for iframe rendering""" + if self.redis_client: + try: + data = self.redis_client.get(f"session:{session_id}") + if data: + return json.loads(data) + except Exception as e: + logger.warning(f"Redis retrieval failed: {e}") + + # Check in-memory storage + if session_id in self.sessions: + session = self.sessions[session_id] + if datetime.now() < session["expiry"]: + return session["data"] + else: + # Clean up expired session + del self.sessions[session_id] + + return None + + async def cleanup_expired_sessions(self): + """Clean up expired sessions (background task)""" + current_time = datetime.now() + expired_sessions = [] + + for session_id, session in self.sessions.items(): + if current_time >= session["expiry"]: + expired_sessions.append(session_id) + + for session_id in expired_sessions: + del self.sessions[session_id] + logger.info(f"Cleaned up expired session: {session_id}") + + async def preprocess_and_cache_dataset(self, graph_data: GraphData, dataset_id: Optional[str] = None) -> str: + """Preprocess graph data and cache for reuse (like Graphistry datasets)""" + if not dataset_id: + dataset_id = str(uuid.uuid4()) + + try: + nodes = graph_data.nodes + edges = graph_data.links + + # GPU preprocessing + if self.gpu_processor.use_gpu: + G, edges_df = self.gpu_processor.create_cugraph_from_data(nodes, edges) + + if G is not None: + # Pre-compute multiple layouts for fast switching + layouts = {} + for algorithm in ["force_atlas2", "spectral"]: + positions = self.gpu_processor.compute_gpu_layout(G, algorithm) + if positions: + layouts[algorithm] = positions + + # 
Pre-compute clustering + clusters = {} + for algorithm in ["leiden", "louvain"]: + cluster_result = self.gpu_processor.compute_gpu_clustering(G, algorithm) + if cluster_result: + clusters[algorithm] = cluster_result + + # Pre-compute centrality + centrality = self.gpu_processor.compute_gpu_centrality(G) + + # Cache the preprocessed data + dataset_cache = { + "dataset_id": dataset_id, + "nodes": nodes, + "edges": edges, + "layouts": layouts, + "clusters": clusters, + "centrality": centrality, + "stats": { + "node_count": len(nodes), + "edge_count": len(edges), + "preprocessing_time": (datetime.now()).isoformat() + }, + "created_at": datetime.now().isoformat() + } + + # Store in cache with longer TTL + await self._store_dataset(dataset_id, dataset_cache) + + logger.info(f"Preprocessed and cached dataset {dataset_id} with {len(nodes)} nodes") + return dataset_id + + return dataset_id + + except Exception as e: + logger.error(f"Dataset preprocessing failed: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + async def _store_dataset(self, dataset_id: str, data: Dict[str, Any]): + """Store preprocessed dataset with longer TTL""" + expiry = datetime.now() + self.dataset_ttl + + if self.redis_client: + try: + self.redis_client.setex( + f"dataset:{dataset_id}", + int(self.dataset_ttl.total_seconds()), + json.dumps(data) + ) + except Exception as e: + logger.warning(f"Redis dataset storage failed: {e}") + self.datasets[dataset_id] = {"data": data, "expiry": expiry} + else: + self.datasets[dataset_id] = {"data": data, "expiry": expiry} + + async def get_dataset(self, dataset_id: str) -> Optional[Dict[str, Any]]: + """Retrieve cached dataset""" + if self.redis_client: + try: + data = self.redis_client.get(f"dataset:{dataset_id}") + if data: + return json.loads(data) + except Exception as e: + logger.warning(f"Redis dataset retrieval failed: {e}") + + if dataset_id in self.datasets: + dataset = self.datasets[dataset_id] + if datetime.now() < dataset["expiry"]: + 
return dataset["data"] + else: + del self.datasets[dataset_id] + + return None + + def _generate_interactive_html(self, session_data: dict, config: dict) -> str: + """Generate interactive HTML visualization using libraries consistent with frontend""" + + # Check if WebGL rendering is requested + use_webgl = config.get('use_webgl', len(session_data['processed_nodes']) > 5000) + + if use_webgl: + return self._generate_threejs_webgl_html(session_data, config) + else: + return self._generate_d3_svg_html(session_data, config) + + # Extract data + nodes = session_data['processed_nodes'] + edges = session_data['processed_edges'] + layout_positions = session_data.get('layout_positions', {}) + clusters = session_data.get('clusters', {}) + centrality = session_data.get('centrality', {}) + + # Animation and UI settings matching frontend patterns + animation_duration = config.get('animation_duration', 3000) + show_splash = config.get('show_splash', True) + auto_zoom = config.get('auto_zoom', True) + show_labels = config.get('show_labels', True) + background_color = config.get('background_color', '#0a0a0a') + render_quality = config.get('render_quality', 'high') + + # Performance settings based on render quality + quality_settings = { + 'low': {'particles': 1000, 'line_width': 1, 'node_detail': 8}, + 'medium': {'particles': 5000, 'line_width': 2, 'node_detail': 16}, + 'high': {'particles': 20000, 'line_width': 3, 'node_detail': 32}, + 'ultra': {'particles': 100000, 'line_width': 4, 'node_detail': 64} + } + settings = quality_settings.get(render_quality, quality_settings['high']) + + html_template = f""" + + + + + + GPU-Accelerated Graph Visualization + + + +
+ + + {"" if not show_splash else f''' +
+ +
+ {len(nodes):,} nodes • {len(edges):,} edges
+ GPU-accelerated rendering • {render_quality.title()} quality +
+
+
+
+
Initializing GPU compute...
+
+ '''} + + +
+ + + + +
+ + +
+
Render Quality: {render_quality.title()}
+
GPU Acceleration: Active
+
Layout: Force Atlas 2
+
FPS: --
+
+ + +
+ + + + + + + """ + + return html_template + + def _generate_threejs_webgl_html(self, session_data: dict, config: dict) -> str: + """Generate Three.js WebGL visualization with GPU acceleration""" + + # Extract data + nodes = session_data['processed_nodes'] + edges = session_data['processed_edges'] + layout_positions = session_data.get('layout_positions', {}) + clusters = session_data.get('clusters', {}) + centrality = session_data.get('centrality', {}) + + # GPU rendering settings + node_count = len(nodes) + use_instanced = node_count > 1000 + enable_lod = node_count > 25000 + render_quality = config.get('render_quality', 'high') + + html_template = f""" + + + + + + GPU WebGL Graph Visualization + + + +
+ + +
+
🚀 WebGL GPU Rendering
+
Nodes: {node_count:,}
+
FPS: --
+
Triangles: --
+
Memory: --MB
+
+ +
+ + + +
+
+ + + + + + + """ + + return html_template + + def _generate_d3_svg_html(self, session_data: dict, config: dict) -> str: + """Generate D3.js SVG visualization (original approach)""" + # Return the original D3.js SVG implementation + nodes = session_data['processed_nodes'] + edges = session_data['processed_edges'] + + return f""" + +D3.js SVG Fallback + +
D3.js SVG rendering for {len(nodes)} nodes (fallback mode)
+ + + + """ + +# FastAPI app for remote rendering +app = FastAPI( + title="Remote GPU Graph Rendering Service", + version="1.0.0", + description="GPU-accelerated graph processing and iframe-embeddable visualization service" +) + +# Add CORS middleware for iframe embedding +app.add_middleware( + CORSMiddleware, + allow_origins=["*"], # Configure appropriately for production + allow_credentials=True, + allow_methods=["*"], + allow_headers=["*"], +) + +# Initialize service +rendering_service = RemoteRenderingService() + +@app.post("/api/render") +async def render_graph(request: RemoteRenderingRequest, background_tasks: BackgroundTasks): + """Process and store graph for remote rendering""" + result = await rendering_service.process_and_store_graph(request) + + # Schedule cleanup task + background_tasks.add_task(rendering_service.cleanup_expired_sessions) + + return result + +@app.get("/embed/{session_id}", response_class=HTMLResponse) +async def get_iframe_visualization(session_id: str): + """Serve iframe-embeddable visualization""" + session_data = await rendering_service.get_session_data(session_id) + + if not session_data: + raise HTTPException(status_code=404, detail="Session not found or expired") + + # Generate interactive HTML for iframe + html_content = await rendering_service._generate_interactive_html(session_data, session_data['render_config']) + return HTMLResponse(content=html_content) + +@app.get("/api/session/{session_id}") +async def get_session_status(session_id: str): + """Get session status and metadata""" + session_data = await rendering_service.get_session_data(session_id) + + if not session_data: + raise HTTPException(status_code=404, detail="Session not found or expired") + + return { + "session_id": session_id, + "status": "ready", + "stats": session_data.get("stats", {}), + "render_config": session_data.get("render_config", {}), + "timestamp": session_data.get("timestamp") + } + +@app.websocket("/ws/{session_id}") +async def 
websocket_session_endpoint(websocket: WebSocket, session_id: str): + """WebSocket endpoint for real-time iframe communication""" + await websocket.accept() + rendering_service.active_connections[session_id] = websocket + + try: + while True: + # Handle real-time updates, parameter changes, etc. + message = await websocket.receive_text() + data = json.loads(message) + + # Handle different message types + if data.get("type") == "update_params": + # Handle parameter updates (layout changes, filtering, etc.) + await handle_parameter_update(session_id, data) + + except WebSocketDisconnect: + if session_id in rendering_service.active_connections: + del rendering_service.active_connections[session_id] + +async def handle_parameter_update(session_id: str, data: Dict[str, Any]): + """Handle real-time parameter updates""" + # Implementation for real-time parameter changes + # (layout algorithm changes, filtering, etc.) + pass + +@app.get("/api/health") +async def health_check(): + """Health check endpoint""" + return { + "status": "healthy", + "gpu_available": HAS_RAPIDS, + "active_sessions": len(rendering_service.sessions), + "active_connections": len(rendering_service.active_connections) + } + +@app.post("/api/datasets") +async def preprocess_dataset(graph_data: GraphData, background_tasks: BackgroundTasks): + """Preprocess and cache dataset for fast visualization (like Graphistry dataset upload)""" + dataset_id = await rendering_service.preprocess_and_cache_dataset(graph_data) + + background_tasks.add_task(rendering_service.cleanup_expired_sessions) + + return { + "dataset_id": dataset_id, + "status": "preprocessed", + "visualization_url": f"/visualize/{dataset_id}", + "embed_url": f"/embed-dataset/{dataset_id}", + "message": "Dataset preprocessed and cached. Use the visualization_url for direct access." 
+ } + +@app.get("/visualize/{dataset_id}", response_class=HTMLResponse) +async def visualize_cached_dataset( + dataset_id: str, + layout: str = "force_atlas2", + clustering: str = "leiden", + play: int = 5000, + splash: bool = False, + auto_zoom: bool = True, + show_labels: bool = True +): + """Visualize a cached dataset with URL parameters (like Graphistry)""" + dataset = await rendering_service.get_dataset(dataset_id) + + if not dataset: + raise HTTPException(status_code=404, detail="Dataset not found or expired") + + # Create session with cached data and URL parameters + session_data = { + "session_id": f"dataset-{dataset_id}", + "dataset_id": dataset_id, + "nodes": dataset["nodes"], + "edges": dataset["edges"], + "layout_positions": dataset["layouts"].get(layout, {}), + "clusters": dataset["clusters"].get(clustering, {}), + "centrality": dataset["centrality"], + "gpu_processed": True, + "render_config": { + "layout_algorithm": layout, + "clustering_algorithm": clustering, + "animation_duration": play, + "show_splash": splash, + "auto_zoom": auto_zoom, + "show_labels": show_labels, + "quality": "high", + "interactive": True + }, + "stats": dataset["stats"], + "timestamp": datetime.now().isoformat() + } + + # Generate enhanced HTML with URL parameters + html_content = await rendering_service._generate_interactive_html(session_data, session_data['render_config']) + return HTMLResponse(content=html_content) + +@app.get("/embed-dataset/{dataset_id}", response_class=HTMLResponse) +async def embed_cached_dataset( + dataset_id: str, + layout: str = "force_atlas2", + clustering: str = "leiden", + play: int = 5000, + splash: bool = False +): + """Embeddable iframe version of cached dataset visualization""" + return await visualize_cached_dataset(dataset_id, layout, clustering, play, splash) + +@app.get("/api/datasets/{dataset_id}") +async def get_dataset_info(dataset_id: str): + """Get information about a cached dataset""" + dataset = await 
rendering_service.get_dataset(dataset_id) + + if not dataset: + raise HTTPException(status_code=404, detail="Dataset not found or expired") + + return { + "dataset_id": dataset_id, + "stats": dataset["stats"], + "available_layouts": list(dataset["layouts"].keys()), + "available_clustering": list(dataset["clusters"].keys()), + "centrality_metrics": list(dataset["centrality"].keys()), + "created_at": dataset["created_at"], + "visualization_url": f"/visualize/{dataset_id}", + "embed_url": f"/embed-dataset/{dataset_id}" + } + +if __name__ == "__main__": + logger.info("🚀 Starting Remote GPU Rendering Service") + logger.info("📊 Features:") + logger.info(" - GPU-accelerated graph processing with cuGraph") + logger.info(" - Interactive iframe-embeddable visualizations") + logger.info(" - Real-time WebSocket communication") + logger.info(" - Session-based rendering with TTL") + logger.info(" - Scalable up to million-node graphs") + logger.info("") + logger.info("🎯 Service endpoints:") + logger.info(" - Process graph: POST /api/render") + logger.info(" - Iframe visualization: GET /embed/{session_id}") + logger.info(" - Session status: GET /api/session/{session_id}") + logger.info(" - Real-time updates: WS /ws/{session_id}") + logger.info(" - Health check: GET /api/health") + logger.info(" - Preprocess dataset: POST /api/datasets") + logger.info(" - Visualize dataset: GET /visualize/{dataset_id}") + logger.info(" - Embed dataset: GET /embed-dataset/{dataset_id}") + logger.info(" - Get dataset info: GET /api/datasets/{dataset_id}") + logger.info("") + + uvicorn.run(app, host="0.0.0.0", port=8082) \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_webgpu_clustering_service.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_webgpu_clustering_service.py new file mode 100644 index 0000000..ec16754 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_webgpu_clustering_service.py @@ -0,0 +1,800 @@ +#!/usr/bin/env 
python3 +""" +Remote WebGPU Clustering Service - CuPy Version with Semantic Clustering + +Provides GPU-accelerated graph clustering using CuPy instead of cuDF to avoid segfaults. +Uses stable CuPy operations for GPU clustering while maintaining the same API. +Enhanced with semantic clustering based on node names and content similarity. +""" + +import os +import json +import uuid +import asyncio +import logging +import numpy as np +from datetime import datetime, timedelta +from typing import Dict, List, Any, Optional, Tuple, Union +from fastapi import FastAPI, HTTPException, WebSocket, WebSocketDisconnect, BackgroundTasks +from fastapi.responses import HTMLResponse, StreamingResponse +from fastapi.middleware.cors import CORSMiddleware +from pydantic import BaseModel +import uvicorn +import time +import threading +from concurrent.futures import ThreadPoolExecutor +import base64 +from io import BytesIO + +# Import semantic clustering +from semantic_clustering_service import SemanticClusteringEngine, cluster_nodes_by_similarity + +# GPU-accelerated imports +try: + import cupy as cp + HAS_CUPY = True + print("✓ CuPy available for stable GPU clustering") +except ImportError: + HAS_CUPY = False + print("⚠ CuPy not available, falling back to CPU") + +# Optional cuGraph for force simulation (avoid cuDF operations) +try: + import cugraph + import cudf + HAS_CUGRAPH = True + print("✓ cuGraph available for force simulation") +except ImportError: + HAS_CUGRAPH = False + print("⚠ cuGraph not available") + import networkx as nx + +# WebRTC streaming imports +try: + import cv2 + import PIL.Image as PILImage + HAS_OPENCV = True + print("✓ OpenCV available for WebRTC streaming") +except ImportError: + HAS_OPENCV = False + print("⚠ OpenCV not available, WebRTC streaming disabled") + +# WebGL rendering imports +try: + import matplotlib.pyplot as plt + import matplotlib + matplotlib.use('Agg') + import plotly.graph_objects as go + import plotly.io as pio + pio.renderers.default = 
"json" + HAS_PLOTTING = True + print("✓ Plotting libraries available for server-side rendering") +except ImportError: + HAS_PLOTTING = False + print("⚠ Plotting libraries not available") + +# Configure logging +logging.basicConfig(level=logging.INFO) +logger = logging.getLogger(__name__) + +class GraphData(BaseModel): + nodes: List[Dict[str, Any]] + links: List[Dict[str, Any]] + +class ClusteringMode(str): + HYBRID = "hybrid" + WEBRTC_STREAM = "webrtc_stream" + +class RemoteClusteringRequest(BaseModel): + graph_data: GraphData + mode: str = ClusteringMode.HYBRID + cluster_dimensions: List[int] = [32, 18, 24] + force_simulation: bool = True + max_iterations: int = 100 + webrtc_options: Optional[Dict[str, Any]] = None + + # Semantic clustering options + clustering_method: str = "hybrid" # "spatial", "semantic", "hybrid" + semantic_algorithm: str = "hierarchical" # "hierarchical", "kmeans", "dbscan" + n_clusters: Optional[int] = None + similarity_threshold: float = 0.7 + + # Hybrid clustering weights + name_weight: float = 0.6 + content_weight: float = 0.3 + spatial_weight: float = 0.1 + +class ClusteringResult(BaseModel): + clustered_nodes: List[Dict[str, Any]] + cluster_info: Dict[str, Any] + processing_time: float + mode: str + session_id: Optional[str] = None + +class WebRTCSession(BaseModel): + session_id: str + client_id: str + created_at: datetime + last_frame_time: datetime + is_active: bool = True + +class CuPyClusteringEngine: + """ + Stable GPU clustering using CuPy arrays instead of cuDF to avoid segfaults + """ + + def __init__(self, cluster_dimensions: Tuple[int, int, int] = (32, 18, 24)): + self.cluster_dimensions = cluster_dimensions + self.cluster_count = cluster_dimensions[0] * cluster_dimensions[1] * cluster_dimensions[2] + self.has_gpu = HAS_CUPY + logger.info(f"CuPy clustering engine initialized with {self.cluster_count} clusters") + + def cluster_nodes_gpu(self, nodes: List[Dict[str, Any]]) -> Tuple[List[Dict[str, Any]], Dict[str, Any]]: + """ + 
Perform advanced GPU-accelerated clustering using RAPIDS cuML algorithms + """ + if not self.has_gpu: + return self._cluster_nodes_cpu(nodes) + + try: + from cuml.cluster import KMeans, DBSCAN, HDBSCAN + import cupy as cp + + start_time = time.time() + + # Extract coordinates and prepare feature matrix + coordinates = [] + for node in nodes: + x = float(node.get('x', 0)) + y = float(node.get('y', 0)) + z = float(node.get('z', 0)) + coordinates.append([x, y, z]) + + # Create GPU feature matrix + X = cp.array(coordinates, dtype=cp.float32) + n_samples = X.shape[0] + + print(f"🚀 GPU clustering {n_samples} nodes with RAPIDS cuML...") + + # Choose clustering algorithm optimized for performance + # KMeans is fastest and works well for most graph clustering scenarios + if n_samples < 5000: + # Small datasets: moderate cluster count + n_clusters = min(max(int(np.sqrt(n_samples / 2)), 3), 25) + clusterer = KMeans(n_clusters=n_clusters, random_state=42, max_iter=100) + algorithm_name = f"KMeans(k={n_clusters})" + + elif n_samples < 25000: + # Medium datasets: higher cluster count for better granularity + n_clusters = min(max(int(np.sqrt(n_samples / 1.5)), 10), 50) + clusterer = KMeans(n_clusters=n_clusters, random_state=42, max_iter=150) + algorithm_name = f"KMeans(k={n_clusters})" + + else: + # Large datasets: many clusters but capped for performance + n_clusters = min(max(int(np.sqrt(n_samples)), 20), 100) + clusterer = KMeans(n_clusters=n_clusters, random_state=42, max_iter=200) + algorithm_name = f"KMeans(k={n_clusters})" + + # Perform GPU clustering + cluster_labels = clusterer.fit_predict(X) + + # Convert results back to CPU + if hasattr(cluster_labels, 'get'): + cluster_result = cluster_labels.get() + else: + cluster_result = cp.asarray(cluster_labels).get() + + # Update nodes with clustering results + clustered_nodes = [] + for i, node in enumerate(nodes): + cluster_id = int(cluster_result[i]) + + clustered_node = { + **node, + 'cluster_index': cluster_id, + 
'node_index': i + } + clustered_nodes.append(clustered_node) + + # Generate cluster statistics + unique_clusters = len(np.unique(cluster_result)) + noise_points = 0 # KMeans doesn't produce noise points + processing_time = time.time() - start_time + + print(f"✅ {algorithm_name} completed: {unique_clusters} clusters, {noise_points} noise points in {processing_time:.4f}s") + + # Apply intelligent subsampling for large datasets + if len(nodes) > 10000: + print(f"🎯 Large dataset detected ({len(nodes)} nodes), applying cluster-based subsampling...") + clustered_nodes = self._apply_cluster_subsampling(clustered_nodes, cluster_result, target_nodes=5000) + print(f"✅ Subsampled to {len(clustered_nodes)} representative nodes") + + cluster_info = { + 'total_clusters': self.cluster_count, + 'used_clusters': unique_clusters, + 'cluster_dimensions': self.cluster_dimensions, + 'processing_time': processing_time, + 'gpu_accelerated': True, + 'engine': 'RAPIDS cuML', + 'algorithm': algorithm_name, + 'noise_points': int(noise_points), + 'original_node_count': len(nodes), + 'rendered_node_count': len(clustered_nodes), + 'subsampled': len(nodes) > 10000 + } + + logger.info(f"CuPy GPU clustering completed in {processing_time:.3f}s for {len(nodes)} nodes -> {unique_clusters} clusters") + return clustered_nodes, cluster_info + + except Exception as e: + logger.error(f"RAPIDS cuML GPU clustering failed: {e}") + import traceback + logger.error(f"Full traceback: {traceback.format_exc()}") + print(f"❌ GPU clustering error: {e}") + print(f"Traceback: {traceback.format_exc()}") + return self._cluster_nodes_cpu(nodes) + + def _apply_cluster_subsampling(self, clustered_nodes: List[Dict[str, Any]], cluster_labels: np.ndarray, target_nodes: int = 5000) -> List[Dict[str, Any]]: + """ + Apply intelligent cluster-based subsampling to reduce rendering load while preserving cluster structure. + + Strategy: + 1. Keep cluster centroids (most representative nodes) + 2. 
Keep boundary nodes (cluster edges for visual separation) + 3. Sample remaining nodes proportionally from each cluster + 4. Always keep noise points (outliers are important) + """ + import cupy as cp + + # Separate nodes by cluster + cluster_groups = {} + noise_nodes = [] + + for i, node in enumerate(clustered_nodes): + cluster_id = cluster_labels[i] + if cluster_id == -1: # Noise points + noise_nodes.append(node) + else: + if cluster_id not in cluster_groups: + cluster_groups[cluster_id] = [] + cluster_groups[cluster_id].append((i, node)) + + # Calculate sampling allocation + total_clusters = len(cluster_groups) + noise_count = len(noise_nodes) + + # Reserve space for noise points and ensure minimum representation + available_nodes = max(target_nodes - noise_count, total_clusters * 3) # At least 3 nodes per cluster + + selected_nodes = [] + + # Include noise points if they exist (DBSCAN/HDBSCAN only) + if noise_nodes: + selected_nodes.extend(noise_nodes) + print(f" 📍 Kept {len(noise_nodes)} noise points") + else: + print(f" 📍 No noise points (KMeans clustering)") + + # Process each cluster + for cluster_id, cluster_nodes in cluster_groups.items(): + cluster_size = len(cluster_nodes) + + if cluster_size == 0: + continue + + # Calculate how many nodes to keep from this cluster + # Larger clusters get more representation, but with diminishing returns + cluster_weight = min(cluster_size / len(clustered_nodes), 0.1) # Cap at 10% weight + target_from_cluster = max(3, int(available_nodes * cluster_weight)) # Minimum 3 per cluster + target_from_cluster = min(target_from_cluster, cluster_size) # Don't exceed cluster size + + if target_from_cluster >= cluster_size: + # Keep all nodes from small clusters + selected_nodes.extend([node for _, node in cluster_nodes]) + else: + # Intelligent sampling for large clusters + cluster_coords = np.array([[float(node.get('x', 0)), float(node.get('y', 0)), float(node.get('z', 0))] for _, node in cluster_nodes]) + + # Find cluster 
centroid + centroid = np.mean(cluster_coords, axis=0) + + # Calculate distances from centroid + distances = np.linalg.norm(cluster_coords - centroid, axis=1) + + # Select representative nodes: + # 1. Centroid node (closest to center) + centroid_idx = np.argmin(distances) + selected_indices = {centroid_idx} + + # 2. Boundary nodes (furthest from center for cluster separation) + if target_from_cluster > 1: + boundary_count = min(2, target_from_cluster - 1) + boundary_indices = np.argsort(distances)[-boundary_count:] + selected_indices.update(boundary_indices) + + # 3. Random sampling for remaining slots + remaining_slots = target_from_cluster - len(selected_indices) + if remaining_slots > 0: + available_indices = set(range(len(cluster_nodes))) - selected_indices + if available_indices: + random_indices = np.random.choice(list(available_indices), + size=min(remaining_slots, len(available_indices)), + replace=False) + selected_indices.update(random_indices) + + # Add selected nodes + for idx in selected_indices: + selected_nodes.append(cluster_nodes[idx][1]) + + print(f" 🎨 Cluster sampling: {len(cluster_groups)} clusters, {len(selected_nodes)} total nodes") + return selected_nodes + + def _cluster_nodes_cpu(self, nodes: List[Dict[str, Any]]) -> Tuple[List[Dict[str, Any]], Dict[str, Any]]: + """CPU fallback clustering implementation""" + start_time = time.time() + + clustered_nodes = [] + for i, node in enumerate(nodes): + # Apply same clustering logic as GPU version + x = float(node.get('x', 0)) + y = float(node.get('y', 0)) + z = float(node.get('z', 0)) + + # Normalize positions + norm_x = max(0.0, min(0.999, x / 100.0 + 0.5)) + norm_y = max(0.0, min(0.999, y / 100.0 + 0.5)) + norm_z = max(0.001, min(0.999, z / 100.0 + 0.5)) + + # Apply logarithmic scaling to Z + log_z = max(0.0, min(0.999, np.log(norm_z) / np.log(0.999))) + + # Calculate cluster indices + cluster_x = min(self.cluster_dimensions[0] - 1, int(norm_x * self.cluster_dimensions[0])) + cluster_y = 
min(self.cluster_dimensions[1] - 1, int(norm_y * self.cluster_dimensions[1])) + cluster_z = min(self.cluster_dimensions[2] - 1, int(log_z * self.cluster_dimensions[2])) + + cluster_index = (cluster_x + + cluster_y * self.cluster_dimensions[0] + + cluster_z * self.cluster_dimensions[0] * self.cluster_dimensions[1]) + + clustered_node = { + **node, + 'cluster_index': cluster_index, + 'node_index': i + } + clustered_nodes.append(clustered_node) + + processing_time = time.time() - start_time + + cluster_info = { + 'total_clusters': self.cluster_count, + 'cluster_dimensions': self.cluster_dimensions, + 'processing_time': processing_time, + 'gpu_accelerated': False, + 'engine': 'CPU' + } + + logger.info(f"CPU clustering completed in {processing_time:.3f}s for {len(nodes)} nodes") + return clustered_nodes, cluster_info + +class ForceSimulationEngine: + """ + GPU-accelerated force simulation for graph layout + """ + + def __init__(self): + self.has_gpu = HAS_CUGRAPH + + def simulate_forces(self, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]], max_iterations: int = 100) -> List[Dict[str, Any]]: + """Run force-directed layout simulation""" + + if not self.has_gpu or not links: + return self._simulate_forces_cpu(nodes, links, max_iterations) + + try: + return self._simulate_forces_gpu(nodes, links, max_iterations) + except Exception as e: + logger.error(f"GPU force simulation failed: {e}") + return self._simulate_forces_cpu(nodes, links, max_iterations) + + def _simulate_forces_gpu(self, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]], max_iterations: int) -> List[Dict[str, Any]]: + """GPU-accelerated force simulation using cuGraph (avoid cuDF operations)""" + + # Create simple edge list for cuGraph + edge_list = [] + for link in links: + source_id = str(link.get('source', '')) + target_id = str(link.get('target', '')) + edge_list.append([source_id, target_id]) + + if not edge_list: + return nodes + + try: + # Use NetworkX for safer force simulation + 
return self._simulate_forces_cpu(nodes, links, max_iterations) + except Exception as e: + logger.warning(f"Force simulation failed: {e}") + return nodes + + def _simulate_forces_cpu(self, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]], max_iterations: int) -> List[Dict[str, Any]]: + """CPU fallback force simulation using NetworkX""" + + import networkx as nx + + G = nx.Graph() + + # Add nodes + for node in nodes: + G.add_node(str(node.get('id', '')), **node) + + # Add edges + for link in links: + source = str(link.get('source', '')) + target = str(link.get('target', '')) + G.add_edge(source, target) + + # Compute spring layout + pos = nx.spring_layout(G, iterations=max_iterations, k=1.0) + + # Update node positions + updated_nodes = [] + for node in nodes: + node_id = str(node.get('id', '')) + if node_id in pos: + x, y = pos[node_id] + updated_node = {**node, 'x': float(x * 100), 'y': float(y * 100)} + else: + updated_node = node + updated_nodes.append(updated_node) + + return updated_nodes + +class WebRTCStreamingEngine: + """WebRTC streaming engine for real-time graph visualization streaming""" + + def __init__(self): + self.has_rendering = HAS_PLOTTING and HAS_OPENCV + self.active_sessions: Dict[str, WebRTCSession] = {} + self.frame_buffer: Dict[str, bytes] = {} + + def create_session(self, client_id: str) -> str: + """Create new WebRTC streaming session""" + session_id = str(uuid.uuid4()) + session = WebRTCSession( + session_id=session_id, + client_id=client_id, + created_at=datetime.now(), + last_frame_time=datetime.now() + ) + self.active_sessions[session_id] = session + logger.info(f"Created WebRTC session {session_id} for client {client_id}") + return session_id + + def render_graph_frame(self, session_id: str, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]]) -> bool: + """Render graph to frame buffer for streaming""" + + if not self.has_rendering: + return False + + if session_id not in self.active_sessions: + return False + + try: + # 
Create 3D plotly visualization + node_x = [node.get('x', 0) for node in nodes] + node_y = [node.get('y', 0) for node in nodes] + node_z = [node.get('z', 0) for node in nodes] + node_text = [node.get('name', f"Node {i}") for i, node in enumerate(nodes)] + node_colors = [node.get('cluster_index', 0) for node in nodes] + + # Create node trace + node_trace = go.Scatter3d( + x=node_x, y=node_y, z=node_z, + mode='markers', + marker=dict(size=8, color=node_colors, colorscale='Viridis', showscale=True), + text=node_text, + hovertemplate='%{text}
(%{x:.1f}, %{y:.1f}, %{z:.1f})', + name='Nodes' + ) + + # Create edge traces + edge_traces = [] + for link in links: + source_idx = None + target_idx = None + + for i, node in enumerate(nodes): + if str(node.get('id', '')) == str(link.get('source', '')): + source_idx = i + if str(node.get('id', '')) == str(link.get('target', '')): + target_idx = i + + if source_idx is not None and target_idx is not None: + edge_trace = go.Scatter3d( + x=[node_x[source_idx], node_x[target_idx], None], + y=[node_y[source_idx], node_y[target_idx], None], + z=[node_z[source_idx], node_z[target_idx], None], + mode='lines', + line=dict(color='gray', width=2), + showlegend=False, + hoverinfo='none' + ) + edge_traces.append(edge_trace) + + # Create figure + fig = go.Figure(data=[node_trace] + edge_traces) + fig.update_layout( + title='GPU-Clustered Knowledge Graph (CuPy)', + scene=dict(xaxis_title='X', yaxis_title='Y', zaxis_title='Z', bgcolor='rgb(10, 10, 10)'), + showlegend=False, + paper_bgcolor='rgb(10, 10, 10)', + plot_bgcolor='rgb(10, 10, 10)', + font=dict(color='white') + ) + + # Convert to image + img_bytes = pio.to_image(fig, format='png', width=1200, height=800, engine='kaleido') + + # Store frame in buffer + self.frame_buffer[session_id] = img_bytes + self.active_sessions[session_id].last_frame_time = datetime.now() + + return True + + except Exception as e: + logger.error(f"Frame rendering failed for session {session_id}: {e}") + return False + + def get_frame(self, session_id: str) -> Optional[bytes]: + return self.frame_buffer.get(session_id) + + def cleanup_session(self, session_id: str): + if session_id in self.active_sessions: + del self.active_sessions[session_id] + if session_id in self.frame_buffer: + del self.frame_buffer[session_id] + +class RemoteWebGPUService: + """Main service class with stable CuPy clustering""" + + def __init__(self): + self.clustering_engine = CuPyClusteringEngine() + self.force_engine = ForceSimulationEngine() + self.webrtc_engine = 
WebRTCStreamingEngine() + self.active_connections: List[WebSocket] = [] + self.executor = ThreadPoolExecutor(max_workers=4) + + async def process_clustering_request(self, request: RemoteClusteringRequest) -> ClusteringResult: + """Process remote clustering request with semantic clustering support""" + + start_time = time.time() + + try: + nodes = request.graph_data.nodes + links = request.graph_data.links + + # Apply force simulation if requested + if request.force_simulation: + logger.info("Running force simulation...") + nodes = self.force_engine.simulate_forces(nodes, links, request.max_iterations) + + # Choose clustering method based on request + if request.clustering_method == "spatial": + # Use traditional spatial clustering + logger.info(f"Spatial clustering {len(nodes)} nodes in {request.mode} mode...") + clustered_nodes, cluster_info = self.clustering_engine.cluster_nodes_gpu(nodes) + + elif request.clustering_method == "semantic": + # Use semantic clustering based on node names/content + logger.info(f"Semantic clustering {len(nodes)} nodes using {request.semantic_algorithm}...") + semantic_result = await cluster_nodes_by_similarity( + nodes, + method="name" if request.semantic_algorithm != "content" else "content", + algorithm=request.semantic_algorithm, + n_clusters=request.n_clusters, + similarity_threshold=request.similarity_threshold + ) + clustered_nodes = semantic_result.clustered_nodes + cluster_info = semantic_result.cluster_info + + elif request.clustering_method == "hybrid": + # Use hybrid clustering (semantic + spatial) + logger.info(f"Hybrid clustering {len(nodes)} nodes...") + semantic_result = await cluster_nodes_by_similarity( + nodes, + method="hybrid", + algorithm=request.semantic_algorithm, + n_clusters=request.n_clusters, + name_weight=request.name_weight, + content_weight=request.content_weight, + spatial_weight=request.spatial_weight + ) + clustered_nodes = semantic_result.clustered_nodes + cluster_info = semantic_result.cluster_info 
+ + else: + # Fallback to spatial clustering + logger.warning(f"Unknown clustering method '{request.clustering_method}', using spatial") + clustered_nodes, cluster_info = self.clustering_engine.cluster_nodes_gpu(nodes) + + processing_time = time.time() - start_time + + # Add clustering method info to result + cluster_info['clustering_method'] = request.clustering_method + cluster_info['total_processing_time'] = processing_time + + result = ClusteringResult( + clustered_nodes=clustered_nodes, + cluster_info=cluster_info, + processing_time=processing_time, + mode=request.mode + ) + + # Handle WebRTC streaming mode + if request.mode == ClusteringMode.WEBRTC_STREAM: + session_id = self.webrtc_engine.create_session("remote_client") + success = self.webrtc_engine.render_graph_frame(session_id, clustered_nodes, links) + if success: + result.session_id = session_id + + return result + + except Exception as e: + logger.error(f"Clustering request failed: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + async def broadcast_update(self, data: Dict[str, Any]): + """Broadcast updates to connected WebSocket clients""" + if not self.active_connections: + return + + disconnected = [] + for connection in self.active_connections: + try: + await connection.send_json(data) + except Exception: + disconnected.append(connection) + + for connection in disconnected: + self.active_connections.remove(connection) + +# FastAPI app setup +app = FastAPI( + title="Remote WebGPU Clustering Service (CuPy)", + description="Stable GPU-accelerated graph clustering using CuPy", + version="1.1.0" +) + +app.add_middleware( + CORSMiddleware, + allow_origins=["*"], + allow_credentials=True, + allow_methods=["*"], + allow_headers=["*"], +) + +service = RemoteWebGPUService() + +@app.post("/api/cluster", response_model=ClusteringResult) +async def cluster_graph(request: RemoteClusteringRequest): + """Process graph clustering request""" + result = await service.process_clustering_request(request) 
+ + await service.broadcast_update({ + "type": "clustering_complete", + "data": result.dict() + }) + + return result + +@app.get("/api/capabilities") +async def get_capabilities(): + """Get service capabilities""" + return { + "modes": { + "hybrid": { + "available": True, + "description": "GPU clustering on server, CPU rendering on client" + }, + "webrtc_stream": { + "available": service.webrtc_engine.has_rendering, + "description": "Full GPU rendering streamed to client browser" + } + }, + "clustering_methods": { + "spatial": { + "available": True, + "description": "Traditional spatial/coordinate-based clustering" + }, + "semantic": { + "available": True, + "description": "Semantic clustering based on node names and content similarity" + }, + "hybrid": { + "available": True, + "description": "Combined semantic and spatial clustering with configurable weights" + } + }, + "clustering_algorithms": { + "hierarchical": { + "available": True, + "description": "Hierarchical agglomerative clustering" + }, + "kmeans": { + "available": True, + "description": "K-means clustering (GPU accelerated when available)" + }, + "dbscan": { + "available": True, + "description": "Density-based spatial clustering" + } + }, + "gpu_acceleration": { + "cupy_available": HAS_CUPY, + "cugraph_available": HAS_CUGRAPH, + "opencv_available": HAS_OPENCV, + "plotting_available": HAS_PLOTTING, + "semantic_gpu": HAS_CUPY + }, + "cluster_dimensions": service.clustering_engine.cluster_dimensions, + "max_cluster_count": service.clustering_engine.cluster_count + } + +@app.get("/api/stream/{session_id}") +async def stream_frame(session_id: str): + """Stream rendered frame for WebRTC session""" + frame_data = service.webrtc_engine.get_frame(session_id) + if not frame_data: + raise HTTPException(status_code=404, detail="Frame not found") + + return StreamingResponse( + BytesIO(frame_data), + media_type="image/png", + headers={"Cache-Control": "no-cache"} + ) + +@app.delete("/api/stream/{session_id}") 
+async def cleanup_stream(session_id: str): + """Clean up WebRTC streaming session""" + service.webrtc_engine.cleanup_session(session_id) + return {"status": "cleaned up"} + +@app.websocket("/ws") +async def websocket_endpoint(websocket: WebSocket): + """WebSocket endpoint for real-time updates""" + await websocket.accept() + service.active_connections.append(websocket) + + try: + while True: + await websocket.receive_text() + except WebSocketDisconnect: + service.active_connections.remove(websocket) + +@app.get("/health") +async def health_check(): + """Health check endpoint""" + return { + "status": "healthy", + "gpu_available": HAS_CUPY, + "webrtc_available": service.webrtc_engine.has_rendering, + "active_sessions": len(service.webrtc_engine.active_sessions), + "active_connections": len(service.active_connections), + "engine": "RAPIDS cuML" + } + +if __name__ == "__main__": + port = int(os.environ.get("PORT", 8083)) + logger.info(f"Starting Remote WebGPU Clustering Service (RAPIDS cuML) on port {port}") + logger.info(f"CuPy GPU acceleration: {'✓' if HAS_CUPY else '✗'}") + logger.info(f"WebRTC streaming: {'✓' if service.webrtc_engine.has_rendering else '✗'}") + + uvicorn.run( + "remote_webgpu_clustering_service:app", + host="0.0.0.0", + port=port, + log_level="info", + reload=False + ) diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_webgpu_clustering_service_cupy.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_webgpu_clustering_service_cupy.py new file mode 100644 index 0000000..a1a5cd5 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/remote_webgpu_clustering_service_cupy.py @@ -0,0 +1,582 @@ +#!/usr/bin/env python3 +""" +Remote WebGPU Clustering Service - CuPy Version + +Provides GPU-accelerated graph clustering using CuPy instead of cuDF to avoid segfaults. +Uses stable CuPy operations for GPU clustering while maintaining the same API. 
+""" + +import os +import json +import uuid +import asyncio +import logging +import numpy as np +from datetime import datetime, timedelta +from typing import Dict, List, Any, Optional, Tuple, Union +from fastapi import FastAPI, HTTPException, WebSocket, WebSocketDisconnect, BackgroundTasks +from fastapi.responses import HTMLResponse, StreamingResponse +from fastapi.middleware.cors import CORSMiddleware +from pydantic import BaseModel +import uvicorn +import time +import threading +from concurrent.futures import ThreadPoolExecutor +import base64 +from io import BytesIO + +# GPU-accelerated imports +try: + import cupy as cp + HAS_CUPY = True + print("✓ CuPy available for stable GPU clustering") +except ImportError: + HAS_CUPY = False + print("⚠ CuPy not available, falling back to CPU") + +# Optional cuGraph for force simulation (avoid cuDF operations) +try: + import cugraph + import cudf + HAS_CUGRAPH = True + print("✓ cuGraph available for force simulation") +except ImportError: + HAS_CUGRAPH = False + print("⚠ cuGraph not available") + import networkx as nx + +# WebRTC streaming imports +try: + import cv2 + import PIL.Image as PILImage + HAS_OPENCV = True + print("✓ OpenCV available for WebRTC streaming") +except ImportError: + HAS_OPENCV = False + print("⚠ OpenCV not available, WebRTC streaming disabled") + +# WebGL rendering imports +try: + import matplotlib.pyplot as plt + import matplotlib + matplotlib.use('Agg') + import plotly.graph_objects as go + import plotly.io as pio + pio.renderers.default = "json" + HAS_PLOTTING = True + print("✓ Plotting libraries available for server-side rendering") +except ImportError: + HAS_PLOTTING = False + print("⚠ Plotting libraries not available") + +# Configure logging +logging.basicConfig(level=logging.INFO) +logger = logging.getLogger(__name__) + +class GraphData(BaseModel): + nodes: List[Dict[str, Any]] + links: List[Dict[str, Any]] + +class ClusteringMode(str): + HYBRID = "hybrid" + WEBRTC_STREAM = "webrtc_stream" + 
+class RemoteClusteringRequest(BaseModel): + graph_data: GraphData + mode: str = ClusteringMode.HYBRID + cluster_dimensions: List[int] = [32, 18, 24] + force_simulation: bool = True + max_iterations: int = 100 + webrtc_options: Optional[Dict[str, Any]] = None + +class ClusteringResult(BaseModel): + clustered_nodes: List[Dict[str, Any]] + cluster_info: Dict[str, Any] + processing_time: float + mode: str + session_id: Optional[str] = None + +class WebRTCSession(BaseModel): + session_id: str + client_id: str + created_at: datetime + last_frame_time: datetime + is_active: bool = True + +class CuPyClusteringEngine: + """ + Stable GPU clustering using CuPy arrays instead of cuDF to avoid segfaults + """ + + def __init__(self, cluster_dimensions: Tuple[int, int, int] = (32, 18, 24)): + self.cluster_dimensions = cluster_dimensions + self.cluster_count = cluster_dimensions[0] * cluster_dimensions[1] * cluster_dimensions[2] + self.has_gpu = HAS_CUPY + logger.info(f"CuPy clustering engine initialized with {self.cluster_count} clusters") + + def cluster_nodes_gpu(self, nodes: List[Dict[str, Any]]) -> Tuple[List[Dict[str, Any]], Dict[str, Any]]: + """ + Perform stable GPU-accelerated clustering using CuPy + """ + if not self.has_gpu: + return self._cluster_nodes_cpu(nodes) + + try: + start_time = time.time() + + # Extract coordinates using CuPy arrays (stable) + x_vals = cp.array([float(node.get('x', 0)) for node in nodes]) + y_vals = cp.array([float(node.get('y', 0)) for node in nodes]) + z_vals = cp.array([float(node.get('z', 0)) for node in nodes]) + + # Apply clustering algorithm (same as WebGPU shader) + norm_x = cp.clip((x_vals / 100.0 + 0.5), 0.0, 0.999) + norm_y = cp.clip((y_vals / 100.0 + 0.5), 0.0, 0.999) + norm_z = cp.clip((z_vals / 100.0 + 0.5), 0.001, 0.999) + + # Apply logarithmic scaling to Z dimension + log_z = cp.clip(cp.log(norm_z) / cp.log(0.999), 0.0, 0.999) + + # Calculate cluster indices + cluster_x = cp.clip((norm_x * 
self.cluster_dimensions[0]).astype(cp.int32), 0, self.cluster_dimensions[0] - 1) + cluster_y = cp.clip((norm_y * self.cluster_dimensions[1]).astype(cp.int32), 0, self.cluster_dimensions[1] - 1) + cluster_z = cp.clip((log_z * self.cluster_dimensions[2]).astype(cp.int32), 0, self.cluster_dimensions[2] - 1) + + # Calculate final cluster index + cluster_indices = (cluster_x + + cluster_y * self.cluster_dimensions[0] + + cluster_z * self.cluster_dimensions[0] * self.cluster_dimensions[1]) + + # Convert back to CPU for results + cluster_result = cluster_indices.get() + + # Update nodes with clustering results + clustered_nodes = [] + for i, node in enumerate(nodes): + clustered_node = { + **node, + 'cluster_index': int(cluster_result[i]), + 'node_index': i + } + clustered_nodes.append(clustered_node) + + # Generate cluster statistics + unique_clusters = len(np.unique(cluster_result)) + processing_time = time.time() - start_time + + cluster_info = { + 'total_clusters': self.cluster_count, + 'used_clusters': unique_clusters, + 'cluster_dimensions': self.cluster_dimensions, + 'processing_time': processing_time, + 'gpu_accelerated': True, + 'engine': 'CuPy' + } + + logger.info(f"CuPy GPU clustering completed in {processing_time:.3f}s for {len(nodes)} nodes -> {unique_clusters} clusters") + return clustered_nodes, cluster_info + + except Exception as e: + logger.error(f"CuPy GPU clustering failed: {e}") + return self._cluster_nodes_cpu(nodes) + + def _cluster_nodes_cpu(self, nodes: List[Dict[str, Any]]) -> Tuple[List[Dict[str, Any]], Dict[str, Any]]: + """CPU fallback clustering implementation""" + start_time = time.time() + + clustered_nodes = [] + for i, node in enumerate(nodes): + # Apply same clustering logic as GPU version + x = float(node.get('x', 0)) + y = float(node.get('y', 0)) + z = float(node.get('z', 0)) + + # Normalize positions + norm_x = max(0.0, min(0.999, x / 100.0 + 0.5)) + norm_y = max(0.0, min(0.999, y / 100.0 + 0.5)) + norm_z = max(0.001, min(0.999, z / 
100.0 + 0.5)) + + # Apply logarithmic scaling to Z + log_z = max(0.0, min(0.999, np.log(norm_z) / np.log(0.999))) + + # Calculate cluster indices + cluster_x = min(self.cluster_dimensions[0] - 1, int(norm_x * self.cluster_dimensions[0])) + cluster_y = min(self.cluster_dimensions[1] - 1, int(norm_y * self.cluster_dimensions[1])) + cluster_z = min(self.cluster_dimensions[2] - 1, int(log_z * self.cluster_dimensions[2])) + + cluster_index = (cluster_x + + cluster_y * self.cluster_dimensions[0] + + cluster_z * self.cluster_dimensions[0] * self.cluster_dimensions[1]) + + clustered_node = { + **node, + 'cluster_index': cluster_index, + 'node_index': i + } + clustered_nodes.append(clustered_node) + + processing_time = time.time() - start_time + + cluster_info = { + 'total_clusters': self.cluster_count, + 'cluster_dimensions': self.cluster_dimensions, + 'processing_time': processing_time, + 'gpu_accelerated': False, + 'engine': 'CPU' + } + + logger.info(f"CPU clustering completed in {processing_time:.3f}s for {len(nodes)} nodes") + return clustered_nodes, cluster_info + +class ForceSimulationEngine: + """ + GPU-accelerated force simulation for graph layout + """ + + def __init__(self): + self.has_gpu = HAS_CUGRAPH + + def simulate_forces(self, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]], max_iterations: int = 100) -> List[Dict[str, Any]]: + """Run force-directed layout simulation""" + + if not self.has_gpu or not links: + return self._simulate_forces_cpu(nodes, links, max_iterations) + + try: + return self._simulate_forces_gpu(nodes, links, max_iterations) + except Exception as e: + logger.error(f"GPU force simulation failed: {e}") + return self._simulate_forces_cpu(nodes, links, max_iterations) + + def _simulate_forces_gpu(self, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]], max_iterations: int) -> List[Dict[str, Any]]: + """GPU-accelerated force simulation using cuGraph (avoid cuDF operations)""" + + # Create simple edge list for cuGraph + 
edge_list = [] + for link in links: + source_id = str(link.get('source', '')) + target_id = str(link.get('target', '')) + edge_list.append([source_id, target_id]) + + if not edge_list: + return nodes + + try: + # Use NetworkX for safer force simulation + return self._simulate_forces_cpu(nodes, links, max_iterations) + except Exception as e: + logger.warning(f"Force simulation failed: {e}") + return nodes + + def _simulate_forces_cpu(self, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]], max_iterations: int) -> List[Dict[str, Any]]: + """CPU fallback force simulation using NetworkX""" + + import networkx as nx + + G = nx.Graph() + + # Add nodes + for node in nodes: + G.add_node(str(node.get('id', '')), **node) + + # Add edges + for link in links: + source = str(link.get('source', '')) + target = str(link.get('target', '')) + G.add_edge(source, target) + + # Compute spring layout + pos = nx.spring_layout(G, iterations=max_iterations, k=1.0) + + # Update node positions + updated_nodes = [] + for node in nodes: + node_id = str(node.get('id', '')) + if node_id in pos: + x, y = pos[node_id] + updated_node = {**node, 'x': float(x * 100), 'y': float(y * 100)} + else: + updated_node = node + updated_nodes.append(updated_node) + + return updated_nodes + +class WebRTCStreamingEngine: + """WebRTC streaming engine for real-time graph visualization streaming""" + + def __init__(self): + self.has_rendering = HAS_PLOTTING and HAS_OPENCV + self.active_sessions: Dict[str, WebRTCSession] = {} + self.frame_buffer: Dict[str, bytes] = {} + + def create_session(self, client_id: str) -> str: + """Create new WebRTC streaming session""" + session_id = str(uuid.uuid4()) + session = WebRTCSession( + session_id=session_id, + client_id=client_id, + created_at=datetime.now(), + last_frame_time=datetime.now() + ) + self.active_sessions[session_id] = session + logger.info(f"Created WebRTC session {session_id} for client {client_id}") + return session_id + + def render_graph_frame(self, 
session_id: str, nodes: List[Dict[str, Any]], links: List[Dict[str, Any]]) -> bool: + """Render graph to frame buffer for streaming""" + + if not self.has_rendering: + return False + + if session_id not in self.active_sessions: + return False + + try: + # Create 3D plotly visualization + node_x = [node.get('x', 0) for node in nodes] + node_y = [node.get('y', 0) for node in nodes] + node_z = [node.get('z', 0) for node in nodes] + node_text = [node.get('name', f"Node {i}") for i, node in enumerate(nodes)] + node_colors = [node.get('cluster_index', 0) for node in nodes] + + # Create node trace + node_trace = go.Scatter3d( + x=node_x, y=node_y, z=node_z, + mode='markers', + marker=dict(size=8, color=node_colors, colorscale='Viridis', showscale=True), + text=node_text, + hovertemplate='%{text}
(%{x:.1f}, %{y:.1f}, %{z:.1f})', + name='Nodes' + ) + + # Create edge traces + edge_traces = [] + for link in links: + source_idx = None + target_idx = None + + for i, node in enumerate(nodes): + if str(node.get('id', '')) == str(link.get('source', '')): + source_idx = i + if str(node.get('id', '')) == str(link.get('target', '')): + target_idx = i + + if source_idx is not None and target_idx is not None: + edge_trace = go.Scatter3d( + x=[node_x[source_idx], node_x[target_idx], None], + y=[node_y[source_idx], node_y[target_idx], None], + z=[node_z[source_idx], node_z[target_idx], None], + mode='lines', + line=dict(color='gray', width=2), + showlegend=False, + hoverinfo='none' + ) + edge_traces.append(edge_trace) + + # Create figure + fig = go.Figure(data=[node_trace] + edge_traces) + fig.update_layout( + title='GPU-Clustered Knowledge Graph (CuPy)', + scene=dict(xaxis_title='X', yaxis_title='Y', zaxis_title='Z', bgcolor='rgb(10, 10, 10)'), + showlegend=False, + paper_bgcolor='rgb(10, 10, 10)', + plot_bgcolor='rgb(10, 10, 10)', + font=dict(color='white') + ) + + # Convert to image + img_bytes = pio.to_image(fig, format='png', width=1200, height=800, engine='kaleido') + + # Store frame in buffer + self.frame_buffer[session_id] = img_bytes + self.active_sessions[session_id].last_frame_time = datetime.now() + + return True + + except Exception as e: + logger.error(f"Frame rendering failed for session {session_id}: {e}") + return False + + def get_frame(self, session_id: str) -> Optional[bytes]: + return self.frame_buffer.get(session_id) + + def cleanup_session(self, session_id: str): + if session_id in self.active_sessions: + del self.active_sessions[session_id] + if session_id in self.frame_buffer: + del self.frame_buffer[session_id] + +class RemoteWebGPUService: + """Main service class with stable CuPy clustering""" + + def __init__(self): + self.clustering_engine = CuPyClusteringEngine() + self.force_engine = ForceSimulationEngine() + self.webrtc_engine = 
WebRTCStreamingEngine() + self.active_connections: List[WebSocket] = [] + self.executor = ThreadPoolExecutor(max_workers=4) + + async def process_clustering_request(self, request: RemoteClusteringRequest) -> ClusteringResult: + """Process remote clustering request""" + + start_time = time.time() + + try: + nodes = request.graph_data.nodes + links = request.graph_data.links + + # Apply force simulation if requested + if request.force_simulation: + logger.info("Running force simulation...") + nodes = self.force_engine.simulate_forces(nodes, links, request.max_iterations) + + # Perform clustering + logger.info(f"Clustering {len(nodes)} nodes in {request.mode} mode...") + clustered_nodes, cluster_info = self.clustering_engine.cluster_nodes_gpu(nodes) + + processing_time = time.time() - start_time + + result = ClusteringResult( + clustered_nodes=clustered_nodes, + cluster_info=cluster_info, + processing_time=processing_time, + mode=request.mode + ) + + # Handle WebRTC streaming mode + if request.mode == ClusteringMode.WEBRTC_STREAM: + session_id = self.webrtc_engine.create_session("remote_client") + success = self.webrtc_engine.render_graph_frame(session_id, clustered_nodes, links) + if success: + result.session_id = session_id + + return result + + except Exception as e: + logger.error(f"Clustering request failed: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + async def broadcast_update(self, data: Dict[str, Any]): + """Broadcast updates to connected WebSocket clients""" + if not self.active_connections: + return + + disconnected = [] + for connection in self.active_connections: + try: + await connection.send_json(data) + except Exception: + disconnected.append(connection) + + for connection in disconnected: + self.active_connections.remove(connection) + +# FastAPI app setup +app = FastAPI( + title="Remote WebGPU Clustering Service (CuPy)", + description="Stable GPU-accelerated graph clustering using CuPy", + version="1.1.0" +) + +app.add_middleware( + 
CORSMiddleware, + allow_origins=["*"], + allow_credentials=True, + allow_methods=["*"], + allow_headers=["*"], +) + +service = RemoteWebGPUService() + +@app.post("/api/cluster", response_model=ClusteringResult) +async def cluster_graph(request: RemoteClusteringRequest): + """Process graph clustering request""" + result = await service.process_clustering_request(request) + + await service.broadcast_update({ + "type": "clustering_complete", + "data": result.dict() + }) + + return result + +@app.get("/api/capabilities") +async def get_capabilities(): + """Get service capabilities""" + return { + "modes": { + "hybrid": { + "available": True, + "description": "GPU clustering on server, CPU rendering on client" + }, + "webrtc_stream": { + "available": service.webrtc_engine.has_rendering, + "description": "Full GPU rendering streamed to client browser" + } + }, + "gpu_acceleration": { + "cupy_available": HAS_CUPY, + "cugraph_available": HAS_CUGRAPH, + "opencv_available": HAS_OPENCV, + "plotting_available": HAS_PLOTTING + }, + "cluster_dimensions": service.clustering_engine.cluster_dimensions, + "max_cluster_count": service.clustering_engine.cluster_count + } + +@app.get("/api/stream/{session_id}") +async def stream_frame(session_id: str): + """Stream rendered frame for WebRTC session""" + frame_data = service.webrtc_engine.get_frame(session_id) + if not frame_data: + raise HTTPException(status_code=404, detail="Frame not found") + + return StreamingResponse( + BytesIO(frame_data), + media_type="image/png", + headers={"Cache-Control": "no-cache"} + ) + +@app.delete("/api/stream/{session_id}") +async def cleanup_stream(session_id: str): + """Clean up WebRTC streaming session""" + service.webrtc_engine.cleanup_session(session_id) + return {"status": "cleaned up"} + +@app.websocket("/ws") +async def websocket_endpoint(websocket: WebSocket): + """WebSocket endpoint for real-time updates""" + await websocket.accept() + service.active_connections.append(websocket) + + try: + 
while True: + await websocket.receive_text() + except WebSocketDisconnect: + service.active_connections.remove(websocket) + +@app.get("/health") +async def health_check(): + """Health check endpoint""" + return { + "status": "healthy", + "gpu_available": HAS_CUPY, + "webrtc_available": service.webrtc_engine.has_rendering, + "active_sessions": len(service.webrtc_engine.active_sessions), + "active_connections": len(service.active_connections), + "engine": "CuPy" + } + +if __name__ == "__main__": + port = int(os.environ.get("PORT", 8083)) + logger.info(f"Starting Remote WebGPU Clustering Service (CuPy) on port {port}") + logger.info(f"CuPy GPU acceleration: {'✓' if HAS_CUPY else '✗'}") + logger.info(f"WebRTC streaming: {'✓' if service.webrtc_engine.has_rendering else '✗'}") + + uvicorn.run( + "remote_webgpu_clustering_service_cupy:app", + host="0.0.0.0", + port=port, + log_level="info", + reload=False + ) diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/requirements-remote-webgpu.txt b/nvidia/txt2kg/assets/deploy/services/gpu-viz/requirements-remote-webgpu.txt new file mode 100644 index 0000000..79bfc42 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/requirements-remote-webgpu.txt @@ -0,0 +1,30 @@ +# Remote WebGPU Clustering Service Dependencies +# For GPU-accelerated graph clustering and WebRTC streaming + +# Core FastAPI and web service dependencies +fastapi==0.104.1 +uvicorn==0.24.0 +websockets==12.0 +python-multipart==0.0.6 + +# RAPIDS dependencies (GPU-accelerated data processing) +# These are included in the NVIDIA PyTorch container +# cudf, cugraph, cuml, cupy are pre-installed + +# Data processing and scientific computing +numpy>=1.24.0 +pandas>=2.0.0 +networkx>=3.0 + +# WebRTC streaming and visualization dependencies +opencv-python-headless==4.8.1.78 +plotly>=5.17.0 +kaleido>=0.2.1 +Pillow>=10.0.0 + +# Redis for session management (optional) +redis>=5.0.0 + +# Additional utilities +pydantic>=2.0.0 +python-dotenv>=1.0.0 diff --git 
a/nvidia/txt2kg/assets/deploy/services/gpu-viz/requirements.txt b/nvidia/txt2kg/assets/deploy/services/gpu-viz/requirements.txt new file mode 100644 index 0000000..ef16d7a --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/requirements.txt @@ -0,0 +1,14 @@ +graphistry>=0.32.0 +pandas>=2.0.0 +numpy>=1.24.0 +fastapi>=0.104.0 +uvicorn[standard]>=0.24.0 +pydantic>=2.0.0 +networkx>=3.0 # For efficient graph generation algorithms +# cudf, cuml, cugraph are already included in PyG container +# cupy>=12.0.0 # Already included in PyG container +igraph>=0.10.0 # For additional graph algorithms +scikit-learn>=1.3.0 # For additional ML features +requests>=2.31.0 +aiofiles>=23.0.0 +python-multipart>=0.0.6 \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/semantic_clustering_service.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/semantic_clustering_service.py new file mode 100644 index 0000000..ea1733c --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/semantic_clustering_service.py @@ -0,0 +1,600 @@ +""" +Semantic Clustering Service for Knowledge Graphs +Groups nodes by semantic similarity of names, types, and content rather than just spatial coordinates +""" + +import asyncio +import logging +import time +from typing import Dict, List, Any, Tuple, Set, Optional +from dataclasses import dataclass +from collections import defaultdict +import numpy as np +import re +from difflib import SequenceMatcher +from sklearn.feature_extraction.text import TfidfVectorizer +from sklearn.cluster import KMeans, DBSCAN, AgglomerativeClustering +from sklearn.metrics.pairwise import cosine_similarity +import networkx as nx + +# Try to import GPU libraries +try: + import cupy as cp + import cuml + from cuml.cluster import KMeans as cuKMeans, DBSCAN as cuDBSCAN + HAS_GPU = True + print("✅ GPU libraries (CuPy, cuML) available for semantic clustering") +except ImportError: + HAS_GPU = False + print("⚠️ GPU libraries not available, 
using CPU for semantic clustering") + +logger = logging.getLogger(__name__) + +@dataclass +class SemanticClusterResult: + """Result of semantic clustering operation""" + clustered_nodes: List[Dict[str, Any]] + cluster_info: Dict[str, Any] + similarity_matrix: Optional[np.ndarray] = None + cluster_labels: Optional[np.ndarray] = None + +class SemanticSimilarityCalculator: + """Calculate semantic similarity between node names and content""" + + def __init__(self): + self.tfidf_vectorizer = TfidfVectorizer( + max_features=1000, + stop_words='english', + ngram_range=(1, 2), + lowercase=True + ) + self.fitted = False + + def calculate_name_similarity(self, name1: str, name2: str) -> float: + """Calculate similarity between two node names using multiple methods""" + if not name1 or not name2: + return 0.0 + + name1_clean = self._clean_name(name1) + name2_clean = self._clean_name(name2) + + # Method 1: Exact match + if name1_clean == name2_clean: + return 1.0 + + # Method 2: Substring match + if name1_clean in name2_clean or name2_clean in name1_clean: + return 0.8 + + # Method 3: Sequence similarity (Levenshtein-based) + seq_similarity = SequenceMatcher(None, name1_clean, name2_clean).ratio() + + # Method 4: Word overlap (Jaccard similarity) + words1 = set(name1_clean.split()) + words2 = set(name2_clean.split()) + if words1 and words2: + jaccard_sim = len(words1.intersection(words2)) / len(words1.union(words2)) + else: + jaccard_sim = 0.0 + + # Method 5: Common prefix/suffix + prefix_sim = self._prefix_similarity(name1_clean, name2_clean) + suffix_sim = self._suffix_similarity(name1_clean, name2_clean) + + # Combine similarities with weights + combined_similarity = ( + seq_similarity * 0.3 + + jaccard_sim * 0.4 + + prefix_sim * 0.15 + + suffix_sim * 0.15 + ) + + return min(combined_similarity, 1.0) + + def calculate_content_similarity(self, nodes: List[Dict[str, Any]]) -> np.ndarray: + """Calculate content similarity matrix using TF-IDF""" + # Extract text content from 
nodes + texts = [] + for node in nodes: + text_parts = [] + + # Add node name + if node.get('name'): + text_parts.append(str(node['name'])) + + # Add node type/group + if node.get('group') or node.get('type'): + text_parts.append(str(node.get('group', node.get('type', '')))) + + # Add any description or content + for key in ['description', 'content', 'label', 'properties']: + if node.get(key): + text_parts.append(str(node[key])) + + # Combine all text + combined_text = ' '.join(text_parts) + texts.append(combined_text if combined_text.strip() else node.get('name', 'unnamed')) + + # Calculate TF-IDF similarity + if not self.fitted and texts: + tfidf_matrix = self.tfidf_vectorizer.fit_transform(texts) + self.fitted = True + else: + tfidf_matrix = self.tfidf_vectorizer.transform(texts) + + # Calculate cosine similarity matrix + similarity_matrix = cosine_similarity(tfidf_matrix) + return similarity_matrix + + def _clean_name(self, name: str) -> str: + """Clean and normalize node name""" + if not name: + return "" + + # Convert to lowercase + cleaned = name.lower().strip() + + # Remove special characters but keep spaces and alphanumeric + cleaned = re.sub(r'[^\w\s-]', ' ', cleaned) + + # Normalize whitespace + cleaned = re.sub(r'\s+', ' ', cleaned) + + return cleaned.strip() + + def _prefix_similarity(self, name1: str, name2: str) -> float: + """Calculate similarity based on common prefix""" + min_len = min(len(name1), len(name2)) + if min_len == 0: + return 0.0 + + common_prefix = 0 + for i in range(min_len): + if name1[i] == name2[i]: + common_prefix += 1 + else: + break + + return common_prefix / min_len + + def _suffix_similarity(self, name1: str, name2: str) -> float: + """Calculate similarity based on common suffix""" + min_len = min(len(name1), len(name2)) + if min_len == 0: + return 0.0 + + common_suffix = 0 + for i in range(1, min_len + 1): + if name1[-i] == name2[-i]: + common_suffix += 1 + else: + break + + return common_suffix / min_len + +class 
SemanticClusteringEngine: + """Main semantic clustering engine""" + + def __init__(self, use_gpu: bool = None): + self.use_gpu = use_gpu if use_gpu is not None else HAS_GPU + self.similarity_calc = SemanticSimilarityCalculator() + logger.info(f"Semantic clustering engine initialized (GPU: {self.use_gpu})") + + def cluster_by_name_similarity( + self, + nodes: List[Dict[str, Any]], + algorithm: str = "hierarchical", + n_clusters: Optional[int] = None, + similarity_threshold: float = 0.7 + ) -> SemanticClusterResult: + """ + Cluster nodes based on name similarity + + Args: + nodes: List of node dictionaries + algorithm: 'hierarchical', 'kmeans', 'dbscan' + n_clusters: Number of clusters (for kmeans/hierarchical) + similarity_threshold: Minimum similarity for clustering (for dbscan) + """ + start_time = time.time() + n_nodes = len(nodes) + + logger.info(f"🧠 Starting semantic clustering of {n_nodes} nodes using {algorithm}") + + if n_nodes < 2: + return self._create_single_cluster_result(nodes, start_time) + + # Calculate name similarity matrix + similarity_matrix = self._calculate_name_similarity_matrix(nodes) + + # Convert similarity to distance matrix + distance_matrix = 1.0 - similarity_matrix + + # Apply clustering algorithm + if algorithm == "hierarchical": + cluster_labels = self._hierarchical_clustering( + distance_matrix, n_clusters or min(10, n_nodes // 2) + ) + elif algorithm == "kmeans": + cluster_labels = self._kmeans_clustering( + similarity_matrix, n_clusters or min(10, n_nodes // 2) + ) + elif algorithm == "dbscan": + cluster_labels = self._dbscan_clustering( + distance_matrix, similarity_threshold + ) + else: + raise ValueError(f"Unknown clustering algorithm: {algorithm}") + + # Create clustered nodes + clustered_nodes = [] + for i, node in enumerate(nodes): + clustered_node = { + **node, + 'cluster_id': int(cluster_labels[i]), + 'node_index': i + } + clustered_nodes.append(clustered_node) + + processing_time = time.time() - start_time + + # Calculate 
cluster statistics + unique_clusters = len(set(cluster_labels)) + cluster_sizes = defaultdict(int) + for label in cluster_labels: + cluster_sizes[label] += 1 + + cluster_info = { + 'algorithm': f'semantic_{algorithm}', + 'total_clusters': unique_clusters, + 'processing_time': processing_time, + 'gpu_accelerated': self.use_gpu, + 'cluster_sizes': dict(cluster_sizes), + 'average_cluster_size': n_nodes / unique_clusters if unique_clusters > 0 else 0, + 'similarity_threshold': similarity_threshold if algorithm == 'dbscan' else None + } + + logger.info(f"✅ Semantic clustering completed: {unique_clusters} clusters in {processing_time:.3f}s") + + return SemanticClusterResult( + clustered_nodes=clustered_nodes, + cluster_info=cluster_info, + similarity_matrix=similarity_matrix, + cluster_labels=cluster_labels + ) + + def cluster_by_content_similarity( + self, + nodes: List[Dict[str, Any]], + algorithm: str = "kmeans", + n_clusters: Optional[int] = None + ) -> SemanticClusterResult: + """Cluster nodes based on content similarity using TF-IDF""" + start_time = time.time() + n_nodes = len(nodes) + + logger.info(f"📄 Starting content-based clustering of {n_nodes} nodes") + + if n_nodes < 2: + return self._create_single_cluster_result(nodes, start_time) + + # Calculate content similarity + similarity_matrix = self.similarity_calc.calculate_content_similarity(nodes) + + # Apply clustering + if algorithm == "kmeans": + n_clusters = n_clusters or min(10, n_nodes // 2) + if self.use_gpu and HAS_GPU: + cluster_labels = self._gpu_kmeans_clustering(similarity_matrix, n_clusters) + else: + cluster_labels = self._kmeans_clustering(similarity_matrix, n_clusters) + else: + distance_matrix = 1.0 - similarity_matrix + cluster_labels = self._hierarchical_clustering( + distance_matrix, n_clusters or min(10, n_nodes // 2) + ) + + # Create result + clustered_nodes = [] + for i, node in enumerate(nodes): + clustered_node = { + **node, + 'cluster_id': int(cluster_labels[i]), + 'node_index': i + } 
+ clustered_nodes.append(clustered_node) + + processing_time = time.time() - start_time + unique_clusters = len(set(cluster_labels)) + + cluster_info = { + 'algorithm': f'content_{algorithm}', + 'total_clusters': unique_clusters, + 'processing_time': processing_time, + 'gpu_accelerated': self.use_gpu and algorithm == 'kmeans', + 'average_cluster_size': n_nodes / unique_clusters if unique_clusters > 0 else 0 + } + + logger.info(f"✅ Content clustering completed: {unique_clusters} clusters in {processing_time:.3f}s") + + return SemanticClusterResult( + clustered_nodes=clustered_nodes, + cluster_info=cluster_info, + similarity_matrix=similarity_matrix, + cluster_labels=cluster_labels + ) + + def hybrid_clustering( + self, + nodes: List[Dict[str, Any]], + name_weight: float = 0.6, + content_weight: float = 0.3, + spatial_weight: float = 0.1, + algorithm: str = "hierarchical", + n_clusters: Optional[int] = None + ) -> SemanticClusterResult: + """ + Hybrid clustering combining name, content, and spatial similarities + + Args: + name_weight: Weight for name similarity (0.0-1.0) + content_weight: Weight for content similarity (0.0-1.0) + spatial_weight: Weight for spatial similarity (0.0-1.0) + """ + start_time = time.time() + n_nodes = len(nodes) + + logger.info(f"🔄 Starting hybrid clustering of {n_nodes} nodes") + logger.info(f" Weights: name={name_weight}, content={content_weight}, spatial={spatial_weight}") + + if n_nodes < 2: + return self._create_single_cluster_result(nodes, start_time) + + # Normalize weights + total_weight = name_weight + content_weight + spatial_weight + if total_weight > 0: + name_weight /= total_weight + content_weight /= total_weight + spatial_weight /= total_weight + + # Calculate different similarity matrices + similarities = [] + weights = [] + + if name_weight > 0: + name_similarity = self._calculate_name_similarity_matrix(nodes) + similarities.append(name_similarity) + weights.append(name_weight) + + if content_weight > 0: + 
content_similarity = self.similarity_calc.calculate_content_similarity(nodes) + similarities.append(content_similarity) + weights.append(content_weight) + + if spatial_weight > 0: + spatial_similarity = self._calculate_spatial_similarity_matrix(nodes) + similarities.append(spatial_similarity) + weights.append(spatial_weight) + + # Combine similarities + if not similarities: + return self._create_single_cluster_result(nodes, start_time) + + combined_similarity = np.zeros((n_nodes, n_nodes)) + for similarity, weight in zip(similarities, weights): + combined_similarity += similarity * weight + + # Apply clustering + distance_matrix = 1.0 - combined_similarity + + if algorithm == "hierarchical": + cluster_labels = self._hierarchical_clustering( + distance_matrix, n_clusters or min(10, n_nodes // 2) + ) + elif algorithm == "kmeans": + cluster_labels = self._kmeans_clustering( + combined_similarity, n_clusters or min(10, n_nodes // 2) + ) + else: + cluster_labels = self._dbscan_clustering(distance_matrix, 0.3) + + # Create result + clustered_nodes = [] + for i, node in enumerate(nodes): + clustered_node = { + **node, + 'cluster_id': int(cluster_labels[i]), + 'node_index': i + } + clustered_nodes.append(clustered_node) + + processing_time = time.time() - start_time + unique_clusters = len(set(cluster_labels)) + + cluster_info = { + 'algorithm': f'hybrid_{algorithm}', + 'total_clusters': unique_clusters, + 'processing_time': processing_time, + 'gpu_accelerated': self.use_gpu, + 'weights': { + 'name': name_weight, + 'content': content_weight, + 'spatial': spatial_weight + }, + 'average_cluster_size': n_nodes / unique_clusters if unique_clusters > 0 else 0 + } + + logger.info(f"✅ Hybrid clustering completed: {unique_clusters} clusters in {processing_time:.3f}s") + + return SemanticClusterResult( + clustered_nodes=clustered_nodes, + cluster_info=cluster_info, + similarity_matrix=combined_similarity, + cluster_labels=cluster_labels + ) + + def 
_calculate_name_similarity_matrix(self, nodes: List[Dict[str, Any]]) -> np.ndarray: + """Calculate pairwise name similarity matrix""" + n_nodes = len(nodes) + similarity_matrix = np.zeros((n_nodes, n_nodes)) + + for i in range(n_nodes): + for j in range(i, n_nodes): + if i == j: + similarity_matrix[i, j] = 1.0 + else: + name1 = nodes[i].get('name', '') + name2 = nodes[j].get('name', '') + similarity = self.similarity_calc.calculate_name_similarity(name1, name2) + similarity_matrix[i, j] = similarity + similarity_matrix[j, i] = similarity # Symmetric + + return similarity_matrix + + def _calculate_spatial_similarity_matrix(self, nodes: List[Dict[str, Any]]) -> np.ndarray: + """Calculate spatial similarity based on node positions""" + n_nodes = len(nodes) + similarity_matrix = np.zeros((n_nodes, n_nodes)) + + # Extract coordinates + coords = [] + for node in nodes: + x = float(node.get('x', 0)) + y = float(node.get('y', 0)) + z = float(node.get('z', 0)) + coords.append([x, y, z]) + + coords = np.array(coords) + + # Calculate pairwise distances + for i in range(n_nodes): + for j in range(i, n_nodes): + if i == j: + similarity_matrix[i, j] = 1.0 + else: + # Euclidean distance + dist = np.linalg.norm(coords[i] - coords[j]) + # Convert distance to similarity (closer = more similar) + # Use exponential decay: similarity = exp(-distance/scale) + scale = 50.0 # Adjust based on your coordinate system + similarity = np.exp(-dist / scale) + similarity_matrix[i, j] = similarity + similarity_matrix[j, i] = similarity + + return similarity_matrix + + def _hierarchical_clustering(self, distance_matrix: np.ndarray, n_clusters: int) -> np.ndarray: + """Apply hierarchical clustering""" + clusterer = AgglomerativeClustering( + n_clusters=n_clusters, + metric='precomputed', + linkage='average' + ) + return clusterer.fit_predict(distance_matrix) + + def _kmeans_clustering(self, similarity_matrix: np.ndarray, n_clusters: int) -> np.ndarray: + """Apply K-means clustering""" + clusterer = 
KMeans(n_clusters=n_clusters, random_state=42, n_init=10) + return clusterer.fit_predict(similarity_matrix) + + def _gpu_kmeans_clustering(self, similarity_matrix: np.ndarray, n_clusters: int) -> np.ndarray: + """Apply GPU-accelerated K-means clustering""" + try: + gpu_matrix = cp.array(similarity_matrix, dtype=cp.float32) + clusterer = cuKMeans(n_clusters=n_clusters, random_state=42) + labels = clusterer.fit_predict(gpu_matrix) + return cp.asnumpy(labels) + except Exception as e: + logger.warning(f"GPU K-means failed, falling back to CPU: {e}") + return self._kmeans_clustering(similarity_matrix, n_clusters) + + def _dbscan_clustering(self, distance_matrix: np.ndarray, eps: float) -> np.ndarray: + """Apply DBSCAN clustering""" + clusterer = DBSCAN(eps=eps, metric='precomputed', min_samples=2) + labels = clusterer.fit_predict(distance_matrix) + + # DBSCAN uses -1 for noise points, convert to positive integers + unique_labels = set(labels) + if -1 in unique_labels: + # Assign noise points to individual clusters + max_label = max(labels) if len(unique_labels) > 1 else -1 + noise_cluster = max_label + 1 + labels = np.array([noise_cluster if label == -1 else label for label in labels]) + + return labels + + def _create_single_cluster_result(self, nodes: List[Dict[str, Any]], start_time: float) -> SemanticClusterResult: + """Create result for single cluster (when too few nodes)""" + clustered_nodes = [] + for i, node in enumerate(nodes): + clustered_node = { + **node, + 'cluster_id': 0, + 'node_index': i + } + clustered_nodes.append(clustered_node) + + processing_time = time.time() - start_time + + cluster_info = { + 'algorithm': 'single_cluster', + 'total_clusters': 1, + 'processing_time': processing_time, + 'gpu_accelerated': False, + 'average_cluster_size': len(nodes) + } + + return SemanticClusterResult( + clustered_nodes=clustered_nodes, + cluster_info=cluster_info, + similarity_matrix=None, + cluster_labels=np.zeros(len(nodes), dtype=int) + ) + +# Convenience 
functions for easy integration +async def cluster_nodes_by_similarity( + nodes: List[Dict[str, Any]], + method: str = "hybrid", + algorithm: str = "hierarchical", + n_clusters: Optional[int] = None, + **kwargs +) -> SemanticClusterResult: + """ + Main entry point for semantic clustering + + Args: + nodes: List of node dictionaries + method: 'name', 'content', 'hybrid' + algorithm: 'hierarchical', 'kmeans', 'dbscan' + n_clusters: Number of clusters (if applicable) + **kwargs: Additional parameters for specific methods + """ + engine = SemanticClusteringEngine() + + if method == "name": + return engine.cluster_by_name_similarity(nodes, algorithm, n_clusters, **kwargs) + elif method == "content": + return engine.cluster_by_content_similarity(nodes, algorithm, n_clusters, **kwargs) + elif method == "hybrid": + return engine.hybrid_clustering(nodes, algorithm=algorithm, n_clusters=n_clusters, **kwargs) + else: + raise ValueError(f"Unknown clustering method: {method}") + +if __name__ == "__main__": + # Example usage + test_nodes = [ + {"name": "Machine Learning", "x": 0, "y": 0, "z": 0, "group": "AI"}, + {"name": "Deep Learning", "x": 10, "y": 5, "z": 2, "group": "AI"}, + {"name": "Neural Networks", "x": 15, "y": 8, "z": 3, "group": "AI"}, + {"name": "Data Science", "x": 20, "y": 10, "z": 5, "group": "Data"}, + {"name": "Statistics", "x": 25, "y": 15, "z": 8, "group": "Math"}, + {"name": "Linear Algebra", "x": 30, "y": 20, "z": 10, "group": "Math"}, + ] + + async def test(): + result = await cluster_nodes_by_similarity(test_nodes, method="hybrid") + print("Cluster Result:", result.cluster_info) + for node in result.clustered_nodes: + print(f" {node['name']} -> Cluster {node['cluster_id']}") + + asyncio.run(test()) diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/simple_webgpu_test.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/simple_webgpu_test.py new file mode 100644 index 0000000..6da720f --- /dev/null +++ 
b/nvidia/txt2kg/assets/deploy/services/gpu-viz/simple_webgpu_test.py @@ -0,0 +1,120 @@ +#!/usr/bin/env python3 +""" +Simple WebGPU clustering test service +Minimal implementation to test basic functionality +""" + +from fastapi import FastAPI, HTTPException +from fastapi.middleware.cors import CORSMiddleware +from pydantic import BaseModel +import uvicorn +from typing import Dict, List, Any, Optional +import time + +# Simple data models +class GraphData(BaseModel): + nodes: List[Dict[str, Any]] + links: List[Dict[str, Any]] + +class SimpleClusteringRequest(BaseModel): + graph_data: GraphData + mode: str = "hybrid" + +class SimpleClusteringResult(BaseModel): + clustered_nodes: List[Dict[str, Any]] + processing_time: float + mode: str + session_id: Optional[str] = None + +# FastAPI app +app = FastAPI(title="Simple WebGPU Test Service", version="1.0.0") + +# Enable CORS +app.add_middleware( + CORSMiddleware, + allow_origins=["*"], + allow_credentials=True, + allow_methods=["*"], + allow_headers=["*"], +) + +@app.get("/health") +async def health_check(): + return { + "status": "healthy", + "gpu_available": True, + "webrtc_available": True, + "active_sessions": 0, + "active_connections": 0 + } + +@app.get("/api/capabilities") +async def get_capabilities(): + return { + "modes": { + "hybrid": { + "available": True, + "description": "GPU clustering on server, CPU rendering on client" + }, + "webrtc_stream": { + "available": True, + "description": "Full GPU rendering streamed to client browser" + } + }, + "gpu_acceleration": { + "rapids_available": True, + "opencv_available": True, + "plotting_available": True + }, + "cluster_dimensions": [32, 18, 24], + "max_cluster_count": 13824 + } + +@app.post("/api/cluster", response_model=SimpleClusteringResult) +async def cluster_graph(request: SimpleClusteringRequest): + """Simple clustering implementation for testing""" + try: + start_time = time.time() + + # Simple clustering - just add cluster_index to each node + 
clustered_nodes = [] + for i, node in enumerate(request.graph_data.nodes): + clustered_node = {**node, "cluster_index": i % 10, "node_index": i} + clustered_nodes.append(clustered_node) + + processing_time = time.time() - start_time + + result = SimpleClusteringResult( + clustered_nodes=clustered_nodes, + processing_time=processing_time, + mode=request.mode, + session_id="test-session-123" if request.mode == "webrtc_stream" else None + ) + + return result + + except Exception as e: + raise HTTPException(status_code=500, detail=str(e)) + +@app.get("/api/stream/{session_id}") +async def stream_frame(session_id: str): + """Simple streaming endpoint - returns a placeholder""" + # Return a simple 1x1 PNG pixel as placeholder + png_data = b'\x89PNG\r\n\x1a\n\x00\x00\x00\rIHDR\x00\x00\x00\x01\x00\x00\x00\x01\x08\x06\x00\x00\x00\x1f\x15\xc4\x89\x00\x00\x00\nIDATx\x9cc\x00\x01\x00\x00\x05\x00\x01\r\n-\xdb\x00\x00\x00\x00IEND\xaeB`\x82' + + from fastapi.responses import Response + return Response( + content=png_data, + media_type="image/png", + headers={"Cache-Control": "no-cache"} + ) + +if __name__ == "__main__": + print("Starting Simple WebGPU Test Service...") + uvicorn.run( + "simple_webgpu_test:app", + host="0.0.0.0", + port=8083, + log_level="info", + reload=False + ) diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/start_remote_gpu_services.sh b/nvidia/txt2kg/assets/deploy/services/gpu-viz/start_remote_gpu_services.sh new file mode 100755 index 0000000..cafd438 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/start_remote_gpu_services.sh @@ -0,0 +1,166 @@ +#!/bin/bash + +# Start Remote GPU Rendering Services +# This script starts the custom remote GPU rendering service as an alternative to PyGraphistry cloud + +echo "🚀 Starting Remote GPU Rendering Services" +echo "=========================================" + +# Check if we're in a RAPIDS/cuGraph environment +if python -c "import cudf, cugraph" 2>/dev/null; then + echo "✓ RAPIDS/cuGraph 
environment detected" + GPU_AVAILABLE=true +else + echo "⚠ RAPIDS/cuGraph not available - will use CPU fallback" + GPU_AVAILABLE=false +fi + +# Check if Redis is available (optional for session storage) +if command -v redis-server >/dev/null 2>&1; then + echo "✓ Redis available for session storage" + + # Start Redis if not running + if ! pgrep -x "redis-server" > /dev/null; then + echo "Starting Redis server..." + redis-server --daemonize yes --port 6379 + sleep 2 + fi +else + echo "⚠ Redis not available - using in-memory session storage" +fi + +# Set environment variables +export REDIS_HOST=${REDIS_HOST:-localhost} +export REDIS_PORT=${REDIS_PORT:-6379} + +# Create log directory +mkdir -p logs + +echo "" +echo "🎯 Service Configuration:" +echo " GPU Processing: $GPU_AVAILABLE" +echo " Session Storage: ${REDIS_HOST:-memory}:${REDIS_PORT:-N/A}" +echo " Service Port: 8082" +echo "" + +# Function to start service with proper error handling +start_service() { + local service_name=$1 + local script_path=$2 + local port=$3 + local log_file=$4 + + echo "Starting $service_name on port $port..." + + # Kill existing process if running + if lsof -Pi :$port -sTCP:LISTEN -t >/dev/null; then + echo " Killing existing process on port $port" + kill $(lsof -t -i:$port) 2>/dev/null || true + sleep 2 + fi + + # Start the service + python $script_path > logs/$log_file 2>&1 & + local pid=$! + + # Wait a moment and check if it started successfully + sleep 3 + if kill -0 $pid 2>/dev/null; then + echo " ✓ $service_name started successfully (PID: $pid)" + echo $pid > logs/${service_name,,}_pid.txt + return 0 + else + echo " ✗ Failed to start $service_name" + echo " Check logs/$log_file for details" + return 1 + fi +} + +# Start Remote GPU Rendering Service +echo "📊 Starting Remote GPU Rendering Service..." +start_service "RemoteGPURenderer" "remote_gpu_rendering_service.py" 8082 "remote_gpu_rendering.log" + +if [ $? -eq 0 ]; then + echo "" + echo "✅ Remote GPU Rendering Service is ready!" 
+ echo "" + echo "🎯 Available endpoints:" + echo " Process graph: POST http://localhost:8082/api/render" + echo " Iframe visualization: GET http://localhost:8082/embed/{session_id}" + echo " Session status: GET http://localhost:8082/api/session/{session_id}" + echo " Real-time updates: WS ws://localhost:8082/ws/{session_id}" + echo " Health check: GET http://localhost:8082/api/health" + echo "" + echo "📋 Usage examples:" + echo "" + echo " # Test health check" + echo " curl http://localhost:8082/api/health" + echo "" + echo " # Process a sample graph" + echo " curl -X POST http://localhost:8082/api/render \\" + echo " -H 'Content-Type: application/json' \\" + echo " -d '{" + echo " \"graph_data\": {" + echo " \"nodes\": [{\"id\": \"1\", \"name\": \"Node 1\"}, {\"id\": \"2\", \"name\": \"Node 2\"}]," + echo " \"links\": [{\"source\": \"1\", \"target\": \"2\", \"name\": \"edge_1_2\"}]" + echo " }," + echo " \"layout_algorithm\": \"force_atlas2\"," + echo " \"clustering_algorithm\": \"leiden\"," + echo " \"compute_centrality\": true," + echo " \"render_quality\": \"high\"," + echo " \"interactive_mode\": true" + echo " }'" + echo "" + echo "📁 Logs are available in:" + echo " Remote GPU Rendering: logs/remote_gpu_rendering.log" + echo "" + echo "🛠️ Integration with frontend:" + echo " import { RemoteGPUViewer } from '@/components/remote-gpu-viewer'" + echo " console.error(err)}" + echo " />" + echo "" + echo "⚡ Performance tips:" + echo " - Use 'ultra' quality for 1M+ node graphs" + echo " - Enable Redis for production session storage" + echo " - Run on GPU server for maximum performance" + echo " - Use iframe embedding to isolate visualization" + echo "" + + # Start a simple monitoring script + echo "🔍 Starting service monitor..." + monitor_services() { + while true; do + sleep 30 + + # Check if services are still running + if [ -f logs/remotegpurenderer_pid.txt ]; then + pid=$(cat logs/remotegpurenderer_pid.txt) + if ! 
kill -0 $pid 2>/dev/null; then + echo "$(date): Remote GPU Rendering Service died, restarting..." + start_service "RemoteGPURenderer" "remote_gpu_rendering_service.py" 8082 "remote_gpu_rendering.log" + fi + fi + done + } + + # Run monitor in background + monitor_services & + echo $! > logs/monitor_pid.txt + + echo "✅ All services started and monitoring enabled!" + echo "" + echo "To stop all services, run: ./stop_remote_gpu_services.sh" + echo "To view logs in real-time: tail -f logs/remote_gpu_rendering.log" + +else + echo "" + echo "❌ Failed to start Remote GPU Rendering Service" + echo "Check the logs for details and ensure dependencies are installed:" + echo " - FastAPI: pip install fastapi uvicorn" + echo " - RAPIDS (optional): conda install -c rapidsai cudf cugraph" + echo " - Redis (optional): sudo apt-get install redis-server" + exit 1 +fi \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/stop_remote_gpu_services.sh b/nvidia/txt2kg/assets/deploy/services/gpu-viz/stop_remote_gpu_services.sh new file mode 100755 index 0000000..8901987 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/stop_remote_gpu_services.sh @@ -0,0 +1,64 @@ +#!/bin/bash + +# Stop Remote GPU Rendering Services + +echo "🛑 Stopping Remote GPU Rendering Services" +echo "=========================================" + +# Function to stop service by PID file +stop_service() { + local service_name=$1 + local pid_file=$2 + + if [ -f "$pid_file" ]; then + local pid=$(cat "$pid_file") + if kill -0 "$pid" 2>/dev/null; then + echo "Stopping $service_name (PID: $pid)..." + kill "$pid" + + # Wait for graceful shutdown + local count=0 + while kill -0 "$pid" 2>/dev/null && [ $count -lt 10 ]; do + sleep 1 + count=$((count + 1)) + done + + # Force kill if still running + if kill -0 "$pid" 2>/dev/null; then + echo " Force killing $service_name..." 
+ kill -9 "$pid" + fi + + echo " ✓ $service_name stopped" + else + echo " $service_name was not running" + fi + rm -f "$pid_file" + else + echo " No PID file found for $service_name" + fi +} + +# Stop services +stop_service "Remote GPU Renderer" "logs/remotegpurenderer_pid.txt" +stop_service "Service Monitor" "logs/monitor_pid.txt" + +# Stop any remaining processes on the service ports +echo "" +echo "🔍 Checking for remaining processes on service ports..." + +ports=(8082) +for port in "${ports[@]}"; do + if lsof -Pi :$port -sTCP:LISTEN -t >/dev/null 2>&1; then + echo "Killing process on port $port..." + kill $(lsof -t -i:$port) 2>/dev/null || true + fi +done + +echo "" +echo "✅ All Remote GPU Rendering Services stopped" +echo "" +echo "📁 Log files are preserved in logs/ directory:" +echo " - logs/remote_gpu_rendering.log" +echo "" +echo "To restart services, run: ./start_remote_gpu_services.sh" \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/true_gpu_rendering_comparison.md b/nvidia/txt2kg/assets/deploy/services/gpu-viz/true_gpu_rendering_comparison.md new file mode 100644 index 0000000..8f01d43 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/true_gpu_rendering_comparison.md @@ -0,0 +1,243 @@ +# True GPU Rendering vs Current Approach + +## 🎯 **Current Remote GPU Service** + +### **What Uses GPU (✅)** +- **Graph Layout**: cuGraph Force Atlas 2, Spectral Layout +- **Clustering**: cuGraph Leiden, Louvain algorithms +- **Centrality**: cuGraph PageRank, Betweenness Centrality +- **Data Processing**: Node positioning, edge bundling + +### **What Uses CPU (❌)** +- **Visual Rendering**: D3.js SVG/Canvas drawing +- **Animation**: D3.js transitions and transforms +- **Interaction**: DOM event handling, hover, zoom +- **Text Rendering**: Node labels, tooltips + +## 🔥 **True GPU Rendering (Like PyGraphistry)** + +### **What Would Need GPU Acceleration** + +#### **1. 
WebGL Compute Shaders** +```glsl +// Vertex shader for node positioning +attribute vec2 position; +attribute float size; +attribute vec3 color; + +uniform mat4 projectionMatrix; +uniform float time; + +void main() { + // GPU-accelerated node positioning + vec2 pos = position + computeForceLayout(time); + gl_Position = projectionMatrix * vec4(pos, 0.0, 1.0); + gl_PointSize = size; +} +``` + +#### **2. GPU Particle Systems** +```javascript +// WebGL-based node rendering +class GPUNodeRenderer { + constructor(gl, nodeCount) { + this.nodeCount = nodeCount; + + // Create vertex buffers for GPU processing + this.positionBuffer = gl.createBuffer(); + this.colorBuffer = gl.createBuffer(); + this.sizeBuffer = gl.createBuffer(); + + // Compile GPU shaders + this.program = this.createShaderProgram(gl); + } + + render(nodes) { + // Update GPU buffers - no CPU iteration + gl.bindBuffer(gl.ARRAY_BUFFER, this.positionBuffer); + gl.bufferData(gl.ARRAY_BUFFER, new Float32Array(positions), gl.DYNAMIC_DRAW); + + // GPU draws all nodes in single call + gl.drawArrays(gl.POINTS, 0, this.nodeCount); + } +} +``` + +#### **3. 
GPU-Based Interaction** +```javascript +// GPU picking for node selection +class GPUPicker { + constructor(gl, nodeCount) { + // Render nodes to off-screen framebuffer with unique colors + this.pickingFramebuffer = gl.createFramebuffer(); + this.pickingTexture = gl.createTexture(); + } + + getNodeAtPosition(x, y) { + // Read single pixel from GPU framebuffer + const pixel = new Uint8Array(4); + gl.readPixels(x, y, 1, 1, gl.RGBA, gl.UNSIGNED_BYTE, pixel); + + // Decode node ID from color + return this.colorToNodeId(pixel); + } +} +``` + +## 📊 **Performance Comparison** + +### **Current D3.js CPU Rendering** +```javascript +// CPU-bound operations +nodes.forEach(node => { + // For each node, update DOM element + d3.select(`#node-${node.id}`) + .attr("cx", node.x) + .attr("cy", node.y) + .attr("r", node.size); +}); + +// Performance: O(n) DOM operations +// 10k nodes = 10k DOM updates per frame +// Maximum ~60fps with heavy optimization +``` + +### **GPU WebGL Rendering** +```javascript +// GPU-accelerated operations +class GPURenderer { + updateNodes(nodeData) { + // Single buffer update for all nodes + gl.bufferSubData(gl.ARRAY_BUFFER, 0, nodeData); + + // Single draw call for all nodes + gl.drawArraysInstanced(gl.TRIANGLES, 0, 6, nodeCount); + } +} + +// Performance: O(1) GPU operations +// 1M nodes = 1 GPU draw call +// Can maintain 60fps with millions of nodes +``` + +## 🛠️ **Implementation Options** + +### **Option 1: WebGL2 + Compute Shaders** +```html + + + +``` + +### **Option 2: WebGPU (Future)** +```javascript +// Next-generation WebGPU API +const adapter = await navigator.gpu.requestAdapter(); +const device = await adapter.requestDevice(); + +// GPU compute pipeline for layout +const computePipeline = device.createComputePipeline({ + compute: { + module: device.createShaderModule({ code: layoutComputeShader }), + entryPoint: 'main' + } +}); + +// GPU render pipeline +const renderPipeline = device.createRenderPipeline({ + vertex: { module: 
vertexShaderModule, entryPoint: 'main' }, + fragment: { module: fragmentShaderModule, entryPoint: 'main' }, + primitive: { topology: 'point-list' } +}); +``` + +### **Option 3: Three.js GPU Optimization** +```javascript +// Use Three.js InstancedMesh for GPU instancing +import * as THREE from 'three'; + +class GPUGraphRenderer { + constructor(nodeCount) { + // Single geometry instanced for all nodes + const geometry = new THREE.CircleGeometry(1, 8); + const material = new THREE.MeshBasicMaterial(); + + // GPU-instanced mesh for all nodes + this.instancedMesh = new THREE.InstancedMesh( + geometry, material, nodeCount + ); + + // Position matrix for each instance + this.matrix = new THREE.Matrix4(); + } + + updateNode(index, x, y, scale, color) { + // Update single instance matrix + this.matrix.makeScale(scale, scale, 1); + this.matrix.setPosition(x, y, 0); + this.instancedMesh.setMatrixAt(index, this.matrix); + this.instancedMesh.setColorAt(index, color); + } + + render() { + // Single GPU draw call for all nodes + this.instancedMesh.instanceMatrix.needsUpdate = true; + this.instancedMesh.instanceColor.needsUpdate = true; + } +} +``` + +## 🎯 **Recommendation** + +### **Current Approach is Good For:** +- ✅ **Rapid development** - Standard D3.js patterns +- ✅ **Small-medium graphs** (<50k nodes) +- ✅ **Interactive features** - Easy DOM manipulation +- ✅ **Debugging** - Standard web dev tools +- ✅ **Compatibility** - Works in all browsers + +### **True GPU Rendering Needed For:** +- 🚀 **Million+ node graphs** with smooth 60fps +- 🚀 **Real-time layout animation** +- 🚀 **Complex visual effects** (particles, trails) +- 🚀 **VR/AR graph visualization** +- 🚀 **Multi-touch interaction** on large displays + +## 💡 **Hybrid Solution** + +The optimal approach combines both: + +```javascript +// Intelligent renderer selection +const selectRenderer = (nodeCount) => { + if (nodeCount < 10000) { + return new D3SVGRenderer(); // CPU DOM rendering + } else if (nodeCount < 100000) { + 
return new ThreeJSRenderer(); // WebGL with Three.js + } else { + return new WebGLRenderer(); // Custom GPU shaders + } +}; +``` + +**Current Status:** Your remote service provides **GPU-accelerated data processing** with **CPU-based rendering** - which is perfect for most use cases and much easier to develop/maintain than full GPU rendering. \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/unified_gpu_service.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/unified_gpu_service.py new file mode 100644 index 0000000..91f6682 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/unified_gpu_service.py @@ -0,0 +1,773 @@ +#!/usr/bin/env python3 +""" +Unified GPU Graph Visualization Service + +Combines PyGraphistry cloud processing and local GPU processing with cuGraph +into a single FastAPI service for maximum flexibility. +""" + +import os +import json +import numpy as np +import pandas as pd +from typing import Dict, List, Any, Optional, Tuple +import asyncio +import logging +from datetime import datetime +from fastapi import FastAPI, HTTPException, WebSocket, WebSocketDisconnect, BackgroundTasks +from fastapi.responses import HTMLResponse +from pydantic import BaseModel +import uvicorn +import time +from concurrent.futures import ThreadPoolExecutor +import networkx as nx +from enum import Enum + +# PyGraphistry imports +import graphistry + +# GPU-accelerated imports (available in NVIDIA PyG container) +try: + import cudf + import cugraph + import cupy as cp + from cuml import UMAP + HAS_RAPIDS = True + print("✓ RAPIDS cuGraph/cuDF/cuML available") +except ImportError: + HAS_RAPIDS = False + print("⚠ RAPIDS not available, falling back to CPU") + +try: + import torch + import torch_geometric + HAS_TORCH_GEOMETRIC = True + print("✓ PyTorch Geometric available") +except ImportError: + HAS_TORCH_GEOMETRIC = False + print("⚠ PyTorch Geometric not available") + +# Configure logging +logging.basicConfig(level=logging.INFO) 
+logger = logging.getLogger(__name__) + +class ProcessingMode(str, Enum): + PYGRAPHISTRY_CLOUD = "pygraphistry_cloud" + LOCAL_GPU = "local_gpu" + LOCAL_CPU = "local_cpu" + +class GraphPattern(str, Enum): + RANDOM = "random" + SCALE_FREE = "scale-free" + SMALL_WORLD = "small-world" + CLUSTERED = "clustered" + HIERARCHICAL = "hierarchical" + GRID = "grid" + +class GraphData(BaseModel): + nodes: List[Dict[str, Any]] + links: List[Dict[str, Any]] + +class GraphGenerationRequest(BaseModel): + num_nodes: int + pattern: GraphPattern = GraphPattern.SCALE_FREE + avg_degree: Optional[int] = 5 + num_clusters: Optional[int] = 100 + small_world_k: Optional[int] = 6 + small_world_p: Optional[float] = 0.1 + grid_dimensions: Optional[List[int]] = [100, 100] + seed: Optional[int] = None + +class UnifiedVisualizationRequest(BaseModel): + graph_data: GraphData + processing_mode: ProcessingMode = ProcessingMode.PYGRAPHISTRY_CLOUD + + # PyGraphistry Cloud options + layout_type: Optional[str] = "force" + gpu_acceleration: Optional[bool] = True + clustering: Optional[bool] = False + + # Local GPU options + layout_algorithm: Optional[str] = "force_atlas2" + clustering_algorithm: Optional[str] = "leiden" + compute_centrality: Optional[bool] = True + +class GraphGenerationStatus(BaseModel): + task_id: str + status: str # "running", "completed", "failed" + progress: float + message: str + result: Optional[Dict[str, Any]] = None + error: Optional[str] = None + +# Import graph generation classes (keeping existing code) +from pygraphistry_service import LargeGraphGenerator, init_graphistry + +class LocalGPUProcessor: + """GPU-accelerated graph processing using cuGraph""" + + def __init__(self): + self.use_gpu = HAS_RAPIDS + logger.info(f"Local GPU Processor initialized (GPU: {self.use_gpu})") + + def create_cugraph_from_data(self, nodes: List[Dict], edges: List[Dict]) -> Tuple['cugraph.Graph', 'cudf.DataFrame']: + """Create cuGraph from node/edge data""" + if not self.use_gpu: + raise 
RuntimeError("GPU libraries not available") + + # Create edge dataframe + edge_data = [] + for edge in edges: + edge_data.append({ + 'src': edge['source'], + 'dst': edge['target'], + 'weight': edge.get('weight', 1.0) + }) + + # Convert to cuDF + edges_df = cudf.DataFrame(edge_data) + + # Create cuGraph + G = cugraph.Graph() + G.from_cudf_edgelist(edges_df, source='src', destination='dst', edge_attr='weight') + + return G, edges_df + + def compute_gpu_layout(self, G, algorithm: str = "force_atlas2") -> Dict[str, Tuple[float, float]]: + """Compute GPU-accelerated graph layout""" + try: + if algorithm == "force_atlas2": + layout_df = cugraph.force_atlas2(G) + elif algorithm == "fruchterman_reingold": + layout_df = cugraph.spectral_layout(G, dim=2) + else: # spectral + layout_df = cugraph.spectral_layout(G, dim=2) + + # Convert to dictionary + positions = {} + for _, row in layout_df.iterrows(): + node_id = str(row['vertex']) + positions[node_id] = (float(row['x']), float(row['y'])) + + logger.info(f"Computed {algorithm} layout for {len(positions)} nodes on GPU") + return positions + + except Exception as e: + logger.error(f"GPU layout computation failed: {e}") + return {} + + def compute_gpu_clustering(self, G, algorithm: str = "leiden") -> Dict[str, int]: + """Compute GPU-accelerated community detection""" + try: + if algorithm == "leiden": + clusters_df, modularity = cugraph.leiden(G) + elif algorithm == "louvain": + clusters_df, modularity = cugraph.louvain(G) + else: # spectral clustering + clusters_df = cugraph.spectral_clustering(G, n_clusters=10) + modularity = 0.0 + + # Convert to dictionary + clusters = {} + for _, row in clusters_df.iterrows(): + node_id = str(row['vertex']) + clusters[node_id] = int(row['partition']) + + logger.info(f"Computed {algorithm} clustering on GPU (modularity: {modularity:.3f})") + return clusters + + except Exception as e: + logger.error(f"GPU clustering failed: {e}") + return {} + + def compute_gpu_centrality(self, G) -> 
Dict[str, Dict[str, float]]: + """Compute GPU-accelerated centrality measures""" + centrality_data = {} + + try: + # PageRank + pagerank_df = cugraph.pagerank(G) + pagerank = {} + for _, row in pagerank_df.iterrows(): + pagerank[str(row['vertex'])] = float(row['pagerank']) + centrality_data['pagerank'] = pagerank + + # Betweenness centrality (for smaller graphs) + if G.number_of_vertices() < 5000: + betweenness_df = cugraph.betweenness_centrality(G) + betweenness = {} + for _, row in betweenness_df.iterrows(): + betweenness[str(row['vertex'])] = float(row['betweenness_centrality']) + centrality_data['betweenness'] = betweenness + + logger.info(f"Computed centrality measures on GPU") + return centrality_data + + except Exception as e: + logger.error(f"GPU centrality computation failed: {e}") + return {} + +class PyGraphistryProcessor: + """PyGraphistry cloud processing (existing functionality)""" + + def __init__(self): + self.initialized = init_graphistry() + + async def process_graph_data(self, request: UnifiedVisualizationRequest) -> Dict[str, Any]: + """Process graph data with PyGraphistry GPU acceleration""" + try: + if not self.initialized: + raise HTTPException(status_code=500, detail="PyGraphistry not initialized") + + # Convert to pandas DataFrames for PyGraphistry + nodes_df = pd.DataFrame(request.graph_data.nodes) + edges_df = pd.DataFrame(request.graph_data.links) + + # Ensure required columns exist + if 'id' not in nodes_df.columns: + nodes_df['id'] = nodes_df.index + if 'source' not in edges_df.columns or 'target' not in edges_df.columns: + raise HTTPException(status_code=400, detail="Links must have source and target columns") + + logger.info(f"Processing graph with {len(nodes_df)} nodes and {len(edges_df)} edges") + + # Create PyGraphistry graph object + g = graphistry.edges(edges_df, 'source', 'target').nodes(nodes_df, 'id') + + # Apply GPU-accelerated processing + if request.gpu_acceleration: + g = await self._apply_gpu_acceleration(g, request) + + 
# Apply clustering if requested + if request.clustering: + g = await self._apply_clustering(g) + + # Generate layout + g = await self._generate_layout(g, request.layout_type) + + # Extract processed data + processed_nodes = g._nodes.to_dict('records') if g._nodes is not None else nodes_df.to_dict('records') + processed_edges = g._edges.to_dict('records') if g._edges is not None else edges_df.to_dict('records') + + # Generate embedding URL for interactive visualization + embed_url = None + local_viz_data = None + + try: + embed_url = g.plot(render=False) + logger.info(f"Generated PyGraphistry embed URL: {embed_url}") + except Exception as e: + logger.warning(f"Could not generate embed URL (likely running in local mode): {e}") + + # Create local visualization data as fallback + try: + local_viz_data = self._create_local_viz_data(g, processed_nodes, processed_edges) + logger.info("Generated local visualization data as fallback") + except Exception as viz_e: + logger.warning(f"Could not generate local visualization data: {viz_e}") + + return { + "processed_nodes": processed_nodes, + "processed_edges": processed_edges, + "embed_url": embed_url, + "local_viz_data": local_viz_data, + "processing_mode": ProcessingMode.PYGRAPHISTRY_CLOUD, + "stats": { + "node_count": len(processed_nodes), + "edge_count": len(processed_edges), + "gpu_accelerated": request.gpu_acceleration, + "clustered": request.clustering, + "layout_type": request.layout_type, + "has_embed_url": embed_url is not None, + "has_local_viz": local_viz_data is not None + }, + "timestamp": datetime.now().isoformat() + } + + except Exception as e: + logger.error(f"Error processing graph data: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + # ... 
(include other PyGraphistry methods from original service) + async def _apply_gpu_acceleration(self, g, request): + # Implementation from original service + pass + + async def _apply_clustering(self, g): + # Implementation from original service + pass + + async def _generate_layout(self, g, layout_type): + # Implementation from original service + pass + + def _create_local_viz_data(self, g, processed_nodes, processed_edges): + # Implementation from original service + pass + +class UnifiedGPUService: + """Unified service offering both PyGraphistry cloud and local GPU processing""" + + def __init__(self): + self.pygraphistry_processor = PyGraphistryProcessor() + self.local_gpu_processor = LocalGPUProcessor() + self.generation_tasks = {} + self.executor = ThreadPoolExecutor(max_workers=4) + self.active_connections: List[WebSocket] = [] + + async def process_graph(self, request: UnifiedVisualizationRequest) -> Dict[str, Any]: + """Process graph with selected processing mode""" + + if request.processing_mode == ProcessingMode.PYGRAPHISTRY_CLOUD: + return await self.pygraphistry_processor.process_graph_data(request) + + elif request.processing_mode == ProcessingMode.LOCAL_GPU: + return await self._process_with_local_gpu(request) + + else: # LOCAL_CPU + return await self._process_with_local_cpu(request) + + async def _process_with_local_gpu(self, request: UnifiedVisualizationRequest) -> Dict[str, Any]: + """Process graph with local GPU acceleration""" + try: + nodes = request.graph_data.nodes + edges = request.graph_data.links + + result = { + "processed_nodes": nodes.copy(), + "processed_edges": edges.copy(), + "processing_mode": ProcessingMode.LOCAL_GPU, + "gpu_processed": False, + "layout_positions": {}, + "clusters": {}, + "centrality": {}, + "stats": {}, + "timestamp": datetime.now().isoformat() + } + + if self.local_gpu_processor.use_gpu: + logger.info("=== LOCAL GPU PROCESSING START ===") + + # Create cuGraph + G, edges_df = 
self.local_gpu_processor.create_cugraph_from_data(nodes, edges) + + # Compute layout on GPU + positions = self.local_gpu_processor.compute_gpu_layout(G, request.layout_algorithm) + if positions: + result["layout_positions"] = positions + # Add positions to nodes + for node in result["processed_nodes"]: + node_id = str(node["id"]) + if node_id in positions: + node["x"], node["y"] = positions[node_id] + + # Compute clustering on GPU + clusters = self.local_gpu_processor.compute_gpu_clustering(G, request.clustering_algorithm) + if clusters: + result["clusters"] = clusters + # Add cluster info to nodes + for node in result["processed_nodes"]: + node_id = str(node["id"]) + if node_id in clusters: + node["cluster"] = clusters[node_id] + + # Compute centrality on GPU + if request.compute_centrality: + centrality = self.local_gpu_processor.compute_gpu_centrality(G) + result["centrality"] = centrality + # Add centrality to nodes + for node in result["processed_nodes"]: + node_id = str(node["id"]) + for metric, values in centrality.items(): + if node_id in values: + node[metric] = values[node_id] + + result["gpu_processed"] = True + result["stats"] = { + "node_count": len(nodes), + "edge_count": len(edges), + "gpu_accelerated": True, + "layout_computed": len(positions) > 0, + "clusters_computed": len(clusters) > 0, + "centrality_computed": len(centrality) > 0 + } + + logger.info("=== LOCAL GPU PROCESSING COMPLETE ===") + + return result + + except Exception as e: + logger.error(f"Local GPU processing failed: {e}") + raise HTTPException(status_code=500, detail=str(e)) + + async def _process_with_local_cpu(self, request: UnifiedVisualizationRequest) -> Dict[str, Any]: + """Process graph with local CPU (NetworkX fallback)""" + # Simple CPU fallback using NetworkX + nodes = request.graph_data.nodes + edges = request.graph_data.links + + return { + "processed_nodes": nodes, + "processed_edges": edges, + "processing_mode": ProcessingMode.LOCAL_CPU, + "gpu_processed": False, + 
"stats": { + "node_count": len(nodes), + "edge_count": len(edges), + "gpu_accelerated": False + }, + "timestamp": datetime.now().isoformat() + } + + async def broadcast_update(self, data: Dict[str, Any]): + """Broadcast updates to all connected WebSocket clients""" + if self.active_connections: + message = json.dumps(data) + for connection in self.active_connections.copy(): + try: + await connection.send_text(message) + except WebSocketDisconnect: + self.active_connections.remove(connection) + +# FastAPI app +app = FastAPI(title="Unified GPU Graph Visualization Service", version="2.0.0") +service = UnifiedGPUService() + +@app.post("/api/visualize") +async def visualize_graph(request: UnifiedVisualizationRequest): + """Process graph with unified service (supports all processing modes)""" + result = await service.process_graph(request) + + # Broadcast to connected WebSocket clients + await service.broadcast_update({ + "type": "graph_processed", + "data": result + }) + + return result + +@app.post("/api/generate") +async def generate_graph(request: GraphGenerationRequest): + """Start graph generation as background task""" + if request.num_nodes > 1000000: + raise HTTPException(status_code=400, detail="Maximum 1 million nodes allowed") + + # Use existing graph generation logic + task_id = f"gen_{int(time.time() * 1000)}" + # Implementation would go here... 
+ return {"task_id": task_id, "status": "started"} + +@app.get("/api/capabilities") +async def get_capabilities(): + """Get GPU capabilities and available processing modes""" + return { + "processing_modes": { + "pygraphistry_cloud": { + "available": service.pygraphistry_processor.initialized, + "description": "PyGraphistry cloud GPU processing with interactive embeds" + }, + "local_gpu": { + "available": HAS_RAPIDS, + "description": "Local GPU processing with cuGraph/RAPIDS" + }, + "local_cpu": { + "available": True, + "description": "Local CPU fallback processing" + } + }, + "has_rapids": HAS_RAPIDS, + "has_torch_geometric": HAS_TORCH_GEOMETRIC, + "gpu_available": HAS_RAPIDS, + "supported_layouts": ["force_atlas2", "spectral", "fruchterman_reingold"], + "supported_clustering": ["leiden", "louvain", "spectral"] + } + +@app.websocket("/ws") +async def websocket_endpoint(websocket: WebSocket): + """WebSocket endpoint for real-time updates""" + await websocket.accept() + service.active_connections.append(websocket) + + try: + while True: + await websocket.receive_text() + except WebSocketDisconnect: + service.active_connections.remove(websocket) + +@app.get("/api/sample-graph") +async def get_sample_graph(): + """Get a sample graph for testing""" + return { + "nodes": [ + {"id": "1", "name": "Central Hub", "group": "core"}, + {"id": "2", "name": "Data Source A", "group": "input"}, + {"id": "3", "name": "Data Source B", "group": "input"}, + {"id": "4", "name": "Processing Unit", "group": "compute"}, + {"id": "5", "name": "Output A", "group": "output"}, + {"id": "6", "name": "Output B", "group": "output"}, + {"id": "7", "name": "Analytics", "group": "analysis"}, + {"id": "8", "name": "Storage", "group": "storage"} + ], + "links": [ + {"source": "2", "target": "1", "name": "data_feed"}, + {"source": "3", "target": "1", "name": "data_feed"}, + {"source": "1", "target": "4", "name": "process"}, + {"source": "4", "target": "5", "name": "output"}, + {"source": "4", 
"target": "6", "name": "output"}, + {"source": "1", "target": "7", "name": "analyze"}, + {"source": "1", "target": "8", "name": "store"} + ] + } + +@app.get("/api/health") +async def health_check(): + """Health check endpoint""" + return { + "status": "healthy", + "pygraphistry_initialized": service.pygraphistry_processor.initialized, + "local_gpu_available": HAS_RAPIDS, + "torch_geometric": HAS_TORCH_GEOMETRIC, + "timestamp": datetime.now().isoformat() + } + +@app.get("/", response_class=HTMLResponse) +async def get_visualization_page(): + """Serve the interactive visualization page""" + return """ + + + + Unified GPU Graph Visualization + + + + +
+

⚡ Unified GPU Visualization

+
+ + +
+ +
Ready - Select processing mode and load graph
+
+
+ + + + + """ + +def startup_diagnostics(): + """Run startup diagnostics and display system info""" + print("🚀 Starting Unified GPU-accelerated graph visualization service...") + print("Container: NVIDIA PyG with cuGraph/RAPIDS support") + + # Check GPU availability + try: + import subprocess + result = subprocess.run(['nvidia-smi', '--query-gpu=gpu_name,memory.total,memory.used', '--format=csv,noheader,nounits'], + capture_output=True, text=True, timeout=5) + if result.returncode == 0: + print("✓ GPU detected:") + for line in result.stdout.strip().split('\n'): + if line.strip(): + print(f" {line.strip()}") + else: + print("⚠ No GPU detected, will use CPU fallback") + except (subprocess.TimeoutExpired, FileNotFoundError, Exception): + print("⚠ No GPU detected, will use CPU fallback") + + # Check RAPIDS availability + if HAS_RAPIDS: + print("✓ RAPIDS cuGraph/cuDF/cuML available") + else: + print("⚠ RAPIDS not available") + + # Check PyTorch Geometric + if HAS_TORCH_GEOMETRIC: + print("✓ PyTorch Geometric available") + else: + print("⚠ PyTorch Geometric not available") + + # Check PyGraphistry credentials + print("Checking PyGraphistry credentials...") + personal_key = os.getenv('GRAPHISTRY_PERSONAL_KEY') + secret_key = os.getenv('GRAPHISTRY_SECRET_KEY') + api_key = os.getenv('GRAPHISTRY_API_KEY') + + if personal_key and secret_key: + print("✓ PyGraphistry personal key/secret found") + elif api_key: + print("✓ PyGraphistry API key found") + else: + print("⚠ No PyGraphistry credentials found - cloud mode will be limited") + print(" Set GRAPHISTRY_PERSONAL_KEY + GRAPHISTRY_SECRET_KEY for full cloud features") + + print("") + print("🎯 Available Processing Modes:") + print(" ☁️ PyGraphistry Cloud - Interactive GPU embeds (requires credentials)") + print(" 🚀 Local GPU (cuGraph) - Full local GPU processing") + print(" 💻 Local CPU - NetworkX fallback") + print("") + print("📊 Service starting on: http://0.0.0.0:8080") + print("🎯 API Endpoints:") + print(" - Unified 
processing: POST /api/visualize") + print(" - Processing modes: GET /api/capabilities") + print(" - Health check: GET /api/health") + print(" - WebSocket updates: WS /ws") + print("") + +if __name__ == "__main__": + startup_diagnostics() + uvicorn.run(app, host="0.0.0.0", port=8080) \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/gpu-viz/webgl_rendering_enhancement.py b/nvidia/txt2kg/assets/deploy/services/gpu-viz/webgl_rendering_enhancement.py new file mode 100644 index 0000000..0b49a9c --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/gpu-viz/webgl_rendering_enhancement.py @@ -0,0 +1,661 @@ +# WebGL-Enhanced Remote GPU Rendering Service +# Using Three.js for GPU-accelerated visualization + +import json +from typing import Dict, Any, List + +class WebGLGPUVisualizationService: + """Enhanced remote GPU service with Three.js WebGL rendering""" + + def _generate_threejs_webgl_html(self, session_data: dict, config: dict) -> str: + """Generate Three.js WebGL visualization with GPU-accelerated rendering""" + + # Extract data + nodes = session_data['processed_nodes'] + edges = session_data['processed_edges'] + layout_positions = session_data.get('layout_positions', {}) + clusters = session_data.get('clusters', {}) + centrality = session_data.get('centrality', {}) + + # Configuration + animation_duration = config.get('animation_duration', 3000) + show_splash = config.get('show_splash', True) + auto_zoom = config.get('auto_zoom', True) + show_labels = config.get('show_labels', True) + background_color = config.get('background_color', '#0a0a0a') + render_quality = config.get('render_quality', 'high') + + # GPU rendering settings + gpu_settings = self._get_gpu_rendering_settings(len(nodes), render_quality) + + html_template = f""" + + + + + + GPU-Accelerated WebGL Graph Visualization + + + +
+ + + +
+
🚀 WebGL GPU Rendering
+
FPS: --
+
Nodes: {len(nodes):,}
+
Triangles: --
+
Memory: --MB
+
+ + +
+ + + + + +
+ + +
+ + {"" if not show_splash else ''' +
+ +
+ Loading {len(nodes):,} nodes with GPU acceleration
+ Quality: {render_quality.title()} • WebGL 2.0 +
+
+
+
+
Initializing WebGL...
+
+ '''} +
+ + + + + + + + """ + + return html_template + + def _get_gpu_rendering_settings(self, node_count: int, quality: str) -> Dict[str, Any]: + """Get GPU rendering settings based on graph size and quality""" + + base_settings = { + 'max_instanced_nodes': 100000, + 'use_instanced_mesh': node_count > 1000, + 'enable_lod': node_count > 25000, + 'frustum_culling': node_count > 10000, + 'texture_atlas_size': 1024 + } + + quality_multipliers = { + 'low': {'texture_atlas_size': 512, 'max_instanced_nodes': 50000}, + 'medium': {'texture_atlas_size': 1024, 'max_instanced_nodes': 75000}, + 'high': {'texture_atlas_size': 2048, 'max_instanced_nodes': 100000}, + 'ultra': {'texture_atlas_size': 4096, 'max_instanced_nodes': 500000} + } + + settings = base_settings.copy() + settings.update(quality_multipliers.get(quality, quality_multipliers['high'])) + + return settings \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/Dockerfile b/nvidia/txt2kg/assets/deploy/services/ollama/Dockerfile new file mode 100644 index 0000000..e3e76d8 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/Dockerfile @@ -0,0 +1,9 @@ +FROM ollama/ollama:latest + +# Copy the entrypoint script +COPY entrypoint.sh /entrypoint.sh +RUN chmod +x /entrypoint.sh + +# Set the entrypoint +ENTRYPOINT ["/entrypoint.sh"] + diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/Dockerfile.monitor b/nvidia/txt2kg/assets/deploy/services/ollama/Dockerfile.monitor new file mode 100644 index 0000000..8a864ac --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/Dockerfile.monitor @@ -0,0 +1,26 @@ +FROM ubuntu:22.04 + +# Install required packages +RUN apt-get update && apt-get install -y \ + curl \ + docker.io \ + bc \ + && rm -rf /var/lib/apt/lists/* + +# Copy the monitoring script +COPY gpu_memory_monitor.sh /usr/local/bin/ +RUN chmod +x /usr/local/bin/gpu_memory_monitor.sh + +# Create a non-root user +RUN useradd -m -s /bin/bash monitor + +# Set environment variables 
with defaults +ENV CHECK_INTERVAL=60 +ENV MIN_AVAILABLE_PERCENT=70 +ENV AUTO_FIX=true + +# Run as non-root user +USER monitor +WORKDIR /home/monitor + +CMD ["/usr/local/bin/gpu_memory_monitor.sh"] diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/NVIDIA_MPS_GUIDE.md b/nvidia/txt2kg/assets/deploy/services/ollama/NVIDIA_MPS_GUIDE.md new file mode 100644 index 0000000..dc08af0 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/NVIDIA_MPS_GUIDE.md @@ -0,0 +1,252 @@ +# NVIDIA MPS Guide for Ollama GPU Optimization + +## 🚀 Overview + +NVIDIA Multi-Process Service (MPS) is a game-changing technology that enables multiple processes to share a single GPU context, eliminating expensive context switching overhead and dramatically improving concurrent workload performance. + +This guide documents our discovery: **MPS transforms the DGX Spark from a single-threaded bottleneck into a high-throughput powerhouse**, achieving **3x concurrent performance** with near-perfect scaling. + +## 📊 Performance Results Summary + +### Triple Extraction Benchmark (llama3.1:8b) + +| System | Mode | Individual Performance | Aggregate Throughput | Scaling Efficiency | +|--------|------|----------------------|---------------------|-------------------| +| **RTX 5090** | Single | ~300 tok/s | 300 tok/s | 100% (baseline) | +| **Mac M4 Pro** | Single | ~45 tok/s | 45 tok/s | 100% (baseline) | +| **DGX Spark** | Single (MPS) | 33.3 tok/s | 33.3 tok/s | 100% (baseline) | +| **DGX Spark** | 2x Concurrent | ~33.2 tok/s each | **66.4 tok/s** | **97% efficiency** | +| **DGX Spark** | 3x Concurrent | ~33.1 tok/s each | **99.4 tok/s** | **99% efficiency** | + +### 🏆 Key Achievement +**DGX Spark + MPS delivers 2.2x higher aggregate throughput than RTX 5090 in multi-request scenarios!** + +## 🛠️ MPS Setup Instructions + +### 1. 
Start MPS Server + +```bash +# Set MPS directory +export CUDA_MPS_PIPE_DIRECTORY=/tmp/nvidia-mps +mkdir -p /tmp/nvidia-mps + +# Start MPS control daemon +sudo env "CUDA_MPS_PIPE_DIRECTORY=/tmp/nvidia-mps" nvidia-cuda-mps-control -d +``` + +### 2. Restart Ollama with MPS Support + +```bash +# Stop current Ollama +cd /path/to/ollama +docker compose down + +# Start Ollama with MPS environment +sudo env "CUDA_MPS_PIPE_DIRECTORY=/tmp/nvidia-mps" docker compose up -d +``` + +### 3. Verify MPS is Working + +```bash +# Check MPS processes +ps aux | grep mps + +# Expected output: +# root nvidia-cuda-mps-control -d +# root nvidia-cuda-mps-server -force-tegra + +# Check Ollama processes show M+C flag +nvidia-smi +# Look for M+C in the Type column for Ollama processes +``` + +### 4. Stop MPS (when needed) + +```bash +sudo nvidia-cuda-mps-control quit +``` + +## 🔬 Technical Architecture + +### CUDA MPS Architecture +``` +┌─────────────────────────────────────────┐ +│ GPU (Single CUDA Context) │ +│ ├── MPS Server (Resource Manager) │ +│ ├── Ollama Process 1 ──┐ │ +│ ├── Ollama Process 2 ──┼── Shared │ +│ └── Ollama Process 3 ──┘ Context │ +└─────────────────────────────────────────┘ +``` + +### Traditional Multi-Process Architecture +``` +┌─────────────────────────────────────────┐ +│ GPU │ +│ ├── Process 1 (Context 1) ─────────────│ +│ ├── Process 2 (Context 2) ─────────────│ +│ └── Process 3 (Context 3) ─────────────│ +│ ↑ Context Switching Overhead │ +└─────────────────────────────────────────┘ +``` + +## ⚖️ MPS vs Multiple API Servers Comparison + +### 🚀 CUDA MPS Advantages + +**Performance:** +- ✅ No context switching overhead (single shared context) +- ✅ Concurrent kernel execution from different processes +- ✅ Lower latency for small requests +- ✅ Better GPU utilization (kernels can overlap) + +**Memory Efficiency:** +- ✅ Shared GPU memory management +- ✅ No duplicate driver overhead per process +- ✅ More efficient memory allocation +- ✅ Can fit more models in same memory 
+ +**Resource Management:** +- ✅ Single point of GPU resource control +- ✅ Automatic load balancing across processes +- ✅ Better thermal management +- ✅ Unified monitoring and debugging + +### 🏢 Multiple API Servers Advantages + +**Isolation & Reliability:** +- ✅ Process isolation (one crash doesn't affect others) +- ✅ Independent scaling per service +- ✅ Different models can have different configurations +- ✅ Easier to update/restart individual services + +**Flexibility:** +- ✅ Different frameworks (vLLM, TensorRT-LLM, etc.) +- ✅ Per-service optimization +- ✅ Independent monitoring and logging +- ✅ Service-specific resource limits + +**Operational:** +- ✅ Standard container orchestration (K8s, Docker) +- ✅ Familiar DevOps patterns +- ✅ Load balancing at HTTP level +- ✅ Rolling updates and deployments + +## 🎯 Decision Framework + +### Use CUDA MPS When: +- 🏆 Maximum GPU utilization is critical +- ⚡ Low latency is paramount +- 💰 Cost optimization (more models per GPU) +- 🔄 Same framework/runtime (e.g., all Ollama) +- 📊 Predictable, homogeneous workloads +- 🎮 Single-tenant environments + +### Use Multiple API Servers When: +- 🛡️ High availability/fault tolerance required +- 🔧 Different models need different optimizations +- 📈 Independent scaling per service needed +- 🌐 Multi-tenant production environments +- 🔄 Frequent model updates/deployments +- 👥 Different teams managing different models + +## 📊 Performance Impact Analysis + +| Metric | CUDA MPS | Multiple Servers | +|--------|----------|------------------| +| Context Switch Overhead | ~0% | ~5-15% | +| Memory Efficiency | ~95% | ~80-85% | +| Latency (small requests) | Lower | Higher | +| Throughput (concurrent) | Higher | Lower | +| Fault Isolation | Lower | Higher | +| Operational Complexity | Lower | Higher | + +## 🔍 Memory Capacity Analysis + +### Model Memory Requirements +- **llama3.1:8b (Q4_K_M)**: ~4.9GB per instance + +### System Comparison +| System | Total Memory | Theoretical Max | Practical Max | 
+|--------|--------------|----------------|---------------| +| **RTX 5090** | 24GB VRAM | 4-5 models | 2-3 models | +| **DGX Spark** | 120GB Unified | 20+ models | 10+ models | + +### RTX 5090 Limitations: +- ❌ Limited to 24GB VRAM (hard ceiling) +- ❌ Driver overhead reduces available memory +- ❌ Memory fragmentation issues +- ❌ Thermal throttling under concurrent load +- ❌ Context switching still expensive + +### DGX Spark Advantages: +- ✅ 5x more memory capacity (120GB vs 24GB) +- ✅ Unified memory architecture +- ✅ Better thermal design for sustained loads +- ✅ Can scale to 10+ concurrent models +- ✅ No VRAM bottleneck + +## 🧪 Testing Concurrent Performance + +### Single Instance Baseline +```bash +curl -X POST http://localhost:11434/api/chat \ + -H "Content-Type: application/json" \ + -d '{ + "model": "llama3.1:8b", + "messages": [{"role": "user", "content": "Your prompt here"}], + "stream": false + }' +``` + +### Concurrent Testing +```bash +# Run multiple requests simultaneously +curl [request1] & curl [request2] & curl [request3] & wait +``` + +### Expected Results with MPS: +- **1 instance**: 33.3 tok/s +- **2 concurrent**: ~66.4 tok/s total (97% efficiency) +- **3 concurrent**: ~99.4 tok/s total (99% efficiency) + +## 🎯 Recommendations + +### For Triple Extraction Workloads: +**MPS is the optimal choice because:** +1. **Homogeneous workload** - same model (llama3.1:8b) +2. **Performance critical** - maximum throughput needed +3. **Cost optimization** - more concurrent requests per GPU +4. **Predictable usage** - biomedical triple extraction + +### Hybrid Approach: +Consider running: +- **MPS in production** for maximum throughput +- **Separate dev/test servers** for experimentation +- **Different models** on separate instances when needed + +## 🚨 Important Notes + +1. **MPS requires careful setup** - ensure proper environment variables +2. **Monitor GPU temperature** under heavy concurrent loads +3. **Test thoroughly** before production deployment +4. 
**Have fallback plan** to standard single-process mode +5. **Consider workload patterns** - MPS excels with consistent concurrent requests + +## 🔗 Related Files + +- `docker-compose.yml` - Ollama service configuration +- `ollama_gpu_benchmark.py` - Performance testing script +- `clear_cache_and_restart.sh` - Memory optimization script +- `gpu_memory_monitor.sh` - GPU monitoring script + +## 📚 Additional Resources + +- [NVIDIA MPS Documentation](https://docs.nvidia.com/deploy/mps/index.html) +- [CUDA Multi-Process Service Guide](https://docs.nvidia.com/cuda/mps/index.html) +- [Ollama Documentation](https://ollama.ai/docs) + +--- + +**Last Updated**: October 2, 2025 +**Tested On**: DGX Spark with 120GB unified memory, CUDA 13.0, Ollama latest diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/README_GPU_MONITORING.md b/nvidia/txt2kg/assets/deploy/services/ollama/README_GPU_MONITORING.md new file mode 100644 index 0000000..1ea560b --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/README_GPU_MONITORING.md @@ -0,0 +1,78 @@ +# Ollama GPU Memory Monitoring + +This setup includes automatic monitoring and fixing of GPU memory detection issues that can occur on unified memory systems (like DGX Spark, Jetson, etc.). + +## The Problem + +On unified memory systems, Ollama sometimes can't detect the full amount of available GPU memory due to buffer cache not being reclaimable. This causes models to fall back to CPU inference, dramatically reducing performance. + +**Symptoms:** +- Ollama logs show low "available" vs "total" GPU memory +- Models show mixed CPU/GPU processing instead of 100% GPU +- Performance is much slower than expected + +## The Solution + +This Docker Compose setup includes an optional GPU memory monitor that: + +1. **Monitors** Ollama's GPU memory detection every 60 seconds +2. **Detects** when available memory drops below 70% of total +3. **Automatically fixes** the issue by clearing buffer cache and restarting Ollama +4. 
**Logs** all actions for debugging + +## Usage + +### Standard Setup (Most Systems) +```bash +docker compose up -d +``` + +### Unified Memory Systems (DGX Spark, Jetson, etc.) +```bash +docker compose --profile unified-memory up -d +``` + +This will start both Ollama and the GPU memory monitor. + +## Configuration + +The monitor can be configured via environment variables: + +- `CHECK_INTERVAL=60` - How often to check (seconds) +- `MIN_AVAILABLE_PERCENT=70` - Threshold for triggering fixes (percentage) +- `AUTO_FIX=true` - Whether to automatically fix issues + +## Manual Commands + +You can still use the manual scripts if needed: + +```bash +# Check current GPU memory status +./monitor_gpu_memory.sh + +# Manually clear cache and restart +./clear_cache_and_restart.sh +``` + +## Monitoring Logs + +To see what the monitor is doing: + +```bash +docker logs ollama-gpu-monitor -f +``` + +## When to Use + +Use the unified memory profile if you experience: +- Inconsistent Ollama performance +- Models loading on CPU instead of GPU +- GPU memory showing as much lower than system RAM +- You're on a system with unified memory (DGX, Jetson, etc.) + +## Performance Impact + +The monitor has minimal performance impact: +- Runs one check every 60 seconds +- Only takes action when issues are detected +- Automatic fixes typically resolve issues within 30 seconds diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/clear_cache_and_restart.sh b/nvidia/txt2kg/assets/deploy/services/ollama/clear_cache_and_restart.sh new file mode 100755 index 0000000..9721e92 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/clear_cache_and_restart.sh @@ -0,0 +1,34 @@ +#!/bin/bash +# +# Clear buffer cache and restart Ollama to fix unified memory detection +# This script addresses the issue where Ollama can't see full GPU memory +# due to buffer cache not being reclaimable in unified memory systems +# + +set -e + +echo "🧹 Clearing system buffer cache..." 
+echo "Current memory status:" +free -h + +echo "Stopping Ollama container..." +docker compose -f /home/nvidia/txt2kg/txt2kg/deploy/services/ollama/docker-compose.yml down + +echo "Clearing buffer cache..." +sudo sync +sudo sh -c 'echo 1 > /proc/sys/vm/drop_caches' + +echo "Memory status after cache clear:" +free -h + +echo "Restarting Ollama container..." +docker compose -f /home/nvidia/txt2kg/txt2kg/deploy/services/ollama/docker-compose.yml up -d + +echo "Waiting for Ollama to start..." +sleep 10 + +echo "Checking GPU memory detection..." +timeout 30 bash -c 'while ! docker logs ollama-server 2>&1 | grep -q "inference compute"; do sleep 1; done' +docker logs ollama-server 2>&1 | grep "inference compute" | tail -1 + +echo "✅ Ollama restarted with cleared cache" diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/docker-compose.yml b/nvidia/txt2kg/assets/deploy/services/ollama/docker-compose.yml new file mode 100644 index 0000000..3851bb9 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/docker-compose.yml @@ -0,0 +1,66 @@ +version: '3.8' + +services: + ollama: + build: + context: . 
+ dockerfile: Dockerfile + image: ollama-custom:latest + container_name: ollama-server + ports: + - "11434:11434" + volumes: + - ollama_models:/root/.ollama + environment: + - OLLAMA_HOST=0.0.0.0:11434 + - OLLAMA_FLASH_ATTENTION=1 + - OLLAMA_KEEP_ALIVE=30m + - OLLAMA_CUDA=1 + # Performance tuning for large models like Llama3 70B + - OLLAMA_LLM_LIBRARY=cuda + - OLLAMA_NUM_PARALLEL=1 # Favor latency/stability for 70B; increase for smaller models + - OLLAMA_MAX_LOADED_MODELS=1 # Avoid VRAM contention + - OLLAMA_KV_CACHE_TYPE=q8_0 # Reduce KV cache VRAM with minimal perf impact + # Removed restrictive settings for 70B model testing: + # - OLLAMA_CONTEXT_LENGTH=8192 (let Ollama auto-detect) + # - OLLAMA_NUM_PARALLEL=4 (let Ollama decide) + # - OLLAMA_MAX_LOADED=1 (allow multiple models) + # - OLLAMA_NUM_THREADS=16 (may force CPU usage) + runtime: nvidia + deploy: + resources: + reservations: + devices: + - driver: nvidia + count: all + capabilities: [gpu] + restart: unless-stopped + healthcheck: + test: ["CMD", "curl", "-f", "http://localhost:11434/api/tags"] + interval: 30s + timeout: 10s + retries: 3 + start_period: 60s + + # GPU Memory Monitor - only for unified memory systems like DGX Spark + gpu-monitor: + build: + context: . 
+ dockerfile: Dockerfile.monitor + container_name: ollama-gpu-monitor + depends_on: + - ollama + volumes: + - /var/run/docker.sock:/var/run/docker.sock:ro + environment: + - CHECK_INTERVAL=60 # Check every 60 seconds + - MIN_AVAILABLE_PERCENT=70 # Alert if less than 70% GPU memory available + - AUTO_FIX=true # Automatically fix buffer cache issues + privileged: true # Required to clear buffer cache and restart containers + restart: unless-stopped + profiles: + - unified-memory # Only start with --profile unified-memory + +volumes: + ollama_models: + driver: local diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/entrypoint.sh b/nvidia/txt2kg/assets/deploy/services/ollama/entrypoint.sh new file mode 100644 index 0000000..52aed88 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/entrypoint.sh @@ -0,0 +1,42 @@ +#!/bin/bash +set -e + +# Start Ollama server in the background +echo "Starting Ollama server..." +/bin/ollama serve & +OLLAMA_PID=$! + +# Wait for Ollama to be ready +echo "Waiting for Ollama to be ready..." +max_attempts=30 +attempt=0 +while [ $attempt -lt $max_attempts ]; do + if curl -s http://localhost:11434/api/tags > /dev/null 2>&1; then + echo "Ollama is ready!" + break + fi + attempt=$((attempt + 1)) + sleep 2 +done + +if [ $attempt -eq $max_attempts ]; then + echo "ERROR: Ollama failed to start within the timeout period" + exit 1 +fi + +# Check if any models are present +echo "Checking for existing models..." +MODELS=$(curl -s http://localhost:11434/api/tags | grep -o '"models":\s*\[\]' || echo "has_models") + +if [[ "$MODELS" == *'"models": []'* ]]; then + echo "No models found. Pulling llama3.1:8b..." + /bin/ollama pull llama3.1:8b + echo "Successfully pulled llama3.1:8b" +else + echo "Models already exist, skipping pull." +fi + +# Keep the container running +echo "Setup complete. Ollama is running." 
+wait $OLLAMA_PID + diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/gpu_memory_monitor.sh b/nvidia/txt2kg/assets/deploy/services/ollama/gpu_memory_monitor.sh new file mode 100644 index 0000000..7cbfc6e --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/gpu_memory_monitor.sh @@ -0,0 +1,108 @@ +#!/bin/bash +# +# Ollama GPU Memory Monitor - runs inside a sidecar container +# Automatically detects and fixes unified memory buffer cache issues +# + +set -e + +# Configuration +CHECK_INTERVAL=${CHECK_INTERVAL:-60} # Check every 60 seconds +MIN_AVAILABLE_PERCENT=${MIN_AVAILABLE_PERCENT:-70} # Alert if less than 70% available +AUTO_FIX=${AUTO_FIX:-true} # Automatically fix issues + +log() { + echo "[$(date '+%Y-%m-%d %H:%M:%S')] $1" +} + +check_ollama_memory() { + # Wait for Ollama to be ready + if ! curl -s http://ollama:11434/api/tags > /dev/null 2>&1; then + log "Ollama not ready, skipping check" + return 0 + fi + + # Get Ollama logs to find inference compute info + local compute_log=$(docker logs ollama-server 2>&1 | grep "inference compute" | tail -1) + + if [ -z "$compute_log" ]; then + log "No inference compute logs found" + return 0 + fi + + # Extract memory info + local total_mem=$(echo "$compute_log" | grep -o 'total="[^"]*"' | cut -d'"' -f2) + local available_mem=$(echo "$compute_log" | grep -o 'available="[^"]*"' | cut -d'"' -f2) + + if [ -z "$total_mem" ] || [ -z "$available_mem" ]; then + log "Could not parse memory information" + return 0 + fi + + # Convert to numeric (assuming GiB) + local total_num=$(echo "$total_mem" | sed 's/ GiB//') + local available_num=$(echo "$available_mem" | sed 's/ GiB//') + + # Calculate percentage + local available_percent=$(echo "scale=1; $available_num * 100 / $total_num" | bc) + + log "GPU Memory: $available_mem / $total_mem available (${available_percent}%)" + + # Check if we need to take action + if (( $(echo "$available_percent < $MIN_AVAILABLE_PERCENT" | bc -l) )); then + log "WARNING: Low GPU memory 
availability detected (${available_percent}%)" + + if [ "$AUTO_FIX" = "true" ]; then + log "Attempting to fix by clearing buffer cache..." + fix_memory_issue + else + log "Auto-fix disabled. Manual intervention required." + fi + + return 1 + else + log "GPU memory availability OK (${available_percent}%)" + return 0 + fi +} + +fix_memory_issue() { + log "Clearing system buffer cache..." + + # Clear buffer cache from host (requires privileged container) + echo 1 > /proc/sys/vm/drop_caches 2>/dev/null || { + log "Cannot clear buffer cache from container. Trying host command..." + # Alternative: use nsenter to run on host + nsenter -t 1 -m -p sh -c 'sync && echo 1 > /proc/sys/vm/drop_caches' 2>/dev/null || { + log "Failed to clear buffer cache. Manual intervention required." + return 1 + } + } + + # Wait a moment + sleep 5 + + # Restart Ollama container + log "Restarting Ollama container..." + docker restart ollama-server + + # Wait for restart + sleep 15 + + log "Fix applied. Ollama should have better memory detection now." 
+} + +main() { + log "Starting Ollama GPU Memory Monitor" + log "Check interval: ${CHECK_INTERVAL}s, Min available: ${MIN_AVAILABLE_PERCENT}%, Auto-fix: ${AUTO_FIX}" + + while true; do + check_ollama_memory || true # Don't exit on check failures + sleep "$CHECK_INTERVAL" + done +} + +# Handle signals gracefully +trap 'log "Shutting down monitor..."; exit 0' SIGTERM SIGINT + +main diff --git a/nvidia/txt2kg/assets/deploy/services/ollama/monitor_gpu_memory.sh b/nvidia/txt2kg/assets/deploy/services/ollama/monitor_gpu_memory.sh new file mode 100755 index 0000000..49737fc --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/ollama/monitor_gpu_memory.sh @@ -0,0 +1,79 @@ +#!/bin/bash +# +# Monitor Ollama GPU memory usage and alert when buffer cache is consuming too much +# This helps detect when the unified memory issue is occurring +# + +set -e + +# Colors for output +RED='\033[0;31m' +GREEN='\033[0;32m' +YELLOW='\033[1;33m' +NC='\033[0m' # No Color + +# Thresholds +MIN_AVAILABLE_PERCENT=70 # Alert if less than 70% GPU memory available + +echo "🔍 Ollama GPU Memory Monitor" +echo "================================" + +# Check if Ollama container is running +if ! docker ps | grep -q ollama-server; then + echo -e "${RED}❌ Ollama container is not running${NC}" + exit 1 +fi + +# Get the latest inference compute log +COMPUTE_LOG=$(docker logs ollama-server 2>&1 | grep "inference compute" | tail -1) + +if [ -z "$COMPUTE_LOG" ]; then + echo -e "${YELLOW}⚠️ No inference compute logs found. 
Model may not be loaded.${NC}" + exit 1 +fi + +echo "Latest GPU memory status:" +echo "$COMPUTE_LOG" + +# Extract total and available memory +TOTAL_MEM=$(echo "$COMPUTE_LOG" | grep -o 'total="[^"]*"' | cut -d'"' -f2) +AVAILABLE_MEM=$(echo "$COMPUTE_LOG" | grep -o 'available="[^"]*"' | cut -d'"' -f2) + +# Convert to numeric values (assuming GiB) +TOTAL_NUM=$(echo "$TOTAL_MEM" | sed 's/ GiB//') +AVAILABLE_NUM=$(echo "$AVAILABLE_MEM" | sed 's/ GiB//') + +# Calculate percentage +AVAILABLE_PERCENT=$(echo "scale=1; $AVAILABLE_NUM * 100 / $TOTAL_NUM" | bc) + +echo "" +echo "Memory Analysis:" +echo " Total GPU Memory: $TOTAL_MEM" +echo " Available Memory: $AVAILABLE_MEM" +echo " Available Percentage: ${AVAILABLE_PERCENT}%" + +# Check if we need to alert +if (( $(echo "$AVAILABLE_PERCENT < $MIN_AVAILABLE_PERCENT" | bc -l) )); then + echo "" + echo -e "${RED}🚨 WARNING: Low GPU memory availability detected!${NC}" + echo -e "${RED} Only ${AVAILABLE_PERCENT}% of GPU memory is available${NC}" + echo -e "${YELLOW} This may cause models to run on CPU instead of GPU${NC}" + echo "" + echo "💡 Recommended action:" + echo " Run: ./clear_cache_and_restart.sh" + echo "" + + # Show current system memory usage + echo "Current system memory usage:" + free -h + + exit 1 +else + echo "" + echo -e "${GREEN}✅ GPU memory availability looks good (${AVAILABLE_PERCENT}%)${NC}" +fi + +# Show current model status +echo "" +echo "Current loaded models:" +docker exec ollama-server ollama ps diff --git a/nvidia/txt2kg/assets/deploy/services/sentence-transformers/Dockerfile b/nvidia/txt2kg/assets/deploy/services/sentence-transformers/Dockerfile new file mode 100644 index 0000000..4c775f0 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/sentence-transformers/Dockerfile @@ -0,0 +1,23 @@ +FROM python:3.9-slim + +WORKDIR /app + +# Copy requirements and install dependencies first for better caching +COPY requirements.txt /app/ +RUN pip install --no-cache-dir -r requirements.txt + +# Copy application 
code +COPY app.py /app/ + +# Set default model name +ENV MODEL_NAME="all-MiniLM-L6-v2" +ENV TRANSFORMERS_CACHE="/app/.cache" + +# Pre-download the model during build for faster startup +RUN python -c "from sentence_transformers import SentenceTransformer; SentenceTransformer('${MODEL_NAME}')" + +# Expose the port +EXPOSE 80 + +# Use Gunicorn for better performance +CMD ["gunicorn", "--bind", "0.0.0.0:80", "--workers", "1", "--threads", "8", "app:app"] \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/sentence-transformers/app.py b/nvidia/txt2kg/assets/deploy/services/sentence-transformers/app.py new file mode 100644 index 0000000..1f3cbc0 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/sentence-transformers/app.py @@ -0,0 +1,92 @@ +from flask import Flask, request, jsonify +from sentence_transformers import SentenceTransformer +import os +import time +import logging + +# Configure logging +logging.basicConfig(level=logging.INFO) +logger = logging.getLogger(__name__) + +app = Flask(__name__) + +# Get model name from environment variable +model_name = os.environ.get("MODEL_NAME", "all-MiniLM-L6-v2") +logger.info(f"Loading model: {model_name}") + +# Load model during startup +start_time = time.time() +try: + model = SentenceTransformer(model_name) + logger.info(f"Model loaded in {time.time() - start_time:.2f} seconds") +except Exception as e: + logger.error(f"Failed to load model: {e}") + raise + +@app.route("/health", methods=["GET"]) +def health(): + return jsonify({"status": "healthy", "model": model_name}) + +@app.route("/embed", methods=["POST"]) +def embed(): + try: + data = request.json + if not data: + return jsonify({"error": "No JSON data provided"}), 400 + + texts = data.get("texts", []) + if not texts: + return jsonify({"error": "No texts provided"}), 400 + + # Process in batches if needed + batch_size = data.get("batch_size", 32) + + start_time = time.time() + embeddings = model.encode(texts, 
batch_size=batch_size).tolist() + processing_time = time.time() - start_time + + logger.info(f"Processed {len(texts)} texts in {processing_time:.2f} seconds") + + return jsonify({ + "embeddings": embeddings, + "model": model_name, + "processing_time": processing_time + }) + except Exception as e: + logger.error(f"Error generating embeddings: {e}") + return jsonify({"error": str(e)}), 500 + +# Add compatibility with the /embeddings endpoint for the EmbeddingsService class +@app.route("/embeddings", methods=["POST"]) +def embeddings(): + try: + data = request.json + if not data: + return jsonify({"error": "No JSON data provided"}), 400 + + texts = data.get("input", []) + if not texts: + return jsonify({"error": "No input texts provided"}), 400 + + batch_size = data.get("batch_size", 32) + + start_time = time.time() + embeddings = model.encode(texts, batch_size=batch_size).tolist() + processing_time = time.time() - start_time + + # Format response for compatibility with the EmbeddingsService + response_data = { + "data": [{"embedding": embedding} for embedding in embeddings], + "model": model_name, + "processing_time": processing_time + } + + logger.info(f"Processed {len(texts)} texts in {processing_time:.2f} seconds for /embeddings endpoint") + + return jsonify(response_data) + except Exception as e: + logger.error(f"Error generating embeddings: {e}") + return jsonify({"error": str(e)}), 500 + +if __name__ == "__main__": + app.run(host="0.0.0.0", port=int(os.environ.get("PORT", 80))) \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/sentence-transformers/requirements.txt b/nvidia/txt2kg/assets/deploy/services/sentence-transformers/requirements.txt new file mode 100644 index 0000000..d77c26d --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/sentence-transformers/requirements.txt @@ -0,0 +1,6 @@ +sentence-transformers==2.3.1 +transformers==4.36.2 +torch==2.1.2 +flask==2.3.3 +gunicorn==21.2.0 +numpy==1.26.2 \ No newline at end of file 
diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/Dockerfile b/nvidia/txt2kg/assets/deploy/services/vllm/Dockerfile new file mode 100644 index 0000000..a07c298 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/Dockerfile @@ -0,0 +1,27 @@ +# Use NVIDIA Triton Inference Server with vLLM - optimized for latest NVIDIA hardware +FROM nvcr.io/nvidia/tritonserver:25.08-vllm-python-py3 + +# Install curl for health checks +RUN apt-get update && apt-get install -y curl && rm -rf /var/lib/apt/lists/* + +# Set working directory +WORKDIR /app + +# Copy the vLLM startup script +COPY launch_server.sh . + +# Make startup script executable +RUN chmod +x launch_server.sh + +# Create model directory +RUN mkdir -p /app/models + +# Expose the service port +EXPOSE 8001 + +# Health check +HEALTHCHECK --interval=30s --timeout=10s --start-period=60s --retries=3 \ + CMD curl -f http://localhost:8001/health || exit 1 + +# Start vLLM's built-in OpenAI API server directly +CMD ["./launch_server.sh"] diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/Dockerfile.benchmark b/nvidia/txt2kg/assets/deploy/services/vllm/Dockerfile.benchmark new file mode 100644 index 0000000..540fb76 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/Dockerfile.benchmark @@ -0,0 +1,21 @@ +FROM python:3.11-slim + +WORKDIR /app + +# Install required packages +RUN pip install --no-cache-dir \ + aiohttp \ + asyncio \ + statistics + +# Copy benchmark script +COPY vllm_llama3_benchmark.py /app/ + +# Create results directory +RUN mkdir -p /app/results + +# Make script executable +RUN chmod +x /app/vllm_llama3_benchmark.py + +# Default command +CMD ["python", "/app/vllm_llama3_benchmark.py", "--url", "http://vllm-llama3-8b:8001", "--output", "/app/results/benchmark_results.json"] diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/README.md b/nvidia/txt2kg/assets/deploy/services/vllm/README.md new file mode 100644 index 0000000..ccfdfde --- /dev/null +++ 
b/nvidia/txt2kg/assets/deploy/services/vllm/README.md @@ -0,0 +1,92 @@ +# vLLM NVFP4 Deployment + +This setup deploys the NVIDIA Llama 4 Scout model with NVFP4 quantization using vLLM, optimized for Blackwell and Hopper GPU architectures. + +## Quick Start + +1. **Set up your HuggingFace token:** + ```bash + cp env.example .env + # Edit .env and add your HF_TOKEN + ``` + +2. **Build and run:** + ```bash + docker-compose up --build + ``` + +3. **Test the deployment:** + ```bash + curl -X POST "http://localhost:8001/v1/chat/completions" \ + -H "Content-Type: application/json" \ + -d '{ + "model": "nvidia/Llama-4-Scout-17B-16E-Instruct-FP4", + "messages": [{"role": "user", "content": "Hello! How are you?"}], + "max_tokens": 100 + }' + ``` + +## Model Information + +- **Model**: `nvidia/Llama-4-Scout-17B-16E-Instruct-FP4` +- **Quantization**: NVFP4 (optimized for Blackwell architecture) +- **Alternative**: `nvidia/Llama-4-Scout-17B-16E-Instruct-FP8` (for Hopper architecture) + +## Performance Tuning + +The startup script automatically detects your GPU architecture and applies optimal settings: + +### Blackwell (Compute Capability 10.0) +- Enables FlashInfer backend +- Uses NVFP4 quantization +- Enables async scheduling +- Applies fusion optimizations + +### Hopper (Compute Capability 9.0) +- Uses FP8 quantization +- Disables async scheduling (due to vLLM limitations) +- Standard optimization settings + +### Configuration Options + +Adjust these environment variables in your `.env` file: + +- `VLLM_TENSOR_PARALLEL_SIZE`: Number of GPUs to use (default: 2) +- `VLLM_MAX_NUM_SEQS`: Batch size (default: 128) +- `VLLM_MAX_NUM_BATCHED_TOKENS`: Token batching limit (default: 8192) +- `VLLM_GPU_MEMORY_UTILIZATION`: GPU memory usage (default: 0.9) + +### Performance Scenarios + +- **Maximum Throughput**: `VLLM_TENSOR_PARALLEL_SIZE=1`, increase `VLLM_MAX_NUM_SEQS` +- **Minimum Latency**: `VLLM_TENSOR_PARALLEL_SIZE=4-8`, `VLLM_MAX_NUM_SEQS=8` +- **Balanced**: 
`VLLM_TENSOR_PARALLEL_SIZE=2`, `VLLM_MAX_NUM_SEQS=128` (default) + +## Benchmarking + +To benchmark performance: + +```bash +docker exec -it vllm-nvfp4-server vllm bench serve \ + --host 0.0.0.0 \ + --port 8001 \ + --model nvidia/Llama-4-Scout-17B-16E-Instruct-FP4 \ + --dataset-name random \ + --random-input-len 1024 \ + --random-output-len 1024 \ + --max-concurrency 128 \ + --num-prompts 1280 +``` + +## Requirements + +- NVIDIA GPU with Blackwell or Hopper architecture +- CUDA Driver 575 or above +- Docker with NVIDIA Container Toolkit +- HuggingFace token (for model access) + +## Troubleshooting + +- Check GPU compatibility: `nvidia-smi` +- View logs: `docker-compose logs -f vllm-nvfp4` +- Monitor GPU usage: `nvidia-smi -l 1` diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/build_image.sh b/nvidia/txt2kg/assets/deploy/services/vllm/build_image.sh new file mode 100755 index 0000000..4c902d8 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/build_image.sh @@ -0,0 +1,23 @@ +#!/bin/bash + +# Use latest stable vLLM release for better compute capability 12.1 support +# Clone the vLLM GitHub repo and use latest stable release. +git clone https://github.com/vllm-project/vllm.git /tmp/vllm-tutorial +cd /tmp/vllm-tutorial +git checkout $(git describe --tags --abbrev=0) + +# Build the docker image using official vLLM Dockerfile. +DOCKER_BUILDKIT=1 docker build . 
\ + --file docker/Dockerfile \ + --target vllm-openai \ + --build-arg CUDA_VERSION=12.8.1 \ + --build-arg max_jobs=8 \ + --build-arg nvcc_threads=2 \ + --build-arg RUN_WHEEL_CHECK=false \ + --build-arg torch_cuda_arch_list="10.0+PTX;12.1" \ + --build-arg vllm_fa_cmake_gpu_arches="100-real;121-real" \ + -t vllm/vllm-openai:deploy + +# Clean up +cd / +rm -rf /tmp/vllm-tutorial diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/docker-compose.llama3-8b.yml b/nvidia/txt2kg/assets/deploy/services/vllm/docker-compose.llama3-8b.yml new file mode 100644 index 0000000..f465158 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/docker-compose.llama3-8b.yml @@ -0,0 +1,100 @@ +services: + vllm-llama3-8b: + image: nvcr.io/nvidia/vllm:25.09-py3 + container_name: vllm-llama3-8b + ports: + - "8001:8001" + environment: + # Model configuration - Llama3 8B + - MODEL_NAME=meta-llama/Llama-3.1-8B-Instruct + - TENSOR_PARALLEL_SIZE=1 + - MAX_MODEL_LEN=4096 + - GPU_MEMORY_UTILIZATION=0.9 + + # Performance optimizations + - QUANTIZATION=fp8 + - KV_CACHE_DTYPE=fp8 + - ENABLE_CHUNKED_PREFILL=true + - MAX_NUM_BATCHED_TOKENS=8192 + - MAX_NUM_SEQS=256 + + # Service configuration + - HOST=0.0.0.0 + - PORT=8001 + - DISABLE_LOG_STATS=false + - DISABLE_LOG_REQUESTS=false + + # CUDA settings + - CUDA_VISIBLE_DEVICES=0 + - NCCL_DEBUG=INFO + + # Hugging Face settings + - HF_HOME=/app/.cache/huggingface + - TRANSFORMERS_CACHE=/app/.cache/huggingface/transformers + + volumes: + # Cache Hugging Face models for faster startup + - ~/.cache/huggingface:/app/.cache/huggingface + - /tmp:/tmp + + command: > + python -m vllm.entrypoints.openai.api_server + --model meta-llama/Llama-3.1-8B-Instruct + --host 0.0.0.0 + --port 8001 + --tensor-parallel-size 1 + --max-model-len 4096 + --gpu-memory-utilization 0.9 + --quantization fp8 + --kv-cache-dtype fp8 + --enable-chunked-prefill + --max-num-batched-tokens 8192 + --max-num-seqs 256 + --disable-log-stats + --trust-remote-code + + deploy: + resources: 
+ reservations: + devices: + - driver: nvidia + count: 1 + capabilities: [gpu] + + restart: unless-stopped + + healthcheck: + test: ["CMD", "curl", "-f", "http://localhost:8001/v1/models"] + interval: 30s + timeout: 10s + retries: 5 + start_period: 300s # 5 minutes for model loading + + networks: + - vllm-network + + # Benchmark runner service + vllm-benchmark: + build: + context: . + dockerfile: Dockerfile.benchmark + container_name: vllm-benchmark + depends_on: + vllm-llama3-8b: + condition: service_healthy + environment: + - VLLM_URL=http://vllm-llama3-8b:8001 + volumes: + - ./benchmark_results:/app/results + networks: + - vllm-network + profiles: + - benchmark # Only start when explicitly requested + +networks: + vllm-network: + driver: bridge + +volumes: + vllm_cache: + driver: local diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/docker-compose.yml b/nvidia/txt2kg/assets/deploy/services/vllm/docker-compose.yml new file mode 100644 index 0000000..3df17d9 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/docker-compose.yml @@ -0,0 +1,51 @@ +version: '3.8' + +services: + vllm-nvfp4: + build: + context: . 
+ dockerfile: Dockerfile + container_name: vllm-nvfp4-server + ports: + - "8001:8001" + environment: + # HuggingFace configuration + - HF_TOKEN=${HF_TOKEN} + - HF_HOME=/app/models/.cache + + volumes: + # Cache HuggingFace models locally + - ./models:/app/models + - huggingface_cache:/app/models/.cache + # Mount the launch script + - ./launch_server.sh:/app/launch_server.sh + + # NVIDIA recommended settings for PyTorch + ipc: host + ulimits: + memlock: -1 + stack: 67108864 + shm_size: 2gb + + deploy: + resources: + reservations: + devices: + - driver: nvidia + count: all + capabilities: [gpu] + + restart: unless-stopped + + entrypoint: ["/bin/bash", "/app/launch_server.sh"] + + healthcheck: + test: ["CMD", "curl", "-f", "http://localhost:8001/health"] + interval: 30s + timeout: 10s + retries: 3 + start_period: 120s + +volumes: + huggingface_cache: + driver: local diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/launch_server.sh b/nvidia/txt2kg/assets/deploy/services/vllm/launch_server.sh new file mode 100755 index 0000000..e89640c --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/launch_server.sh @@ -0,0 +1,115 @@ +#!/bin/bash + +# Launch vLLM with NVIDIA Triton Inference Server optimized build +# This should have proper support for compute capability 12.1 (DGX Spark) + +# Enable unified memory usage for DGX Spark +export CUDA_MANAGED_FORCE_DEVICE_ALLOC=1 +export PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True + +# Enable CUDA unified memory and oversubscription +export CUDA_VISIBLE_DEVICES=0 +export PYTORCH_NO_CUDA_MEMORY_CACHING=0 + +# Force vLLM to use CPU offloading for large models +export VLLM_CPU_OFFLOAD_GB=50 +export VLLM_ALLOW_RUNTIME_LORA_UPDATES_WITH_SGD_LORA=1 +export VLLM_SKIP_WARMUP=0 + +# Optimized environment for performance +export VLLM_LOGGING_LEVEL=INFO +export PYTHONUNBUFFERED=1 + +# Enable CUDA optimizations +export VLLM_USE_MODELSCOPE=false + +# Enable unified memory in vLLM +export VLLM_USE_V1=0 + +# First, test basic CUDA 
functionality +echo "=== Testing CUDA functionality ===" +python3 -c " +import torch +print(f'PyTorch version: {torch.__version__}') +print(f'CUDA available: {torch.cuda.is_available()}') +if torch.cuda.is_available(): + print(f'CUDA version: {torch.version.cuda}') + print(f'GPU count: {torch.cuda.device_count()}') + for i in range(torch.cuda.device_count()): + props = torch.cuda.get_device_properties(i) + print(f'GPU {i}: {props.name} (compute capability {props.major}.{props.minor})') + # Try basic CUDA operation + try: + x = torch.randn(10, 10).cuda(i) + y = torch.matmul(x, x.T) + print(f'GPU {i}: Basic CUDA operations work') + except Exception as e: + print(f'GPU {i}: CUDA operation failed: {e}') +" + +echo "=== Starting optimized vLLM server ===" +# Optimized configuration for DGX Spark performance with NVFP4 quantization +# Available quantized models from NVIDIA +NVFP4_MODEL="nvidia/Llama-3.3-70B-Instruct-FP4" +NVFP8_MODEL="nvidia/Llama-3.1-8B-Instruct-FP8" +STANDARD_MODEL="meta-llama/Llama-3.1-70B-Instruct" + +# Check GPU compute capability for optimal quantization +COMPUTE_CAPABILITY=$(nvidia-smi -i 0 --query-gpu=compute_cap --format=csv,noheader,nounits 2>/dev/null || echo "unknown") +echo "Detected GPU compute capability: $COMPUTE_CAPABILITY" + +# Configure quantization based on GPU architecture +if [[ "$COMPUTE_CAPABILITY" == "12.1" ]] || [[ "$COMPUTE_CAPABILITY" == "10.0" ]]; then + # Blackwell/DGX Spark architecture - use standard 70B model with CPU offloading + echo "Using standard Llama-3.1-70B model for Blackwell/DGX Spark with CPU offloading" + QUANTIZATION_FLAG="" + MODEL_TO_USE="$STANDARD_MODEL" # Use standard 70B model + GPU_MEMORY_UTIL="0.7" # Lower GPU memory to allow unified memory + MAX_MODEL_LEN="4096" # Shorter sequences for memory efficiency + MAX_NUM_SEQS="16" # Lower concurrent sequences for 70B + MAX_BATCHED_TOKENS="4096" + CPU_OFFLOAD_GB="50" # Offload 50GB to CPU/unified memory +elif [[ "$COMPUTE_CAPABILITY" == "9.0" ]]; then + # 
Hopper architecture - use standard model + echo "Using standard 70B model for Hopper architecture" + QUANTIZATION_FLAG="" + MODEL_TO_USE="$STANDARD_MODEL" + GPU_MEMORY_UTIL="0.7" + MAX_MODEL_LEN="4096" + MAX_NUM_SEQS="16" + MAX_BATCHED_TOKENS="4096" + CPU_OFFLOAD_GB="40" +else + # Other architectures - use standard precision + echo "Using standard 70B model for GPU architecture: $COMPUTE_CAPABILITY" + QUANTIZATION_FLAG="" + MODEL_TO_USE="$STANDARD_MODEL" + GPU_MEMORY_UTIL="0.7" + MAX_MODEL_LEN="2048" + MAX_NUM_SEQS="16" + MAX_BATCHED_TOKENS="2048" + CPU_OFFLOAD_GB="40" +fi + +echo "Using model: $MODEL_TO_USE" +echo "Quantization: ${QUANTIZATION_FLAG:-'disabled'}" +echo "GPU memory utilization: $GPU_MEMORY_UTIL" + +echo "CPU Offload: ${CPU_OFFLOAD_GB}GB" + +vllm serve "$MODEL_TO_USE" \ + --host 0.0.0.0 \ + --port 8001 \ + --tensor-parallel-size 1 \ + --max-model-len "$MAX_MODEL_LEN" \ + --max-num-seqs "$MAX_NUM_SEQS" \ + --max-num-batched-tokens "$MAX_BATCHED_TOKENS" \ + --gpu-memory-utilization "$GPU_MEMORY_UTIL" \ + --cpu-offload-gb "$CPU_OFFLOAD_GB" \ + --kv-cache-dtype auto \ + --trust-remote-code \ + --served-model-name "$MODEL_TO_USE" \ + --enable-chunked-prefill \ + --disable-custom-all-reduce \ + --disable-async-output-proc \ + $QUANTIZATION_FLAG \ No newline at end of file diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/run_benchmark.sh b/nvidia/txt2kg/assets/deploy/services/vllm/run_benchmark.sh new file mode 100755 index 0000000..43208e7 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/run_benchmark.sh @@ -0,0 +1,199 @@ +#!/bin/bash + +# vLLM Llama3 8B Benchmark Runner +# Uses NVIDIA vLLM container for optimal performance + +set -e + +SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)" +VLLM_URL="http://localhost:8001" +RUNS=3 +MAX_TOKENS=512 +OUTPUT_FILE="" + +# Colors for output +RED='\033[0;31m' +GREEN='\033[0;32m' +YELLOW='\033[1;33m' +BLUE='\033[0;34m' +NC='\033[0m' # No Color + +print_header() { + echo -e 
"${BLUE}========================================${NC}" + echo -e "${BLUE} 🚀 vLLM Llama3 8B Benchmark Suite${NC}" + echo -e "${BLUE}========================================${NC}" +} + +print_usage() { + echo "Usage: $0 [OPTIONS]" + echo "" + echo "Options:" + echo " -u, --url URL vLLM service URL (default: http://localhost:8001)" + echo " -r, --runs NUMBER Number of runs per prompt (default: 3)" + echo " -t, --max-tokens NUM Maximum tokens to generate (default: 512)" + echo " -o, --output FILE Output file for detailed results (JSON)" + echo " -d, --docker Run using Docker Compose" + echo " -s, --start-service Start vLLM service first" + echo " -h, --health-check Only run health check" + echo " --help Show this help message" + echo "" + echo "Examples:" + echo " $0 # Run basic benchmark" + echo " $0 --docker --start-service # Start service and run benchmark in Docker" + echo " $0 -r 5 -t 1024 -o results.json # Custom settings with output file" + echo " $0 --health-check # Check if service is running" +} + +check_dependencies() { + if ! command -v python3 &> /dev/null; then + echo -e "${RED}❌ Python3 is required but not installed${NC}" + exit 1 + fi + + if ! python3 -c "import aiohttp, asyncio" &> /dev/null; then + echo -e "${YELLOW}⚠️ Installing required Python packages...${NC}" + pip3 install aiohttp asyncio + fi +} + +check_nvidia_docker() { + if ! command -v docker &> /dev/null; then + echo -e "${RED}❌ Docker is required but not installed${NC}" + exit 1 + fi + + if ! docker info | grep -q "nvidia"; then + echo -e "${YELLOW}⚠️ NVIDIA Docker runtime not detected. 
Make sure nvidia-container-toolkit is installed${NC}" + fi +} + +start_vllm_service() { + echo -e "${BLUE}🚀 Starting vLLM Llama3 8B service...${NC}" + + cd "$SCRIPT_DIR" + docker-compose -f docker-compose.llama3-8b.yml up -d vllm-llama3-8b + + echo -e "${YELLOW}⏳ Waiting for model to load (this may take several minutes)...${NC}" + + # Wait for service to be healthy + local max_attempts=60 # 10 minutes + local attempt=1 + + while [ $attempt -le $max_attempts ]; do + if curl -sf "$VLLM_URL/v1/models" > /dev/null 2>&1; then + echo -e "${GREEN}✅ vLLM service is ready!${NC}" + return 0 + fi + + echo -e "${YELLOW}⏳ Attempt $attempt/$max_attempts - waiting for service...${NC}" + sleep 10 + ((attempt++)) + done + + echo -e "${RED}❌ vLLM service failed to start within timeout${NC}" + echo -e "${YELLOW}📋 Checking service logs:${NC}" + docker-compose -f docker-compose.llama3-8b.yml logs vllm-llama3-8b + exit 1 +} + +run_benchmark() { + local cmd_args=("--url" "$VLLM_URL" "--runs" "$RUNS" "--max-tokens" "$MAX_TOKENS") + + if [ -n "$OUTPUT_FILE" ]; then + cmd_args+=("--output" "$OUTPUT_FILE") + fi + + if [ "$HEALTH_CHECK_ONLY" = true ]; then + cmd_args+=("--health-check-only") + fi + + echo -e "${BLUE}🧪 Running vLLM Llama3 8B benchmark...${NC}" + echo -e "${BLUE}URL: $VLLM_URL${NC}" + echo -e "${BLUE}Runs per prompt: $RUNS${NC}" + echo -e "${BLUE}Max tokens: $MAX_TOKENS${NC}" + + if [ "$USE_DOCKER" = true ]; then + # Run benchmark in Docker + cd "$SCRIPT_DIR" + docker-compose -f docker-compose.llama3-8b.yml run --rm vllm-benchmark \ + python /app/vllm_llama3_benchmark.py "${cmd_args[@]}" + else + # Run benchmark locally + python3 "$SCRIPT_DIR/vllm_llama3_benchmark.py" "${cmd_args[@]}" + fi +} + +# Parse command line arguments +USE_DOCKER=false +START_SERVICE=false +HEALTH_CHECK_ONLY=false + +while [[ $# -gt 0 ]]; do + case $1 in + -u|--url) + VLLM_URL="$2" + shift 2 + ;; + -r|--runs) + RUNS="$2" + shift 2 + ;; + -t|--max-tokens) + MAX_TOKENS="$2" + shift 2 + ;; + -o|--output) + 
OUTPUT_FILE="$2" + shift 2 + ;; + -d|--docker) + USE_DOCKER=true + shift + ;; + -s|--start-service) + START_SERVICE=true + shift + ;; + -h|--health-check) + HEALTH_CHECK_ONLY=true + shift + ;; + --help) + print_usage + exit 0 + ;; + *) + echo -e "${RED}❌ Unknown option: $1${NC}" + print_usage + exit 1 + ;; + esac +done + +# Main execution +print_header + +if [ "$USE_DOCKER" = true ]; then + check_nvidia_docker + + if [ "$START_SERVICE" = true ]; then + start_vllm_service + fi + + run_benchmark +else + check_dependencies + + if [ "$START_SERVICE" = true ]; then + echo -e "${YELLOW}⚠️ --start-service only works with --docker flag${NC}" + exit 1 + fi + + run_benchmark +fi + +echo -e "${GREEN}✅ Benchmark completed successfully!${NC}" + +if [ -n "$OUTPUT_FILE" ] && [ -f "$OUTPUT_FILE" ]; then + echo -e "${BLUE}📊 Detailed results saved to: $OUTPUT_FILE${NC}" +fi diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/run_container.sh b/nvidia/txt2kg/assets/deploy/services/vllm/run_container.sh new file mode 100755 index 0000000..a56cb18 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/run_container.sh @@ -0,0 +1,4 @@ +#!/bin/bash + +# Follow the official vLLM tutorial run_container.sh exactly +docker run -e HF_TOKEN="$HF_TOKEN" -e HF_HOME="$HF_HOME" --ipc=host --gpus all --entrypoint "/bin/bash" --rm -it vllm/vllm-openai:deploy diff --git a/nvidia/txt2kg/assets/deploy/services/vllm/start-vllm.sh b/nvidia/txt2kg/assets/deploy/services/vllm/start-vllm.sh new file mode 100755 index 0000000..4b27093 --- /dev/null +++ b/nvidia/txt2kg/assets/deploy/services/vllm/start-vllm.sh @@ -0,0 +1,96 @@ +#!/bin/bash + +# vLLM startup script with NVFP4 quantization support for Llama 4 Scout +# Optimized for NVIDIA Blackwell and Hopper architectures + +set -e + +# Default configuration - using supported Llama 3.1 model for testing +VLLM_MODEL=${VLLM_MODEL:-"meta-llama/Llama-3.1-8B-Instruct"} +VLLM_PORT=${VLLM_PORT:-8001} +VLLM_HOST=${VLLM_HOST:-"0.0.0.0"} 
+VLLM_TENSOR_PARALLEL_SIZE=${VLLM_TENSOR_PARALLEL_SIZE:-2} +VLLM_MAX_MODEL_LEN=${VLLM_MAX_MODEL_LEN:-8192} +VLLM_GPU_MEMORY_UTILIZATION=${VLLM_GPU_MEMORY_UTILIZATION:-0.9} +VLLM_MAX_NUM_SEQS=${VLLM_MAX_NUM_SEQS:-128} +VLLM_MAX_NUM_BATCHED_TOKENS=${VLLM_MAX_NUM_BATCHED_TOKENS:-8192} +VLLM_KV_CACHE_DTYPE=${VLLM_KV_CACHE_DTYPE:-"auto"} + +# Detect GPU compute capability and set optimizations +COMPUTE_CAPABILITY=$(nvidia-smi -i 0 --query-gpu=compute_cap --format=csv,noheader 2>/dev/null || echo "unknown") + +echo "Starting vLLM service with the following configuration:" +echo "Model: $VLLM_MODEL" +echo "Port: $VLLM_PORT" +echo "Host: $VLLM_HOST" +echo "Tensor Parallel Size: $VLLM_TENSOR_PARALLEL_SIZE" +echo "Max Model Length: $VLLM_MAX_MODEL_LEN" +echo "Max Num Seqs: $VLLM_MAX_NUM_SEQS" +echo "Max Batched Tokens: $VLLM_MAX_NUM_BATCHED_TOKENS" +echo "GPU Memory Utilization: $VLLM_GPU_MEMORY_UTILIZATION" +echo "KV Cache Dtype: $VLLM_KV_CACHE_DTYPE" +echo "GPU Compute Capability: $COMPUTE_CAPABILITY" + +# Set up environment variables for optimal performance based on GPU architecture +if [ "$COMPUTE_CAPABILITY" = "10.0" ]; then + echo "Detected Blackwell architecture - enabling NVFP4 optimizations" + # Use FlashInfer backend for attentions + export VLLM_ATTENTION_BACKEND=FLASHINFER + # Use FlashInfer trtllm-gen attention kernels + export VLLM_USE_TRTLLM_ATTENTION=1 + # Use FlashInfer FP8/FP4 MoE + export VLLM_USE_FLASHINFER_MOE_FP8=1 + export VLLM_USE_FLASHINFER_MOE_FP4=1 + # Use FlashInfer trtllm-gen MoE backend + export VLLM_FLASHINFER_MOE_BACKEND="latency" + # Enable async scheduling + ASYNC_SCHEDULING_FLAG="--async-scheduling" + # Enable FlashInfer fusions + FUSION_FLAG='{"pass_config":{"enable_fi_allreduce_fusion":true,"enable_noop":true},"custom_ops":["+quant_fp8","+rms_norm"],"full_cuda_graph":true}' +elif [ "$COMPUTE_CAPABILITY" = "9.0" ]; then + echo "Detected Hopper architecture - enabling FP8 optimizations" + # Disable async scheduling on Hopper architecture due 
to vLLM limitations + ASYNC_SCHEDULING_FLAG="" + # Disable FlashInfer fusions since they are not supported on Hopper architecture + FUSION_FLAG="{}" +else + echo "GPU architecture not specifically optimized - using default settings" + ASYNC_SCHEDULING_FLAG="" + FUSION_FLAG="{}" +fi + +# Check GPU availability +if ! nvidia-smi > /dev/null 2>&1; then + echo "Warning: NVIDIA GPU not detected. vLLM may not work properly." +fi + +# Create model cache directory +mkdir -p /app/models + +echo "Starting vLLM's built-in OpenAI API server" + +# Build vLLM command with NVFP4 optimizations +VLLM_CMD="vllm serve $VLLM_MODEL \ + --host $VLLM_HOST \ + --port $VLLM_PORT \ + --tensor-parallel-size $VLLM_TENSOR_PARALLEL_SIZE \ + --max-model-len $VLLM_MAX_MODEL_LEN \ + --max-num-seqs $VLLM_MAX_NUM_SEQS \ + --max-num-batched-tokens $VLLM_MAX_NUM_BATCHED_TOKENS \ + --gpu-memory-utilization $VLLM_GPU_MEMORY_UTILIZATION \ + --kv-cache-dtype $VLLM_KV_CACHE_DTYPE \ + --trust-remote-code \ + --served-model-name $VLLM_MODEL" + +# Add async scheduling if supported +if [ -n "$ASYNC_SCHEDULING_FLAG" ]; then + VLLM_CMD="$VLLM_CMD $ASYNC_SCHEDULING_FLAG" +fi + +# Add fusion optimizations if available (the JSON contains no whitespace, so it +# survives the unquoted word splitting of 'exec $VLLM_CMD'; embedding literal +# single quotes here would be passed through to vLLM and break JSON parsing) +if [ "$FUSION_FLAG" != "{}" ]; then + VLLM_CMD="$VLLM_CMD --compilation-config $FUSION_FLAG" +fi + +# Start vLLM server +exec $VLLM_CMD diff --git a/nvidia/txt2kg/assets/examples/download_biorxiv_dataset.py b/nvidia/txt2kg/assets/examples/download_biorxiv_dataset.py new file mode 100644 index 0000000..a7cf2da --- /dev/null +++ b/nvidia/txt2kg/assets/examples/download_biorxiv_dataset.py @@ -0,0 +1,87 @@ +#!/usr/bin/env python3 +""" +Download and process the MTEB raw_biorxiv dataset for txt2kg demo. +Filter for genetics/genomics categories and create individual txt files.
+""" + +import os +import re +from pathlib import Path +from datasets import load_dataset + +def sanitize_filename(text, max_length=100): + """Convert text to a safe filename.""" + # Remove special characters and replace with underscores + filename = re.sub(r'[^\w\s-]', '', text) + filename = re.sub(r'[-\s]+', '_', filename) + filename = filename.strip('_') + + # Truncate if too long + if len(filename) > max_length: + filename = filename[:max_length] + + return filename + +def main(): + print("Loading MTEB raw_biorxiv dataset...") + + # Load the dataset + ds = load_dataset("mteb/raw_biorxiv") + + # Get the train split + train_data = ds['train'] + + print(f"Total dataset size: {len(train_data)} papers") + + # Filter for genetics or genomics categories + genetics_genomics_data = [] + for item in train_data: + category = item['category'].lower() + if 'genetic' in category or 'genomic' in category: + genetics_genomics_data.append(item) + + print(f"Found {len(genetics_genomics_data)} papers with genetics/genomics categories") + + if len(genetics_genomics_data) == 0: + # Let's check what categories are available + categories = set(item['category'] for item in train_data) + print("Available categories:") + for cat in sorted(categories): + print(f" - {cat}") + return + + # Create output directory + output_dir = Path("biorxiv_genetics_genomics") + output_dir.mkdir(exist_ok=True) + + print(f"Creating txt files in {output_dir}/") + + # Process each paper + for i, item in enumerate(genetics_genomics_data): + # Create filename from title and ID + title_part = sanitize_filename(item['title'], max_length=50) + paper_id = item['id'].replace('/', '_') + filename = f"{i+1:03d}_{title_part}_{paper_id}.txt" + + # Create file content + content = f"Title: {item['title']}\n" + content += f"ID: {item['id']}\n" + content += f"Category: {item['category']}\n" + content += f"\nAbstract:\n{item['abstract']}\n" + + # Write to file + file_path = output_dir / filename + with open(file_path, 'w', 
encoding='utf-8') as f: + f.write(content) + + print(f"Successfully created {len(genetics_genomics_data)} txt files in {output_dir}/") + + # Show some statistics + categories_found = set(item['category'] for item in genetics_genomics_data) + print(f"\nCategories included:") + for cat in sorted(categories_found): + count = sum(1 for item in genetics_genomics_data if item['category'] == cat) + print(f" - {cat}: {count} papers") + +if __name__ == "__main__": + main() diff --git a/nvidia/txt2kg/assets/examples/download_cc_biorxiv_dataset.py b/nvidia/txt2kg/assets/examples/download_cc_biorxiv_dataset.py new file mode 100644 index 0000000..bfa76b4 --- /dev/null +++ b/nvidia/txt2kg/assets/examples/download_cc_biorxiv_dataset.py @@ -0,0 +1,86 @@ +#!/usr/bin/env python3 +""" +Download and process the marianna13/biorxiv dataset for txt2kg demo. +Filter for Creative Commons licensed papers and create individual txt files. +""" + +import os +import re +from pathlib import Path +from datasets import load_dataset + +def sanitize_filename(text, max_length=100): + """Convert text to a safe filename.""" + # Remove special characters and replace with underscores + filename = re.sub(r'[^\w\s-]', '', text) + filename = re.sub(r'[-\s]+', '_', filename) + filename = filename.strip('_') + + # Truncate if too long + if len(filename) > max_length: + filename = filename[:max_length] + + return filename + +def main(): + print("Loading marianna13/biorxiv dataset...") + + # Load the dataset + ds = load_dataset("marianna13/biorxiv") + + # Get the train split + train_data = ds['train'] + + print(f"Total dataset size: {len(train_data)} papers") + + # Filter for Creative Commons licensed papers + cc_papers = train_data.filter(lambda x: x['LICENSE'] == 'creative-commons') + + print(f"Found {len(cc_papers)} Creative Commons licensed papers ({len(cc_papers)/len(train_data)*100:.1f}%)") + + # Take a sample for the demo (full dataset would be too large) + sample_size = min(1000, len(cc_papers)) # Limit 
to 1000 papers for demo + cc_sample = cc_papers.select(range(sample_size)) + + print(f"Using sample of {len(cc_sample)} papers for demo") + + # Create output directory + output_dir = Path("biorxiv_creative_commons") + output_dir.mkdir(exist_ok=True) + + print(f"Creating txt files in {output_dir}/") + + # Process each paper + for i, item in enumerate(cc_sample): + # Create filename from title and DOI + title_part = sanitize_filename(item['TITLE'], max_length=50) + doi_part = item['DOI'].replace('/', '_').replace('.', '_') + filename = f"{i+1:03d}_{title_part}_{doi_part}.txt" + + # Create file content with full text + content = f"Title: {item['TITLE']}\n" + content += f"DOI: {item['DOI']}\n" + content += f"Year: {item['YEAR']}\n" + content += f"Authors: {'; '.join(item['AUTHORS']) if item['AUTHORS'] else 'N/A'}\n" + content += f"License: {item['LICENSE']}\n" + content += f"\nFull Text:\n{item['TEXT']}\n" + + # Write to file + file_path = output_dir / filename + with open(file_path, 'w', encoding='utf-8') as f: + f.write(content) + + print(f"Successfully created {len(cc_sample)} txt files in {output_dir}/") + + # Show some statistics + years = [item['YEAR'] for item in cc_sample] + year_range = f"{min(years)} - {max(years)}" + + print(f"\nDataset Statistics:") + print(f" Year range: {year_range}") + print(f" License: Creative Commons (commercial use allowed)") + print(f" Content: Full paper text (not just abstracts)") + print(f" Average text length: {sum(len(item['TEXT']) for item in cc_sample) // len(cc_sample):,} characters") + +if __name__ == "__main__": + main() diff --git a/nvidia/txt2kg/assets/frontend/README.md b/nvidia/txt2kg/assets/frontend/README.md new file mode 100644 index 0000000..23bce5a --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/README.md @@ -0,0 +1,31 @@ +# Frontend Application + +This directory contains the Next.js frontend application for the txt2kg project. 
+ +## Structure + +- **app/**: Next.js app directory with pages and routes +- **components/**: React components +- **contexts/**: React context providers +- **hooks/**: Custom React hooks +- **lib/**: Utility functions and shared logic +- **public/**: Static assets +- **styles/**: CSS and styling files +- **types/**: TypeScript type definitions + +## Development + +To start the development server: + +```bash +cd frontend +npm install +npm run dev +``` + +## Building for Production + +```bash +cd frontend +npm run build +``` \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/.static/3d-force-graph-mock.js b/nvidia/txt2kg/assets/frontend/app/.static/3d-force-graph-mock.js new file mode 100644 index 0000000..218be2b --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/.static/3d-force-graph-mock.js @@ -0,0 +1 @@ +console.log('This is mock data to avoid SSR issues') diff --git a/nvidia/txt2kg/assets/frontend/app/api/backend/route.ts b/nvidia/txt2kg/assets/frontend/app/api/backend/route.ts new file mode 100644 index 0000000..178002a --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/backend/route.ts @@ -0,0 +1,90 @@ +import { NextRequest, NextResponse } from 'next/server'; +import remoteBackend from '@/lib/remote-backend'; +import type { Triple } from '@/types/graph'; +import { getGraphDbType } from '../settings/route'; + +/** + * Remote backend API that provides endpoints for creating and querying a knowledge graph + * using the selected graph database, Pinecone, and SentenceTransformer + */ + +/** + * Create a backend from triples + */ +export async function POST(request: NextRequest) { + try { + const { triples } = await request.json(); + + if (!triples || !Array.isArray(triples) || triples.length === 0) { + return NextResponse.json( + { error: 'Triples are required and must be a non-empty array' }, + { status: 400 } + ); + } + + // Initialize backend with the selected graph database type + if (!remoteBackend.isInitialized()) { + const 
graphDbType = getGraphDbType(); + console.log(`Initializing backend with graph DB type: ${graphDbType}`); + await remoteBackend.initialize(graphDbType); + } + + // Create backend from triples + await remoteBackend.createBackendFromTriples(triples); + + return NextResponse.json({ + success: true, + message: `Created backend successfully with ${triples.length} triples`, + graphDbType: getGraphDbType() + }); + } catch (error) { + console.error('Error creating backend from triples:', error); + const errorMessage = error instanceof Error ? error.message : 'Unknown error'; + return NextResponse.json({ error: errorMessage }, { status: 500 }); + } +} + +/** + * Query the backend with a given query text + */ +export async function GET(request: NextRequest) { + try { + const url = new URL(request.url); + const query = url.searchParams.get('query'); + + if (!query) { + return NextResponse.json({ error: 'Query parameter is required' }, { status: 400 }); + } + + // Parse optional parameters with fallbacks + const kNeighbors = parseInt(url.searchParams.get('kNeighbors') || '4096', 10); + const fanout = parseInt(url.searchParams.get('fanout') || '400', 10); + const numHops = parseInt(url.searchParams.get('numHops') || '2', 10); + + // Initialize backend with the selected graph database type + if (!remoteBackend.isInitialized()) { + const graphDbType = getGraphDbType(); + console.log(`Initializing backend with graph DB type: ${graphDbType}`); + await remoteBackend.initialize(graphDbType); + } + + // Query the backend + const relevantTriples = await remoteBackend.query(query, kNeighbors, fanout, numHops); + + return NextResponse.json({ + query, + triples: relevantTriples, + count: relevantTriples.length, + parameters: { + kNeighbors, + fanout, + numHops + }, + graphDbType: getGraphDbType() + }); + } catch (error) { + console.error('Error querying backend:', error); + const errorMessage = error instanceof Error ? 
error.message : 'Unknown error'; + return NextResponse.json({ error: errorMessage }, { status: 500 }); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/config/route.ts b/nvidia/txt2kg/assets/frontend/app/api/config/route.ts new file mode 100644 index 0000000..d94e619 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/config/route.ts @@ -0,0 +1,14 @@ +import { NextResponse } from "next/server"; + +export async function GET() { + // Only return the necessary configuration data + return NextResponse.json({ + nvidiaApiKey: process.env.NVIDIA_API_KEY || null, + // xaiApiKey removed - integration has been removed + ollamaBaseUrl: process.env.OLLAMA_BASE_URL || 'http://localhost:11434/v1', + ollamaModel: process.env.OLLAMA_MODEL || 'qwen3:1.7b', + vllmBaseUrl: process.env.VLLM_BASE_URL || 'http://localhost:8001/v1', + vllmModel: process.env.VLLM_MODEL || 'meta-llama/Llama-3.2-3B-Instruct', + // Add other config values as needed + }); +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/embeddings/route.ts b/nvidia/txt2kg/assets/frontend/app/api/embeddings/route.ts new file mode 100644 index 0000000..caf7bb3 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/embeddings/route.ts @@ -0,0 +1,133 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { EmbeddingsService } from '@/lib/embeddings'; +import { PineconeService } from '@/lib/pinecone'; + +/** + * Generate embeddings for text chunks and store them in Pinecone + */ +export async function POST(request: NextRequest) { + try { + const { documentId, content, documentName } = await request.json(); + + if (!content) { + return NextResponse.json( + { error: 'Document content is required' }, + { status: 400 } + ); + } + + // Initialize embedding service + const embeddingsService = EmbeddingsService.getInstance(); + + // Log which provider we're using + console.log(`Using embeddings provider: ${process.env.EMBEDDINGS_PROVIDER || 
'local'}`); + + // Generate chunks from content + const chunkSize = 200; // Size of each text chunk + const chunks = generateChunks(content, chunkSize); + console.log(`Generated ${chunks.length} chunks from document`); + + // Create unique IDs for each chunk based on document name and chunk index + const docPrefix = documentName ? + documentName.replace(/[^a-zA-Z0-9]/g, '_').substring(0, 20) : + documentId ? documentId : 'doc'; + + const chunkIds = chunks.map((_, index) => `${docPrefix}_chunk_${index}`); + + // Generate embeddings for chunks + console.log('Generating embeddings for chunks...'); + const embeddings = await embeddingsService.encode(chunks); + console.log(`Generated ${embeddings.length} embeddings`); + + // Initialize PineconeService + const pineconeService = PineconeService.getInstance(); + + // Check if Pinecone server is running + const isPineconeRunning = await pineconeService.isPineconeRunning(); + if (!isPineconeRunning) { + return NextResponse.json( + { error: 'Pinecone server is not available. Please make sure it is running.' }, + { status: 503 } + ); + } + + if (!pineconeService.isInitialized()) { + try { + await pineconeService.initialize(); + } catch (initError) { + console.error('Error initializing Pinecone:', initError); + return NextResponse.json( + { error: `Failed to initialize Pinecone: ${initError instanceof Error ? 
initError.message : String(initError)}` }, + { status: 500 } + ); + } + } + + // Create maps for embeddings and text content + const entityEmbeddings = new Map(); + const textContent = new Map(); + + // Populate the maps + for (let i = 0; i < chunkIds.length; i++) { + entityEmbeddings.set(chunkIds[i], embeddings[i]); + textContent.set(chunkIds[i], chunks[i]); + } + + // Store embeddings in PineconeService with retry logic + try { + await pineconeService.storeEmbeddings(entityEmbeddings, textContent); + } catch (storeError) { + console.error('Error storing embeddings in Pinecone:', storeError); + return NextResponse.json( + { error: `Failed to store embeddings in Pinecone: ${storeError instanceof Error ? storeError.message : String(storeError)}` }, + { status: 500 } + ); + } + + return NextResponse.json({ + success: true, + documentId: documentId || 'unnamed', + chunks: chunks.length, + embeddings: embeddings.length + }); + + } catch (error) { + console.error('Error generating embeddings:', error); + return NextResponse.json( + { error: `Failed to generate embeddings: ${error instanceof Error ? 
error.message : String(error)}` }, + { status: 500 } + ); + } +} + +/** + * Generate chunks from text content + * @param content Text content + * @param chunkSize Size of each chunk + * @param overlap Overlap between chunks + * @returns Array of text chunks + */ +function generateChunks(content: string, chunkSize: number, overlap: number = 50): string[] { + const chunks: string[] = []; + const sentences = content.split(/(?<=[.!?])\s+/); + + let currentChunk = ''; + for (const sentence of sentences) { + // If adding this sentence would make the chunk too long, save the current chunk and start a new one + if (currentChunk.length + sentence.length > chunkSize && currentChunk.length > 0) { + chunks.push(currentChunk.trim()); + // Take the last part of the current chunk as overlap for the next chunk + const words = currentChunk.split(' '); + currentChunk = words.slice(Math.max(0, words.length - overlap)).join(' '); + } + + currentChunk += ' ' + sentence; + } + + // Add the last chunk if it's not empty + if (currentChunk.trim().length > 0) { + chunks.push(currentChunk.trim()); + } + + return chunks; +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/enhanced-query/route.ts b/nvidia/txt2kg/assets/frontend/app/api/enhanced-query/route.ts new file mode 100644 index 0000000..7533849 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/enhanced-query/route.ts @@ -0,0 +1,92 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { RemoteBackendService } from '@/lib/remote-backend'; + +/** + * API endpoint for enhanced RAG query with LangChain features + * POST /api/enhanced-query + */ +export async function POST(req: NextRequest) { + try { + // Parse request body + const body = await req.json(); + const { query, kNeighbors, fanout, numHops, topK, queryMode, useTraditional } = body; + + if (!query || typeof query !== 'string') { + return NextResponse.json({ error: 'Query is required' }, { status: 400 }); + } + + // Initialize the 
backend service + const backend = RemoteBackendService.getInstance(); + + // Prepare parameters with defaults + const params = { + kNeighbors: kNeighbors || 4096, + fanout: fanout || 400, + numHops: numHops || 2, + topK: topK || 5 + }; + + console.log(`Enhanced RAG query: "${query}" with params:`, params); + console.log(`Query mode: ${queryMode}, useTraditional: ${useTraditional}`); + + // Determine search method - if traditional is specified, use that + const shouldUseTraditional = useTraditional || (queryMode === 'traditional'); + + if (shouldUseTraditional) { + console.log('Using traditional search for enhanced query'); + // Call the regular query method with traditional flag + const relevantTriples = await backend.query( + query, + params.kNeighbors, + params.fanout, + params.numHops, + { + topk: params.topK, + topk_e: params.topK, + cost_e: 0.5, + num_clusters: 2 + }, + true // Use traditional search + ); + + // Return the results + return NextResponse.json({ + relevantTriples, + count: relevantTriples.length, + metadata: { + searchType: 'traditional' + }, + success: true + }); + } + + // Use the enhanced query with metadata for vector search + const { relevantTriples, queryMetadata } = await backend.enhancedQuery( + query, + params.kNeighbors, + params.fanout, + params.numHops, + { + topk: params.topK, + topk_e: params.topK, + cost_e: 0.5, + num_clusters: 2 + } + ); + + // Return the results + return NextResponse.json({ + relevantTriples, + count: relevantTriples.length, + metadata: queryMetadata, + success: true + }); + } catch (error) { + console.error('Error in enhanced RAG query:', error); + const errorMessage = error instanceof Error ? 
error.message : 'Unknown error'; + return NextResponse.json( + { error: `Failed to execute enhanced query: ${errorMessage}` }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/extract-triples/route.ts b/nvidia/txt2kg/assets/frontend/app/api/extract-triples/route.ts new file mode 100644 index 0000000..da1382b --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/extract-triples/route.ts @@ -0,0 +1,207 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { processDocument, TextProcessor } from '@/lib/text-processor'; +import { llmService } from '@/lib/llm-service'; + +// Configure route for dynamic operations and long-running requests +export const dynamic = 'force-dynamic'; +export const maxDuration = 1800; // 30 minutes for large model processing + +/** + * API endpoint for extracting triples from text using the LangChain-based pipeline + * POST /api/extract-triples + */ +export async function POST(req: NextRequest) { + const startTime = Date.now(); + console.log(`[${new Date().toISOString()}] extract-triples: Request received`); + + try { + // Parse request body + const body = await req.json(); + console.log(`[${new Date().toISOString()}] extract-triples: Body parsed, text length: ${body.text?.length || 0}`); + const { + text, + useLangChain = false, + useGraphTransformer = false, + systemPrompt, + extractionPrompt, + graphTransformerPrompt, + llmProvider, + ollamaModel, + ollamaBaseUrl, + vllmModel, + vllmBaseUrl + } = body; + + if (!text || typeof text !== 'string') { + return NextResponse.json({ error: 'Text is required' }, { status: 400 }); + } + + // If Ollama is specified, call llmService directly (avoid internal fetch timeout) + if (llmProvider === 'ollama') { + console.log(`[${new Date().toISOString()}] extract-triples: Processing with Ollama model: ${ollamaModel || 'llama3.1:8b'}`); + const llmStartTime = Date.now(); + + try { + const model = ollamaModel || 'llama3.1:8b'; + 
const messages = [ + { + role: 'system' as const, + content: 'You are a knowledge graph builder. Extract subject-predicate-object triples from text and return them as a JSON array.' + }, + { + role: 'user' as const, + content: `Extract triples from this text:\n\n${text}` + } + ]; + + console.log(`[${new Date().toISOString()}] extract-triples: Calling llmService.generateOllamaCompletion directly`); + const response = await llmService.generateOllamaCompletion( + model, + messages, + { temperature: 0.1, maxTokens: 8192 } + ); + + const llmDuration = ((Date.now() - llmStartTime) / 1000).toFixed(2); + console.log(`[${new Date().toISOString()}] extract-triples: LLM completion received after ${llmDuration}s, response length: ${response?.length || 0}`); + + // Parse the response to extract triples + let triples = []; + try { + const jsonMatch = response.match(/\[[\s\S]*\]/); + if (jsonMatch) { + triples = JSON.parse(jsonMatch[0]); + } else { + // Fallback parser + triples = parseTriplesFallback(response); + } + } catch (parseError) { + console.warn('Failed to parse JSON response, using fallback parser:', parseError); + triples = parseTriplesFallback(response); + } + + const totalDuration = ((Date.now() - llmStartTime) / 1000).toFixed(2); + console.log(`[${new Date().toISOString()}] extract-triples: Returning ${triples.length} triples, total duration: ${totalDuration}s`); + + return NextResponse.json({ + triples: triples.map((triple) => ({ + ...triple, + confidence: 0.8, + metadata: { + entityTypes: [], + source: text.substring(0, 100) + '...', + context: text.substring(0, 200) + '...', + extractionMethod: 'ollama', + model: model + } + })), + count: triples.length, + success: true, + method: 'ollama', + model: model + }); + } catch (llmError) { + const llmDuration = ((Date.now() - llmStartTime) / 1000).toFixed(2); + console.error(`[${new Date().toISOString()}] extract-triples: Ollama processing failed after ${llmDuration}s:`, llmError); + throw llmError; + } + } + + // If 
vLLM is specified, use the vLLM API endpoint + if (llmProvider === 'vllm') { + const vllmResponse = await fetch(`${req.nextUrl.origin}/api/vllm`, { + method: 'POST', + headers: { + 'Content-Type': 'application/json', + }, + body: JSON.stringify({ + text, + model: vllmModel || 'meta-llama/Llama-3.2-3B-Instruct', + temperature: 0.1, + maxTokens: 8192 + }) + }); + + if (!vllmResponse.ok) { + throw new Error(`vLLM API error: ${vllmResponse.statusText}`); + } + + const vllmResult = await vllmResponse.json(); + return NextResponse.json(vllmResult); + } + + // Configure TextProcessor for the specified LLM provider + const processor = TextProcessor.getInstance(); + if (llmProvider && ['ollama', 'nvidia', 'vllm'].includes(llmProvider)) { + processor.setLLMProvider(llmProvider as 'ollama' | 'nvidia' | 'vllm', { + ollamaModel: ollamaModel, + ollamaBaseUrl: ollamaBaseUrl, + vllmModel: vllmModel, + vllmBaseUrl: vllmBaseUrl + }); + } + + // Process the text to extract triples using either default pipeline or LangChain transformer + // When both useLangChain and useGraphTransformer are true, use the GraphTransformer + // When only useLangChain is true, use the default LangChain pipeline + // Pass custom prompts if provided + const options = { + systemPrompt, + extractionPrompt, + graphTransformerPrompt + }; + + const triples = await processDocument(text, useLangChain, useGraphTransformer, options); + + // Return the extracted triples + return NextResponse.json({ + triples, + count: triples.length, + success: true, + method: useGraphTransformer + ? 'langchain_graphtransformer' + : useLangChain + ? 'langchain_default' + : 'standard_pipeline', + llmProvider: processor.getLLMProvider(), + customPromptUsed: !!(systemPrompt || extractionPrompt || graphTransformerPrompt) + }); + } catch (error) { + console.error('Error in triple extraction:', error); + const errorMessage = error instanceof Error ? 
error.message : 'Unknown error'; + return NextResponse.json( + { error: `Failed to extract triples: ${errorMessage}` }, + { status: 500 } + ); + } +} + +// Helper function to parse triples from text when JSON parsing fails +function parseTriplesFallback(text: string): Array<{subject: string, predicate: string, object: string}> { + const triples = []; + const lines = text.split('\n'); + + for (const line of lines) { + // Look for patterns like "Subject - Predicate - Object" or similar + const tripleMatch = line.match(/^[\s\-\*\d\.]*(.+?)\s*[\-\|]\s*(.+?)\s*[\-\|]\s*(.+)$/); + if (tripleMatch) { + triples.push({ + subject: tripleMatch[1].trim(), + predicate: tripleMatch[2].trim(), + object: tripleMatch[3].trim() + }); + } + + // Also look for JSON-like objects in the text + const jsonObjectMatch = line.match(/\{\s*"subject"\s*:\s*"([^"]+)"\s*,\s*"predicate"\s*:\s*"([^"]+)"\s*,\s*"object"\s*:\s*"([^"]+)"\s*\}/); + if (jsonObjectMatch) { + triples.push({ + subject: jsonObjectMatch[1], + predicate: jsonObjectMatch[2], + object: jsonObjectMatch[3] + }); + } + } + + return triples; +} + diff --git a/nvidia/txt2kg/assets/frontend/app/api/fix-query-logs/route.ts b/nvidia/txt2kg/assets/frontend/app/api/fix-query-logs/route.ts new file mode 100644 index 0000000..525cc67 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/fix-query-logs/route.ts @@ -0,0 +1,117 @@ +import { NextRequest, NextResponse } from 'next/server'; +import queryLoggerService, { QueryLogEntry } from '@/lib/query-logger'; +import fs from 'fs'; +import path from 'path'; +import { promises as fsPromises } from 'fs'; + +interface QueryLogData { + query: string; + count: number; +} + +interface FixResults { + fixed: number; + data: QueryLogData[]; +} + +/** + * API endpoint to check and fix query logs + */ +export async function GET(request: NextRequest) { + try { + console.log('Checking and fixing query logs'); + + // Initialize logger if not already + if (!queryLoggerService.isInitialized()) { + await 
queryLoggerService.initialize(); + } + + let results: FixResults = { fixed: 0, data: [] }; + + try { + // Get the log file path + const logFilePath = path.join(process.cwd(), 'data', 'query-logs.json'); + + // Check if log file exists + if (!fs.existsSync(logFilePath)) { + console.log('Log file does not exist, creating empty file'); + await fsPromises.mkdir(path.dirname(logFilePath), { recursive: true }); + await fsPromises.writeFile(logFilePath, JSON.stringify([])); + return NextResponse.json({ + success: true, + results, + message: 'Created new empty log file' + }); + } + + // Read existing logs + const logsRaw = await fsPromises.readFile(logFilePath, 'utf-8'); + let logs: QueryLogEntry[] = JSON.parse(logsRaw || '[]'); + + console.log(`Found ${logs.length} query log entries`); + + // Create a summary of existing logs + const querySummary = new Map(); + logs.forEach(log => { + const count = querySummary.get(log.query) || 0; + querySummary.set(log.query, count + 1); + }); + + // Convert to array for response + results.data = Array.from(querySummary.entries()).map(([query, count]) => ({ + query, + count + })); + + // If there are no logs, add a default test log + if (logs.length === 0) { + console.log('No logs found, adding a default test log'); + + const defaultLog: QueryLogEntry = { + query: 'Test query for metrics', + queryMode: 'traditional', + timestamp: new Date().toISOString(), + metrics: { + executionTimeMs: 0, + relevanceScore: 0, + precision: 0, + recall: 0, + resultCount: 0 + } + }; + + logs.push(defaultLog); + results.fixed++; + + // Update results data + results.data.push({ + query: defaultLog.query, + count: 1 + }); + + // Write back to file + await fsPromises.writeFile(logFilePath, JSON.stringify(logs, null, 2)); + console.log('Added default test log'); + } + + // Return the fixed results + return NextResponse.json({ + success: true, + results, + message: `Fixed ${results.fixed} query logs` + }); + } catch (error) { + console.error('Error during fix 
operation:', error); + return NextResponse.json({ + success: false, + error: error instanceof Error ? error.message : String(error) + }, { status: 500 }); + } + } catch (error) { + console.error('Error fixing query logs:', error); + return NextResponse.json({ + success: false, + error: error instanceof Error ? error.message : String(error) + }, { status: 500 }); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/graph-data/route.ts b/nvidia/txt2kg/assets/frontend/app/api/graph-data/route.ts new file mode 100644 index 0000000..26c9304 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/graph-data/route.ts @@ -0,0 +1,139 @@ +import { type NextRequest, NextResponse } from "next/server" + +// Utility function to generate UUID with fallback +const generateUUID = (): string => { + // Check if crypto.randomUUID is available + if (typeof crypto !== 'undefined' && crypto.randomUUID) { + try { + return crypto.randomUUID(); + } catch (error) { + console.warn('crypto.randomUUID failed, using fallback:', error); + } + } + + // Fallback UUID generation + return 'xxxxxxxx-xxxx-4xxx-yxxx-xxxxxxxxxxxx'.replace(/[xy]/g, function(c) { + const r = Math.random() * 16 | 0; + const v = c == 'x' ? 
r : (r & 0x3 | 0x8); + return v.toString(16); + }); +}; + +// Create a more persistent storage mechanism (still in-memory but more reliable) +// This will be a global variable that persists between API calls +// In a production environment, you would use a database instead +const graphDataStore = new Map() + +// Sample graph data for when no ID is provided +const sampleGraphData = { + nodes: [ + { id: "1", name: "Document 1", group: "document" }, + { id: "2", name: "Machine Learning", group: "concept" }, + { id: "3", name: "Neural Networks", group: "concept" }, + { id: "4", name: "Deep Learning", group: "concept" }, + { id: "5", name: "Computer Vision", group: "concept" }, + { id: "6", name: "Natural Language Processing", group: "concept" }, + { id: "7", name: "Reinforcement Learning", group: "concept" }, + { id: "8", name: "Supervised Learning", group: "concept" }, + { id: "9", name: "Unsupervised Learning", group: "concept" }, + { id: "10", name: "Semi-supervised Learning", group: "concept" }, + { id: "11", name: "Transfer Learning", group: "concept" }, + { id: "12", name: "GPT-4", group: "important" }, + { id: "13", name: "BERT", group: "concept" }, + { id: "14", name: "Transformers", group: "concept" }, + { id: "15", name: "CNN", group: "concept" }, + { id: "16", name: "RNN", group: "concept" }, + { id: "17", name: "LSTM", group: "concept" }, + { id: "18", name: "GAN", group: "concept" }, + { id: "19", name: "Diffusion Models", group: "important" }, + { id: "20", name: "Document 2", group: "document" }, + ], + links: [ + { source: "1", target: "2", name: "mentions" }, + { source: "1", target: "3", name: "discusses" }, + { source: "1", target: "4", name: "explains" }, + { source: "2", target: "3", name: "includes" }, + { source: "2", target: "4", name: "includes" }, + { source: "2", target: "5", name: "related_to" }, + { source: "2", target: "6", name: "related_to" }, + { source: "2", target: "7", name: "includes" }, + { source: "2", target: "8", name: 
"includes" }, + { source: "2", target: "9", name: "includes" }, + { source: "2", target: "10", name: "includes" }, + { source: "2", target: "11", name: "includes" }, + { source: "3", target: "15", name: "includes" }, + { source: "3", target: "16", name: "includes" }, + { source: "3", target: "17", name: "includes" }, + { source: "4", target: "12", name: "uses" }, + { source: "4", target: "13", name: "uses" }, + { source: "4", target: "14", name: "uses" }, + { source: "6", target: "12", name: "uses" }, + { source: "6", target: "13", name: "uses" }, + { source: "6", target: "14", name: "uses" }, + { source: "5", target: "15", name: "uses" }, + { source: "5", target: "18", name: "uses" }, + { source: "5", target: "19", name: "uses" }, + { source: "20", target: "6", name: "mentions" }, + { source: "20", target: "12", name: "discusses" }, + { source: "20", target: "19", name: "explains" }, + ] +}; + +export async function POST(request: NextRequest) { + try { + const { triples, documentName } = await request.json() + + if (!triples || !Array.isArray(triples)) { + return NextResponse.json({ error: "Invalid triples data" }, { status: 400 }) + } + + // Generate a unique ID for this graph data + const graphId = generateUUID() + + // Store the data + graphDataStore.set(graphId, { triples, documentName: documentName || "Unnamed Document" }) + + console.log(`Stored graph data with ID: ${graphId}, triples count: ${triples.length}`) + + // Return the ID + return NextResponse.json({ graphId }) + } catch (error) { + console.error("Error storing graph data:", error) + return NextResponse.json({ error: "Failed to store graph data" }, { status: 500 }) + } +} + +export async function GET(request: NextRequest) { + try { + const url = new URL(request.url) + const graphId = url.searchParams.get("id") + + // If no ID provided, return sample graph data + if (!graphId) { + console.log("No graph ID provided, returning sample data") + return NextResponse.json(sampleGraphData) + } + + 
console.log(`Retrieving graph data for ID: ${graphId}`) + console.log(`Available graph IDs: ${Array.from(graphDataStore.keys()).join(", ")}`) + + const graphData = graphDataStore.get(graphId) + + if (!graphData) { + console.log(`Graph data not found for ID: ${graphId}. Informing client to use localStorage.`) + // Instead of a redirect, return a special response that tells the client to use localStorage + return NextResponse.json({ + redirect: true, + useLocalStorage: true, + error: "Graph data not found or has expired" + }, { status: 404 }) + } + + console.log(`Found graph data with ${graphData.triples.length} triples`) + return NextResponse.json(graphData) + } catch (error) { + console.error("Error retrieving graph data:", error) + return NextResponse.json({ error: "Failed to retrieve graph data" }, { status: 500 }) + } +} + diff --git a/nvidia/txt2kg/assets/frontend/app/api/graph-db/clear/route.ts b/nvidia/txt2kg/assets/frontend/app/api/graph-db/clear/route.ts new file mode 100644 index 0000000..3434760 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/graph-db/clear/route.ts @@ -0,0 +1,41 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { getGraphDbService } from '@/lib/graph-db-util'; +import { getGraphDbType } from '../../settings/route'; +import { ArangoDBService } from '@/lib/arangodb'; +import { Neo4jService } from '@/lib/neo4j'; + +/** + * POST handler for clearing all data from the graph database + */ +export async function POST(request: NextRequest) { + try { + // Get the preferred database type from settings + const graphDbType = getGraphDbType(); + console.log(`Using graph database for clearing: ${graphDbType}`); + + // Get the appropriate service + const graphDbService = getGraphDbService(graphDbType); + + // Clear the database based on type + if (graphDbType === 'arangodb') { + const arangoService = graphDbService as ArangoDBService; + await arangoService.clearDatabase(); + } else if (graphDbType === 'neo4j') { + // TODO: 
Implement Neo4j clear functionality when needed + throw new Error('Clear database functionality not implemented for Neo4j'); + } + + // Return success response + return NextResponse.json({ + success: true, + message: `Successfully cleared all data from ${graphDbType} database`, + databaseType: graphDbType + }); + } catch (error) { + console.error(`Error in clear database handler:`, error); + return NextResponse.json( + { error: `Failed to clear database: ${error instanceof Error ? error.message : String(error)}` }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/graph-db/disconnect/route.ts b/nvidia/txt2kg/assets/frontend/app/api/graph-db/disconnect/route.ts new file mode 100644 index 0000000..5eac33b --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/graph-db/disconnect/route.ts @@ -0,0 +1,42 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { getGraphDbService } from '@/lib/graph-db-util'; +import { getGraphDbType } from '../../settings/route'; + +/** + * API endpoint for disconnecting from the selected graph database + * POST /api/graph-db/disconnect + */ +export async function POST(request: NextRequest) { + try { + // Get the graph database type from the settings + const graphDbType = getGraphDbType(); + console.log(`Disconnecting from ${graphDbType}...`); + + // Get the appropriate service + const graphDbService = getGraphDbService(graphDbType); + + if (graphDbService.isInitialized()) { + graphDbService.close(); + return NextResponse.json({ + success: true, + message: `Successfully disconnected from ${graphDbType}`, + type: graphDbType + }); + } else { + return NextResponse.json({ + success: false, + message: `No active ${graphDbType} connection to disconnect`, + type: graphDbType + }); + } + } catch (error) { + console.error('Error disconnecting from graph database:', error); + return NextResponse.json( + { + error: `Failed to disconnect from graph database: ${error 
instanceof Error ? error.message : String(error)}`, + type: getGraphDbType() + }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/graph-db/route.ts b/nvidia/txt2kg/assets/frontend/app/api/graph-db/route.ts new file mode 100644 index 0000000..690b919 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/graph-db/route.ts @@ -0,0 +1,155 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { getGraphDbService } from '@/lib/graph-db-util'; +import { getGraphDbType } from '../settings/route'; +import { GraphDBType } from '@/lib/graph-db-service'; + +/** + * Initialize graph database connection with parameters from request + * @param request Optional request containing connection parameters + */ +async function ensureConnection(request?: NextRequest): Promise { + try { + // Get the preferred database type from settings or request + let graphDbType: GraphDBType; + + if (request?.nextUrl.searchParams.has('type')) { + // Explicitly specified in the request + graphDbType = request.nextUrl.searchParams.get('type') as GraphDBType; + } else { + // Get from settings, with a safe fallback + graphDbType = getGraphDbType(); + } + + console.log(`Using graph database: ${graphDbType}`); + + // Get the appropriate service + const graphDbService = getGraphDbService(graphDbType); + + if (graphDbType === 'neo4j') { + // Neo4j connection params + let uri = process.env.NEO4J_URI; + let username = process.env.NEO4J_USER || process.env.NEO4J_USERNAME; + let password = process.env.NEO4J_PASSWORD; + + // Override with URL parameters if provided + if (request) { + const params = request.nextUrl.searchParams; + if (params.has('url')) uri = params.get('url') as string; + if (params.has('username')) username = params.get('username') as string; + if (params.has('password')) password = params.get('password') as string; + } + + // Connect to Neo4j instance + graphDbService.initialize(uri, username, password); + } else if 
(graphDbType === 'arangodb') { + // ArangoDB connection params - environment variables take absolute priority + let url = process.env.ARANGODB_URL; + let dbName = process.env.ARANGODB_DB; + let username = process.env.ARANGODB_USER; + let password = process.env.ARANGODB_PASSWORD; + + // Only use URL parameters if environment variables are not set + if (request) { + const params = request.nextUrl.searchParams; + if (!url && params.has('url')) url = params.get('url') as string; + if (!dbName && params.has('dbName')) dbName = params.get('dbName') as string; + if (!username && params.has('username')) username = params.get('username') as string; + if (!password && params.has('password')) password = params.get('password') as string; + } + + // Connect to ArangoDB instance + await (graphDbService as any).initialize(url, dbName, username, password); + } + + return graphDbType; + } catch (error) { + console.error(`Failed to initialize graph database connection:`, error); + throw error; + } +} + +/** + * GET handler for retrieving graph data from the selected graph database + */ +export async function GET(request: NextRequest) { + try { + // Initialize with connection parameters + const graphDbType = await ensureConnection(request); + const graphDbService = getGraphDbService(graphDbType); + + // Get graph data from the database + const graphData = await graphDbService.getGraphData(); + + // Transform to format expected by the frontend + const nodes = graphData.nodes.map(node => ({ + ...node, + name: node.name || `Node ${node.id}`, + label: node.labels?.[0] || 'Entity', + val: 1, // Default size + color: node.labels?.includes('Entity') ? '#ff6b6b' : '#4ecdc4' + })); + + const links = graphData.relationships.map(rel => ({ + ...rel, + label: rel.type || 'RELATED_TO' + })); + + // Get the connection URL from request params or env + const params = request.nextUrl.searchParams; + const connectionUrl = params.get('url') || + (graphDbType === 'neo4j' ? 
process.env.NEO4J_URI : process.env.ARANGODB_URL) || + 'Not specified'; + + // Convert to the format expected by the application + return NextResponse.json({ + nodes, + links, + connectionUrl, + databaseType: graphDbType + }); + } catch (error) { + console.error(`Error in graph database GET handler:`, error); + return NextResponse.json( + { error: `Failed to fetch graph data: ${error instanceof Error ? error.message : String(error)}` }, + { status: 500 } + ); + } +} + +/** + * POST handler for importing triples into the selected graph database + */ +export async function POST(request: NextRequest) { + try { + // Initialize with connection parameters + const graphDbType = await ensureConnection(request); + const graphDbService = getGraphDbService(graphDbType); + + // Parse request body + const body = await request.json(); + + // Validate request body + if (!body.triples || !Array.isArray(body.triples)) { + return NextResponse.json( + { error: 'Invalid request: triples array is required' }, + { status: 400 } + ); + } + + // Import triples into the graph database + await graphDbService.importTriples(body.triples); + + // Return success response + return NextResponse.json({ + success: true, + message: `Successfully imported ${body.triples.length} triples into ${graphDbType}`, + databaseType: graphDbType + }); + } catch (error) { + console.error(`Error in graph database POST handler:`, error); + return NextResponse.json( + { error: `Failed to import triples: ${error instanceof Error ? 
error.message : String(error)}` }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/graph-db/triples/route.ts b/nvidia/txt2kg/assets/frontend/app/api/graph-db/triples/route.ts new file mode 100644 index 0000000..f777a4d --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/graph-db/triples/route.ts @@ -0,0 +1,180 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { getGraphDbService } from '@/lib/graph-db-util'; +import { getGraphDbType } from '../../settings/route'; +import type { Triple } from '@/types/graph'; +import { GraphDBType } from '@/lib/graph-db-service'; + +/** + * API endpoint for fetching all triples from the selected graph database + * GET /api/graph-db/triples + */ +export async function GET(req: NextRequest) { + try { + // Get the database type from settings or request parameter + const graphDbType = req.nextUrl.searchParams.get('type') as GraphDBType || getGraphDbType(); + console.log(`Using graph database type: ${graphDbType}`); + + // Get the appropriate graph database service + const graphDbService = getGraphDbService(graphDbType); + + // Initialize the service based on database type + if (graphDbType === 'neo4j') { + // Neo4j specific initialization + const uri = process.env.NEO4J_URI; + const username = process.env.NEO4J_USER || process.env.NEO4J_USERNAME; + const password = process.env.NEO4J_PASSWORD; + graphDbService.initialize(uri, username, password); + } else if (graphDbType === 'arangodb') { + // ArangoDB specific initialization + const url = process.env.ARANGODB_URL; + const dbName = process.env.ARANGODB_DB; + const username = process.env.ARANGODB_USER; + const password = process.env.ARANGODB_PASSWORD; + await (graphDbService as any).initialize(url, dbName, username, password); + } + + console.log(`Fetching all triples from ${graphDbType}...`); + + // Get all triples from the graph database + // We'll use the graphDbService to get the graph data and then 
extract the triples + const graphData = await graphDbService.getGraphData(); + + // Extract triples from the graph data + const triples: Triple[] = []; + + // Map of node IDs to names + const nodeMap = new Map(); + for (const node of graphData.nodes) { + nodeMap.set(node.id, node.name); + } + + // Convert relationships to triples + for (const rel of graphData.relationships) { + const subject = nodeMap.get(rel.source); + const object = nodeMap.get(rel.target); + const predicate = rel.type; + + if (subject && predicate && object) { + triples.push({ + subject, + predicate, + object + }); + } + } + + // Deduplicate triples + const uniqueTriples = deduplicateTriples(triples); + + console.log(`Successfully fetched ${uniqueTriples.length} unique triples from ${graphDbType}`); + + // Return the triples + return NextResponse.json({ + success: true, + triples: uniqueTriples, + count: uniqueTriples.length, + databaseType: graphDbType + }); + + } catch (error) { + console.error(`Error fetching triples from graph database:`, error); + const errorMessage = error instanceof Error ? 
error.message : 'Unknown error'; + return NextResponse.json( + { error: `Failed to fetch triples: ${errorMessage}` }, + { status: 500 } + ); + } +} + +/** + * Helper function to deduplicate triples + */ +function deduplicateTriples(triples: Triple[]): Triple[] { + const seen = new Set(); + return triples.filter(triple => { + // Create a string key for this triple + const key = `${triple.subject.toLowerCase()}|${triple.predicate.toLowerCase()}|${triple.object.toLowerCase()}`; + + // Check if we've seen this triple before + if (seen.has(key)) { + return false; + } + + // Mark this triple as seen + seen.add(key); + return true; + }); +} + +/** + * API endpoint for storing triples in the selected graph database + * POST /api/graph-db/triples + */ +export async function POST(req: NextRequest) { + try { + // Parse request body + const body = await req.json(); + const { triples, documentName } = body; + + if (!triples || !Array.isArray(triples)) { + return NextResponse.json({ error: 'Triples are required' }, { status: 400 }); + } + + // Get the database type from settings or request parameter + const graphDbType = req.nextUrl.searchParams.get('type') as GraphDBType || getGraphDbType(); + console.log(`Using graph database type: ${graphDbType}`); + + console.log(`Storing ${triples.length} triples in ${graphDbType} from document "${documentName || 'unnamed'}"`); + + // Get the appropriate graph database service + const graphDbService = getGraphDbService(graphDbType); + + // Initialize the service based on database type + if (graphDbType === 'neo4j') { + // Neo4j specific initialization + const uri = process.env.NEO4J_URI; + const username = process.env.NEO4J_USER || process.env.NEO4J_USERNAME; + const password = process.env.NEO4J_PASSWORD; + graphDbService.initialize(uri, username, password); + } else if (graphDbType === 'arangodb') { + // ArangoDB specific initialization + const url = process.env.ARANGODB_URL; + const dbName = process.env.ARANGODB_DB; + const username = 
process.env.ARANGODB_USER; + const password = process.env.ARANGODB_PASSWORD; + await (graphDbService as any).initialize(url, dbName, username, password); + } + + // Filter triples to ensure they are valid + const validTriples = triples.filter((triple: any) => { + return ( + triple && + typeof triple.subject === 'string' && triple.subject.trim() !== '' && + typeof triple.predicate === 'string' && triple.predicate.trim() !== '' && + typeof triple.object === 'string' && triple.object.trim() !== '' + ); + }) as Triple[]; + + console.log(`Found ${validTriples.length} valid triples to store`); + + // Store triples in the graph database + await graphDbService.importTriples(validTriples); + + // Return success response + return NextResponse.json({ + success: true, + message: `Triples stored successfully in ${graphDbType}`, + count: validTriples.length, + documentName, + databaseType: graphDbType + }); + + } catch (error) { + console.error('Error storing triples in graph database:', error); + const errorMessage = error instanceof Error ? 
error.message : 'Unknown error'; + return NextResponse.json( + { error: `Failed to store triples: ${errorMessage}` }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/metrics/route.ts b/nvidia/txt2kg/assets/frontend/app/api/metrics/route.ts new file mode 100644 index 0000000..8f024c2 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/metrics/route.ts @@ -0,0 +1,138 @@ +import { NextRequest, NextResponse } from 'next/server'; +import remoteBackendInstance from '@/lib/remote-backend'; +import { Neo4jService } from '@/lib/neo4j'; +import neo4jService from '@/lib/neo4j'; +import { PineconeService } from '@/lib/pinecone'; +import RAGService from '@/lib/rag'; +import queryLoggerService, { QueryLogSummary } from '@/lib/query-logger'; + +/** + * Metrics API that provides performance statistics about the RAG system + */ +export async function GET(request: NextRequest) { + try { + // Initialize services + const neo4j = Neo4jService.getInstance(); + const pineconeService = PineconeService.getInstance(); + + if (!neo4j.isInitialized()) { + neo4j.initialize(); + } + + // Get graph stats from Neo4j + const graphData = await neo4j.getGraphData(); + + // Get unique entities (nodes) + const uniqueEntities = new Set(); + graphData.nodes.forEach((node: any) => uniqueEntities.add(node.name)); + + // Get total triples (relationships) + const totalTriples = graphData.relationships.length; + + // Get vector stats from Pinecone if available + let vectorStats = { + totalVectors: 0, + avgQueryTime: 0, + avgRelevanceScore: 0 + }; + + try { + await pineconeService.initialize(); + const stats = await pineconeService.getStats(); + + vectorStats = { + totalVectors: stats.totalVectorCount || 0, + avgQueryTime: stats.averageQueryTime || 0, + avgRelevanceScore: stats.averageRelevanceScore || 0 + }; + } catch (error) { + console.warn('Could not fetch Pinecone stats:', error); + } + + // Get real query logs instead of mock data + let 
queryLogs: QueryLogSummary[] = []; + let precision = 0; + let recall = 0; + let f1Score = 0; + let avgQueryTime = vectorStats.avgQueryTime || 0; + let avgRelevance = 0; + + // Get query logs from file-based logger instead of Neo4j + try { + // Initialize query logger if needed + if (!queryLoggerService.isInitialized()) { + await queryLoggerService.initialize(); + } + + // Get the logs + console.log('Getting query logs from file'); + queryLogs = await queryLoggerService.getQueryLogs(25); + console.log(`Found ${queryLogs.length} query logs from file-based logger`); + + // Calculate metrics from the query logs + if (queryLogs.length > 0) { + // Calculate metrics from logs with actual data + const logsWithMetrics = queryLogs.filter(log => + log.metrics.avgPrecision > 0 || + log.metrics.avgRecall > 0 || + log.metrics.avgExecutionTimeMs > 0 + ); + + const logsWithRelevance = queryLogs.filter(log => log.metrics.avgRelevanceScore > 0); + + if (logsWithMetrics.length > 0) { + precision = logsWithMetrics.reduce((sum, log) => sum + (log.metrics.avgPrecision || 0), 0) / logsWithMetrics.length; + recall = logsWithMetrics.reduce((sum, log) => sum + (log.metrics.avgRecall || 0), 0) / logsWithMetrics.length; + avgQueryTime = logsWithMetrics.reduce((sum, log) => sum + (log.metrics.avgExecutionTimeMs || 0), 0) / logsWithMetrics.length; + f1Score = precision > 0 && recall > 0 ? 2 * (precision * recall) / (precision + recall) : 0; + } + + if (logsWithRelevance.length > 0) { + avgRelevance = logsWithRelevance.reduce((sum, log) => sum + (log.metrics.avgRelevanceScore || 0), 0) / logsWithRelevance.length; + } + } + } catch (error) { + console.warn('Error getting query logs from file:', error); + // Keep values at 0 instead of using defaults + } + + // Get top queries from real logs + const topQueries = queryLogs.length > 0 + ? 
queryLogs + .sort((a, b) => b.count - a.count) + .slice(0, 5) + .map(log => ({ + query: log.query, + count: log.count + })) + : []; + + // Aggregate metrics + const metrics = { + totalTriples, + totalEntities: uniqueEntities.size, + avgQueryTime, + avgRelevance: avgRelevance || vectorStats.avgRelevanceScore || 0, // Use query log relevance score, fallback to vector stats + precision, + recall, + f1Score, + topQueries, + // Add metadata about query logs + queryLogStats: { + totalQueryLogs: queryLogs.length, + totalExecutions: queryLogs.reduce((sum, log) => sum + log.executionCount, 0), + lastQueriedAt: queryLogs.length > 0 ? queryLogs[0].lastQueried : null + } + }; + + return NextResponse.json(metrics); + } catch (error) { + console.error('Error fetching metrics:', error); + const errorMessage = error instanceof Error ? error.message : 'Unknown error'; + return NextResponse.json({ error: errorMessage }, { status: 500 }); + } +} + +/** + * Function to calculate precision and recall has been replaced by real data from query logs + */ \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/neo4j/disconnect/route.ts b/nvidia/txt2kg/assets/frontend/app/api/neo4j/disconnect/route.ts new file mode 100644 index 0000000..64efe5f --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/neo4j/disconnect/route.ts @@ -0,0 +1,46 @@ +import { NextRequest, NextResponse } from 'next/server'; + +/** + * Legacy Neo4j disconnect endpoint - redirects to the new graph-db/disconnect endpoint + * @deprecated Use /api/graph-db/disconnect instead with type=neo4j + */ +export async function POST(request: NextRequest) { + console.log('Redirecting from deprecated /api/neo4j/disconnect to /api/graph-db/disconnect?type=neo4j'); + + // Create the new URL with the neo4j type parameter + const url = new URL(request.url); + const newUrl = new URL('/api/graph-db/disconnect', url.origin); + + // Copy all query parameters + url.searchParams.forEach((value, key) => { + 
newUrl.searchParams.append(key, value); + }); + + // Add Neo4j type parameter if not present + if (!newUrl.searchParams.has('type')) { + newUrl.searchParams.append('type', 'neo4j'); + } + + // Clone the request with the new URL + const newRequest = new Request(newUrl, { + method: request.method, + headers: request.headers, + body: request.body, + cache: request.cache, + credentials: request.credentials, + integrity: request.integrity, + keepalive: request.keepalive, + mode: request.mode, + redirect: request.redirect, + referrer: request.referrer, + referrerPolicy: request.referrerPolicy, + signal: request.signal, + duplex: 'half', + } as RequestInit); + + // Fetch from the new endpoint + const response = await fetch(newRequest); + + // Return the response + return response; +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/neo4j/route.ts b/nvidia/txt2kg/assets/frontend/app/api/neo4j/route.ts new file mode 100644 index 0000000..cbd66da --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/neo4j/route.ts @@ -0,0 +1,105 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { Neo4jService } from '@/lib/neo4j'; + +// Initialize Neo4j service +const neo4jService = Neo4jService.getInstance(); + +// Initialize connection on first request +let isInitialized = false; + +/** + * Initialize Neo4j connection if not already initialized + * @param request Optional request containing connection parameters + */ +function ensureConnection(request?: NextRequest) { + try { + let uri = process.env.NEO4J_URI; + let username = process.env.NEO4J_USER; + let password = process.env.NEO4J_PASSWORD; + + // Override with URL parameters if provided + if (request) { + const params = request.nextUrl.searchParams; + if (params.has('url')) uri = params.get('url') as string; + if (params.has('username')) username = params.get('username') as string; + if (params.has('password')) password = params.get('password') as string; + } + + // Connect to Neo4j 
instance + neo4jService.initialize(uri, username, password); + isInitialized = true; + } catch (error) { + console.error('Failed to initialize Neo4j connection:', error); + throw error; + } +} + +/** + * Legacy Neo4j endpoint - redirects to the new graph-db endpoint + * @deprecated Use /api/graph-db instead with type=neo4j + */ +export async function GET(request: NextRequest) { + console.log('Redirecting from deprecated /api/neo4j to /api/graph-db?type=neo4j'); + + // Create the new URL with the same query parameters + const url = new URL(request.url); + const newUrl = new URL('/api/graph-db', url.origin); + + // Copy all query parameters + url.searchParams.forEach((value, key) => { + newUrl.searchParams.append(key, value); + }); + + // Add Neo4j type parameter if not present + if (!newUrl.searchParams.has('type')) { + newUrl.searchParams.append('type', 'neo4j'); + } + + // Return a redirect response + return NextResponse.redirect(newUrl); +} + +/** + * Legacy Neo4j POST endpoint - redirects to the new graph-db endpoint with a type parameter + * @deprecated Use /api/graph-db instead with type=neo4j + */ +export async function POST(request: NextRequest) { + console.log('Redirecting from deprecated /api/neo4j to /api/graph-db?type=neo4j'); + + // Create the new URL with the neo4j type parameter + const url = new URL(request.url); + const newUrl = new URL('/api/graph-db', url.origin); + + // Copy all query parameters + url.searchParams.forEach((value, key) => { + newUrl.searchParams.append(key, value); + }); + + // Add Neo4j type parameter if not present + if (!newUrl.searchParams.has('type')) { + newUrl.searchParams.append('type', 'neo4j'); + } + + // Clone the request with the new URL + const newRequest = new Request(newUrl, { + method: request.method, + headers: request.headers, + body: request.body, + cache: request.cache, + credentials: request.credentials, + integrity: request.integrity, + keepalive: request.keepalive, + mode: request.mode, + redirect: 
request.redirect, + referrer: request.referrer, + referrerPolicy: request.referrerPolicy, + signal: request.signal, + duplex: 'half', + } as RequestInit); + + // Fetch from the new endpoint + const response = await fetch(newRequest); + + // Return the response + return response; +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/neo4j/triples/route.ts b/nvidia/txt2kg/assets/frontend/app/api/neo4j/triples/route.ts new file mode 100644 index 0000000..3082a31 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/neo4j/triples/route.ts @@ -0,0 +1,71 @@ +import { NextRequest, NextResponse } from 'next/server'; + +/** + * Legacy Neo4j triples endpoint - redirects to the new graph-db/triples endpoint + * @deprecated Use /api/graph-db/triples instead with type=neo4j + */ +export async function GET(req: NextRequest) { + console.log('Redirecting from deprecated /api/neo4j/triples to /api/graph-db/triples?type=neo4j'); + + // Create the new URL with the same query parameters + const url = new URL(req.url); + const newUrl = new URL('/api/graph-db/triples', url.origin); + + // Copy all query parameters + url.searchParams.forEach((value, key) => { + newUrl.searchParams.append(key, value); + }); + + // Add Neo4j type parameter if not present + if (!newUrl.searchParams.has('type')) { + newUrl.searchParams.append('type', 'neo4j'); + } + + // Return a redirect response + return NextResponse.redirect(newUrl); +} + +/** + * Legacy Neo4j triples POST endpoint - redirects to the new graph-db/triples endpoint + * @deprecated Use /api/graph-db/triples instead with type=neo4j + */ +export async function POST(req: NextRequest) { + console.log('Redirecting from deprecated /api/neo4j/triples to /api/graph-db/triples?type=neo4j'); + + // Create the new URL with the neo4j type parameter + const url = new URL(req.url); + const newUrl = new URL('/api/graph-db/triples', url.origin); + + // Copy all query parameters + url.searchParams.forEach((value, key) => { + 
newUrl.searchParams.append(key, value); + }); + + // Add Neo4j type parameter if not present + if (!newUrl.searchParams.has('type')) { + newUrl.searchParams.append('type', 'neo4j'); + } + + // Clone the request with the new URL + const newRequest = new Request(newUrl, { + method: req.method, + headers: req.headers, + body: req.body, + cache: req.cache, + credentials: req.credentials, + integrity: req.integrity, + keepalive: req.keepalive, + mode: req.mode, + redirect: req.redirect, + referrer: req.referrer, + referrerPolicy: req.referrerPolicy, + signal: req.signal, + duplex: 'half', + } as RequestInit); + + // Fetch from the new endpoint + const response = await fetch(newRequest); + + // Return the response + return response; +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/ollama/batch/route.ts b/nvidia/txt2kg/assets/frontend/app/api/ollama/batch/route.ts new file mode 100644 index 0000000..da59cd0 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/ollama/batch/route.ts @@ -0,0 +1,184 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { llmService, LLMMessage } from '@/lib/llm-service'; + +/** + * API endpoint for batch Ollama operations + * POST /api/ollama/batch - Process multiple texts in batch for triple extraction + */ + +interface BatchTripleRequest { + texts: string[]; + model?: string; + temperature?: number; + maxTokens?: number; + concurrency?: number; +} + +export async function POST(req: NextRequest) { + try { + const { + texts, + model = 'qwen3:1.7b', + temperature = 0.1, + maxTokens = 8192, + concurrency = 5 + }: BatchTripleRequest = await req.json(); + + if (!texts || !Array.isArray(texts) || texts.length === 0) { + return NextResponse.json({ + error: 'Texts array is required and must not be empty' + }, { status: 400 }); + } + + if (texts.length > 100) { + return NextResponse.json({ + error: 'Batch size limited to 100 texts maximum' + }, { status: 400 }); + } + + // Validate all texts are 
strings + const invalidTexts = texts.filter(text => !text || typeof text !== 'string'); + if (invalidTexts.length > 0) { + return NextResponse.json({ + error: `Invalid texts found at indices: ${texts.map((text, i) => + (!text || typeof text !== 'string') ? i : null + ).filter(i => i !== null).join(', ')}` + }, { status: 400 }); + } + + console.log(`Starting batch triple extraction for ${texts.length} texts using model ${model}`); + + // Create system prompt for triple extraction + const systemPrompt = `You are a knowledge graph builder that extracts structured information from text. +Extract subject-predicate-object triples from the following text. + +Guidelines: +- Extract only factual triples present in the text +- Normalize entity names to their canonical form +- Return results in JSON format as an array of objects with "subject", "predicate", "object" fields +- Each triple should represent a clear relationship between two entities +- Focus on the most important relationships in the text`; + + // Prepare batch messages + const messagesBatch: LLMMessage[][] = texts.map(text => [ + { + role: 'system' as const, + content: systemPrompt + }, + { + role: 'user' as const, + content: `Extract triples from this text:\n\n${text}` + } + ]); + + // Process batch with Ollama + const batchResult = await llmService.generateOllamaBatchCompletion( + model, + messagesBatch, + { temperature, maxTokens, concurrency } + ); + + // Parse responses to extract triples + const processedResults = batchResult.results.map((response, index) => { + let triples = []; + + if (response) { + try { + // Try to parse as JSON first + const jsonMatch = response.match(/\[[\s\S]*\]/); + if (jsonMatch) { + triples = JSON.parse(jsonMatch[0]); + } else { + // Fallback: parse line by line + triples = parseTriplesFallback(response); + } + } catch (parseError) { + console.warn(`Failed to parse response for text ${index}:`, parseError); + triples = parseTriplesFallback(response); + } + } + + return { + 
textIndex: index, + originalText: texts[index].substring(0, 200) + (texts[index].length > 200 ? '...' : ''), + triples: triples.map((triple: any, tripleIndex: number) => ({ + ...triple, + confidence: 0.8, // Default confidence for Ollama extractions + metadata: { + entityTypes: [], + source: texts[index].substring(0, 100) + '...', + context: texts[index].substring(0, 200) + '...', + extractionMethod: 'ollama_batch', + model: model, + textIndex: index, + tripleIndex: tripleIndex + } + })), + tripleCount: triples.length, + success: !batchResult.errors.some(error => error.index === index) + }; + }); + + // Calculate summary statistics + const totalTriples = processedResults.reduce((sum, result) => sum + result.tripleCount, 0); + const successfulTexts = processedResults.filter(result => result.success).length; + + return NextResponse.json({ + results: processedResults, + summary: { + totalTexts: texts.length, + successfulTexts: successfulTexts, + failedTexts: batchResult.errors.length, + totalTriples: totalTriples, + averageTriples: successfulTexts > 0 ? (totalTriples / successfulTexts).toFixed(2) : 0 + }, + batchInfo: { + model: model, + concurrency: concurrency, + processingTime: Date.now(), // Could be enhanced with actual timing + method: 'ollama_batch' + }, + errors: batchResult.errors, + success: true + }); + } catch (error) { + console.error('Error in Ollama batch triple extraction:', error); + return NextResponse.json( + { + error: 'Failed to process batch triple extraction with Ollama', + details: error instanceof Error ? 
error.message : String(error) + }, + { status: 500 } + ); + } +} + +// Fallback parser for when JSON parsing fails (reused from single endpoint) +function parseTriplesFallback(text: string): Array<{subject: string, predicate: string, object: string}> { + const triples = []; + const lines = text.split('\n'); + + for (const line of lines) { + // Look for patterns like "Subject - Predicate - Object" or similar + const tripleMatch = line.match(/^[\s\-\*\d\.]*(.+?)\s*[\-\|]\s*(.+?)\s*[\-\|]\s*(.+)$/); + if (tripleMatch) { + triples.push({ + subject: tripleMatch[1].trim(), + predicate: tripleMatch[2].trim(), + object: tripleMatch[3].trim() + }); + } + + // Also look for JSON-like objects in the text + const jsonObjectMatch = line.match(/\{\s*"subject"\s*:\s*"([^"]+)"\s*,\s*"predicate"\s*:\s*"([^"]+)"\s*,\s*"object"\s*:\s*"([^"]+)"\s*\}/); + if (jsonObjectMatch) { + triples.push({ + subject: jsonObjectMatch[1], + predicate: jsonObjectMatch[2], + object: jsonObjectMatch[3] + }); + } + } + + return triples; +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/ollama/route.ts b/nvidia/txt2kg/assets/frontend/app/api/ollama/route.ts new file mode 100644 index 0000000..499e81e --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/ollama/route.ts @@ -0,0 +1,160 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { llmService } from '@/lib/llm-service'; + +// Configure route for dynamic operations and long-running requests +export const dynamic = 'force-dynamic'; +export const maxDuration = 1800; // 30 minutes for large model processing + +/** + * API endpoint for Ollama-specific operations + * GET /api/ollama - Test connection and list available models + * POST /api/ollama/extract-triples - Extract triples using Ollama model + */ + +export async function GET(req: NextRequest) { + try { + const { searchParams } = new URL(req.url); + const action = searchParams.get('action'); + + if (action === 'test-connection') { + const result = await 
llmService.testOllamaConnection(); + return NextResponse.json(result); + } + + // Default: test connection and return models + const result = await llmService.testOllamaConnection(); + return NextResponse.json(result); + } catch (error) { + console.error('Error in Ollama API:', error); + return NextResponse.json( + { + error: 'Failed to connect to Ollama server', + details: error instanceof Error ? error.message : String(error) + }, + { status: 500 } + ); + } +} + +export async function POST(req: NextRequest) { + const startTime = Date.now(); + console.log(`[${new Date().toISOString()}] /api/ollama: POST request received`); + + try { + const { text, model = 'qwen3:1.7b', temperature = 0.1, maxTokens = 8192 } = await req.json(); + console.log(`[${new Date().toISOString()}] /api/ollama: Parsed body - model: ${model}, text length: ${text?.length || 0}, maxTokens: ${maxTokens}`); + + if (!text || typeof text !== 'string') { + return NextResponse.json({ error: 'Text is required' }, { status: 400 }); + } + + // Use the LLM service to generate completion with Ollama + const messages = [ + { + role: 'system' as const, + content: `You are a knowledge graph builder that extracts structured information from text. +Extract subject-predicate-object triples from the following text. 
+ +Guidelines: +- Extract only factual triples present in the text +- Normalize entity names to their canonical form +- Return results in JSON format as an array of objects with "subject", "predicate", "object" fields +- Each triple should represent a clear relationship between two entities +- Focus on the most important relationships in the text` + }, + { + role: 'user' as const, + content: `Extract triples from this text:\n\n${text}` + } + ]; + + console.log(`[${new Date().toISOString()}] /api/ollama: Calling llmService.generateOllamaCompletion with model: ${model}`); + const llmStartTime = Date.now(); + + const response = await llmService.generateOllamaCompletion( + model, + messages, + { temperature, maxTokens } + ); + + const llmDuration = ((Date.now() - llmStartTime) / 1000).toFixed(2); + console.log(`[${new Date().toISOString()}] /api/ollama: LLM completion received after ${llmDuration}s, response length: ${response?.length || 0}`); + + // Parse the response to extract triples + let triples = []; + try { + // Try to parse as JSON first + const jsonMatch = response.match(/\[[\s\S]*\]/); + if (jsonMatch) { + triples = JSON.parse(jsonMatch[0]); + } else { + // Fallback: parse line by line + triples = parseTriplesFallback(response); + } + } catch (parseError) { + console.warn('Failed to parse JSON response, using fallback parser:', parseError); + triples = parseTriplesFallback(response); + } + + const totalDuration = ((Date.now() - startTime) / 1000).toFixed(2); + console.log(`[${new Date().toISOString()}] /api/ollama: Returning ${triples.length} triples, total duration: ${totalDuration}s`); + + return NextResponse.json({ + triples: triples.map((triple, index) => ({ + ...triple, + confidence: 0.8, // Default confidence for Ollama extractions + metadata: { + entityTypes: [], + source: text.substring(0, 100) + '...', + context: text.substring(0, 200) + '...', + extractionMethod: 'ollama', + model: model + } + })), + count: triples.length, + success: true, + 
method: 'ollama', + model: model + }); + } catch (error) { + const totalDuration = ((Date.now() - startTime) / 1000).toFixed(2); + console.error(`[${new Date().toISOString()}] /api/ollama: Error after ${totalDuration}s:`, error); + return NextResponse.json( + { + error: 'Failed to extract triples with Ollama', + details: error instanceof Error ? error.message : String(error) + }, + { status: 500 } + ); + } +} + +// Fallback parser for when JSON parsing fails +function parseTriplesFallback(text: string): Array<{subject: string, predicate: string, object: string}> { + const triples = []; + const lines = text.split('\n'); + + for (const line of lines) { + // Look for patterns like "Subject - Predicate - Object" or similar + const tripleMatch = line.match(/^[\s\-\*\d\.]*(.+?)\s*[\-\|]\s*(.+?)\s*[\-\|]\s*(.+)$/); + if (tripleMatch) { + triples.push({ + subject: tripleMatch[1].trim(), + predicate: tripleMatch[2].trim(), + object: tripleMatch[3].trim() + }); + } + + // Also look for JSON-like objects in the text + const jsonObjectMatch = line.match(/\{\s*"subject"\s*:\s*"([^"]+)"\s*,\s*"predicate"\s*:\s*"([^"]+)"\s*,\s*"object"\s*:\s*"([^"]+)"\s*\}/); + if (jsonObjectMatch) { + triples.push({ + subject: jsonObjectMatch[1], + predicate: jsonObjectMatch[2], + object: jsonObjectMatch[3] + }); + } + } + + return triples; +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/ollama/test/route.ts b/nvidia/txt2kg/assets/frontend/app/api/ollama/test/route.ts new file mode 100644 index 0000000..f0545d9 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/ollama/test/route.ts @@ -0,0 +1,82 @@ +import { NextRequest, NextResponse } from 'next/server'; + +/** + * Test endpoint for Ollama integration + * GET /api/ollama/test - Test Ollama functionality with sample data + */ + +export async function GET(req: NextRequest) { + try { + const sampleText = ` + Apple Inc. is a multinational technology company headquartered in Cupertino, California. 
+ The company was founded by Steve Jobs, Steve Wozniak, and Ronald Wayne in 1976. + Apple designs and develops consumer electronics, computer software, and online services. + Tim Cook is the current CEO of Apple Inc. + `; + + console.log('Testing Ollama with sample text...'); + + // Test connection first + const connectionResponse = await fetch(`${req.nextUrl.origin}/api/ollama?action=test-connection`); + const connectionResult = await connectionResponse.json(); + + if (!connectionResult.connected) { + return NextResponse.json({ + success: false, + error: 'Ollama connection failed', + details: connectionResult.error, + connectionTest: connectionResult + }); + } + + // Test triple extraction + const extractionResponse = await fetch(`${req.nextUrl.origin}/api/ollama`, { + method: 'POST', + headers: { + 'Content-Type': 'application/json', + }, + body: JSON.stringify({ + text: sampleText.trim(), + model: 'qwen3:1.7b', + temperature: 0.1, + maxTokens: 1024 + }) + }); + + if (!extractionResponse.ok) { + const errorText = await extractionResponse.text(); + return NextResponse.json({ + success: false, + error: 'Triple extraction failed', + details: errorText, + connectionTest: connectionResult + }); + } + + const extractionResult = await extractionResponse.json(); + + return NextResponse.json({ + success: true, + message: 'Ollama integration test completed successfully', + connectionTest: connectionResult, + extractionTest: { + inputText: sampleText.trim(), + triplesExtracted: extractionResult.triples?.length || 0, + sampleTriples: (extractionResult.triples || []).slice(0, 3), + method: extractionResult.method, + model: extractionResult.model + }, + fullResult: extractionResult + }); + } catch (error) { + console.error('Error in Ollama test:', error); + return NextResponse.json( + { + success: false, + error: 'Test failed with exception', + details: error instanceof Error ? 
error.message : String(error) + }, + { status: 500 } + ); + } +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/clear/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/clear/route.ts new file mode 100644 index 0000000..f73d7cf --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/clear/route.ts @@ -0,0 +1,27 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { PineconeService } from '@/lib/pinecone'; + +/** + * Clear all data from the Pinecone vector database + * POST /api/pinecone-diag/clear + */ +export async function POST() { + // Get the Pinecone service instance + const pineconeService = PineconeService.getInstance(); + + // Clear all vectors from the database + const deleteSuccess = await pineconeService.deleteAllEntities(); + + // Get updated stats after clearing + const stats = await pineconeService.getStats(); + + // Return response based on operation success + return NextResponse.json({ + success: deleteSuccess, + message: deleteSuccess + ? 
'Successfully cleared all data from Pinecone vector database' + : 'Failed to clear Pinecone database - service may not be available', + totalVectorCount: stats.totalVectorCount || 0, + httpHealthy: stats.httpHealthy || false + }); +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/create-index/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/create-index/route.ts new file mode 100644 index 0000000..7ce0f5c --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/create-index/route.ts @@ -0,0 +1,36 @@ +import { NextResponse } from 'next/server'; +import { PineconeService } from '@/lib/pinecone'; + +/** + * Create Pinecone index API endpoint + * POST /api/pinecone-diag/create-index + */ +export async function POST() { + try { + // Get the Pinecone service instance + const pineconeService = PineconeService.getInstance(); + + // Force re-initialization to create the index + (pineconeService as any).initialized = false; + await pineconeService.initialize(); + + // Check if initialization was successful by getting stats + const stats = await pineconeService.getStats(); + + return NextResponse.json({ + success: true, + message: 'Pinecone index created successfully', + httpHealthy: stats.httpHealthy || false + }); + } catch (error) { + console.error('Error creating Pinecone index:', error); + + return NextResponse.json( + { + success: false, + error: `Failed to create Pinecone index: ${error instanceof Error ? 
error.message : String(error)}` + }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/stats/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/stats/route.ts new file mode 100644 index 0000000..a1aa129 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pinecone-diag/stats/route.ts @@ -0,0 +1,42 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { PineconeService } from '@/lib/pinecone'; + +/** + * Get Pinecone vector database stats + */ +export async function GET() { + try { + // Initialize Pinecone service + const pineconeService = PineconeService.getInstance(); + + // We can now directly call getStats() which handles initialization and error recovery + const stats = await pineconeService.getStats(); + + return NextResponse.json({ + ...stats, + timestamp: new Date().toISOString() + }); + } catch (error) { + console.error('Error getting Pinecone stats:', error); + + // Return a successful response with error information + // This prevents the UI from breaking when Pinecone is unavailable + let errorMessage = error instanceof Error ? error.message : String(error); + + // More specific error message for 404 errors + if (errorMessage.includes('404')) { + errorMessage = 'Pinecone server returned 404. 
The server may not be running or the index does not exist.'; + } + + return NextResponse.json( + { + error: `Failed to get Pinecone stats: ${errorMessage}`, + totalVectorCount: 0, + source: 'error', + httpHealthy: false, + timestamp: new Date().toISOString() + }, + { status: 200 } // Use 200 instead of 500 to avoid UI errors + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/process-document/route.ts b/nvidia/txt2kg/assets/frontend/app/api/process-document/route.ts new file mode 100644 index 0000000..9182ff5 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/process-document/route.ts @@ -0,0 +1,154 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { RemoteBackendService } from '@/lib/remote-backend'; +import { EmbeddingsService } from '@/lib/embeddings'; +import type { Triple } from '@/types/graph'; +import { BackendService } from '@/lib/backend-service'; +import { getGraphDbType } from '../settings/route'; + +/** + * API endpoint for processing documents with LangChain, generating embeddings, + * and storing in the knowledge graph + * POST /api/process-document + */ +export async function POST(req: NextRequest) { + try { + // Parse request body + const body = await req.json(); + const { + text, + filename, + triples, + useLangChain, + useGraphTransformer, + systemPrompt, + extractionPrompt, + graphTransformerPrompt + } = body; + + if (!text || typeof text !== 'string') { + return NextResponse.json({ error: 'Text is required' }, { status: 400 }); + } + + if (!triples || !Array.isArray(triples)) { + return NextResponse.json({ error: 'Triples are required' }, { status: 400 }); + } + + // Initialize services + const backendService = RemoteBackendService.getInstance(); + const embeddingsService = EmbeddingsService.getInstance(); + + console.log(`🔍 API: Processing document "${filename || 'unnamed'}" (${text.length} chars)`); + console.log(`🔍 API: Processing ${triples.length} triples`); + console.log(`🔍 
API: Using LangChain for triple extraction: ${useLangChain ? 'Yes' : 'No'}`); + console.log(`🔍 API: First few triples:`, triples.slice(0, 3)); + if (useLangChain) { + console.log(`Using LLMGraphTransformer: ${useGraphTransformer ? 'Yes' : 'No'}`); + } + + // Log if custom prompts are being used + if (systemPrompt || extractionPrompt || graphTransformerPrompt) { + console.log('Using custom prompts for extraction'); + if (systemPrompt) console.log('Custom system prompt provided'); + if (extractionPrompt) console.log('Custom extraction prompt provided'); + if (graphTransformerPrompt) console.log('Custom graph transformer prompt provided'); + } + + // Filter triples to ensure they are valid + const validTriples = triples.filter((triple: any) => { + return ( + triple && + typeof triple.subject === 'string' && triple.subject.trim() !== '' && + typeof triple.predicate === 'string' && triple.predicate.trim() !== '' && + typeof triple.object === 'string' && triple.object.trim() !== '' + ); + }) as Triple[]; + + console.log(`Found ${validTriples.length} valid triples`); + + // If useLangChain flag is set, we'll extract triples using the LangChain route + let triplesForProcessing = validTriples; + + if (useLangChain && !filename?.toLowerCase().endsWith('.csv')) { + try { + console.log('Using LangChain for native triple extraction...'); + // Use absolute URL with origin from request to fix URL parsing error + const baseUrl = new URL(req.url).origin; + console.log(`Using base URL: ${baseUrl} for LangChain API call`); + + // Call the extract-triples endpoint with useLangChain flag and custom prompts + const requestBody: any = { + text, + useLangChain: true, + useGraphTransformer + }; + + // Add custom prompts if available + if (systemPrompt) requestBody.systemPrompt = systemPrompt; + if (extractionPrompt) requestBody.extractionPrompt = extractionPrompt; + if (graphTransformerPrompt) requestBody.graphTransformerPrompt = graphTransformerPrompt; + + const langchainResponse = await 
fetch(`${baseUrl}/api/extract-triples`, { + method: 'POST', + headers: { 'Content-Type': 'application/json' }, + body: JSON.stringify(requestBody) + }); + + if (!langchainResponse.ok) { + const errorText = await langchainResponse.text(); + console.error(`LangChain API error: ${langchainResponse.status} ${langchainResponse.statusText}`, errorText); + throw new Error(`LangChain extraction failed: ${langchainResponse.statusText} (${langchainResponse.status})`); + } + + const langchainResult = await langchainResponse.json(); + if (langchainResult.triples && Array.isArray(langchainResult.triples) && langchainResult.triples.length > 0) { + console.log(`Successfully extracted ${langchainResult.triples.length} triples using LangChain${useGraphTransformer ? ' with GraphTransformer' : ''}`); + triplesForProcessing = langchainResult.triples; + } else { + console.warn('LangChain extraction returned no triples, falling back to provided triples'); + } + } catch (langchainError) { + console.error('Error using LangChain for triple extraction:', langchainError); + console.log('Falling back to provided triples'); + } + } + + // Check if this is a CSV file - if so, skip processing + const isCSVFile = filename && filename.toLowerCase().endsWith('.csv'); + const isJSONFile = filename && filename.toLowerCase().endsWith('.json'); + + if (isCSVFile) { + console.log('CSV file detected, skipping text processor'); + // NOTE: Neo4j storage is no longer done automatically + // This is now handled manually through the "Store in Graph DB" button in the UI + } else if (isJSONFile) { + console.log('JSON file detected, processed as unstructured text document - embeddings can be generated manually via the UI'); + // NOTE: Automatic embeddings generation has been disabled for JSON files. + // Embeddings are now generated only when explicitly requested through the "Generate Embeddings" button in the UI. 
+ } else { + // Regular text processing flow - no automatic embeddings generation + console.log('Document processed successfully - embeddings can be generated manually via the UI'); + // NOTE: Automatic embeddings generation has been disabled. + // Embeddings are now generated only when explicitly requested through the "Generate Embeddings" button in the UI. + } + + // Return success response + return NextResponse.json({ + success: true, + message: 'Document processed successfully', + tripleCount: triplesForProcessing.length, + triples: triplesForProcessing, + documentName: filename || 'unnamed', + langchainUsed: useLangChain, + graphTransformerUsed: useGraphTransformer, + customPromptsUsed: !!(systemPrompt || extractionPrompt || graphTransformerPrompt), + graphDbType: getGraphDbType() + }); + } catch (error) { + console.error('Error processing document:', error); + const errorMessage = error instanceof Error ? error.message : 'Unknown error'; + return NextResponse.json( + { error: `Failed to process document: ${errorMessage}` }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/generate/[taskId]/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/generate/[taskId]/route.ts new file mode 100644 index 0000000..38c6536 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/generate/[taskId]/route.ts @@ -0,0 +1,42 @@ +import { NextRequest, NextResponse } from 'next/server' + +const PYGRAPHISTRY_SERVICE_URL = process.env.PYGRAPHISTRY_SERVICE_URL || 'http://localhost:8080' + +export async function GET(request: NextRequest, { params }: { params: { taskId: string } }) { + try { + const { taskId } = params + + // Forward the request to the PyGraphistry service + const response = await fetch(`${PYGRAPHISTRY_SERVICE_URL}/api/generate/${taskId}`, { + method: 'GET', + headers: { + 'Content-Type': 'application/json', + }, + }) + + if (!response.ok) { + const errorText = await 
response.text() + console.error('PyGraphistry service error:', errorText) + return NextResponse.json( + { + error: 'PyGraphistry service error', + details: errorText + }, + { status: response.status } + ) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Error forwarding to PyGraphistry service:', error) + return NextResponse.json( + { + error: 'Failed to connect to PyGraphistry service', + details: error instanceof Error ? error.message : 'Unknown error' + }, + { status: 500 } + ) + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/generate/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/generate/route.ts new file mode 100644 index 0000000..5405d2b --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/generate/route.ts @@ -0,0 +1,43 @@ +import { NextRequest, NextResponse } from 'next/server' + +const PYGRAPHISTRY_SERVICE_URL = process.env.PYGRAPHISTRY_SERVICE_URL || 'http://localhost:8080' + +export async function POST(request: NextRequest) { + try { + const body = await request.json() + + // Forward the request to the PyGraphistry service + const response = await fetch(`${PYGRAPHISTRY_SERVICE_URL}/api/generate`, { + method: 'POST', + headers: { + 'Content-Type': 'application/json', + }, + body: JSON.stringify(body), + }) + + if (!response.ok) { + const errorText = await response.text() + console.error('PyGraphistry service error:', errorText) + return NextResponse.json( + { + error: 'PyGraphistry service error', + details: errorText + }, + { status: response.status } + ) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Error forwarding to PyGraphistry service:', error) + return NextResponse.json( + { + error: 'Failed to connect to PyGraphistry service', + details: error instanceof Error ? 
error.message : 'Unknown error' + }, + { status: 500 } + ) + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/health/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/health/route.ts new file mode 100644 index 0000000..4110185 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/health/route.ts @@ -0,0 +1,42 @@ +import { NextRequest, NextResponse } from 'next/server' + +const PYGRAPHISTRY_SERVICE_URL = process.env.PYGRAPHISTRY_SERVICE_URL || 'http://localhost:8080' + +export async function GET(request: NextRequest) { + try { + // Forward the request to the PyGraphistry service + const response = await fetch(`${PYGRAPHISTRY_SERVICE_URL}/api/health`, { + method: 'GET', + headers: { + 'Content-Type': 'application/json', + }, + }) + + if (!response.ok) { + const errorText = await response.text() + console.error('PyGraphistry service health check failed:', errorText) + return NextResponse.json( + { + status: 'error', + error: 'PyGraphistry service unhealthy', + details: errorText + }, + { status: response.status } + ) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Error connecting to PyGraphistry service:', error) + return NextResponse.json( + { + status: 'error', + error: 'Failed to connect to PyGraphistry service', + details: error instanceof Error ? 
error.message : 'Unknown error' + }, + { status: 500 } + ) + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/stats/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/stats/route.ts new file mode 100644 index 0000000..163a5b7 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/stats/route.ts @@ -0,0 +1,43 @@ +import { NextRequest, NextResponse } from 'next/server' + +const PYGRAPHISTRY_SERVICE_URL = process.env.PYGRAPHISTRY_SERVICE_URL || 'http://localhost:8080' + +export async function POST(request: NextRequest) { + try { + const body = await request.json() + + // Forward the request to the PyGraphistry service + const response = await fetch(`${PYGRAPHISTRY_SERVICE_URL}/api/stats`, { + method: 'POST', + headers: { + 'Content-Type': 'application/json', + }, + body: JSON.stringify(body), + }) + + if (!response.ok) { + const errorText = await response.text() + console.error('PyGraphistry stats service error:', errorText) + return NextResponse.json( + { + error: 'PyGraphistry stats service error', + details: errorText + }, + { status: response.status } + ) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Error forwarding stats request to PyGraphistry service:', error) + return NextResponse.json( + { + error: 'Failed to connect to PyGraphistry service for stats', + details: error instanceof Error ? 
error.message : 'Unknown error' + }, + { status: 500 } + ) + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/visualize/route.ts b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/visualize/route.ts new file mode 100644 index 0000000..9c916ee --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/pygraphistry/visualize/route.ts @@ -0,0 +1,43 @@ +import { NextRequest, NextResponse } from 'next/server' + +const PYGRAPHISTRY_SERVICE_URL = process.env.PYGRAPHISTRY_SERVICE_URL || 'http://localhost:8080' + +export async function POST(request: NextRequest) { + try { + const body = await request.json() + + // Forward the request to the PyGraphistry service + const response = await fetch(`${PYGRAPHISTRY_SERVICE_URL}/api/visualize`, { + method: 'POST', + headers: { + 'Content-Type': 'application/json', + }, + body: JSON.stringify(body), + }) + + if (!response.ok) { + const errorText = await response.text() + console.error('PyGraphistry service error:', errorText) + return NextResponse.json( + { + error: 'PyGraphistry service error', + details: errorText + }, + { status: response.status } + ) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Error forwarding to PyGraphistry service:', error) + return NextResponse.json( + { + error: 'Failed to connect to PyGraphistry service', + details: error instanceof Error ? 
error.message : 'Unknown error'
+      },
+      { status: 500 }
+    )
+  }
+}
\ No newline at end of file
diff --git a/nvidia/txt2kg/assets/frontend/app/api/query-log/add/route.ts b/nvidia/txt2kg/assets/frontend/app/api/query-log/add/route.ts
new file mode 100644
index 0000000..b1283bd
--- /dev/null
+++ b/nvidia/txt2kg/assets/frontend/app/api/query-log/add/route.ts
+import { NextRequest, NextResponse } from 'next/server';
+import neo4jService from '@/lib/neo4j';
+
+/**
+ * Simple endpoint to directly add a query log with a high count
+ *
+ * GET /api/query-log/add?query=...&count=N
+ * Debug/seed helper: upserts a QueryLog node with the given count, attaches
+ * a synthetic QueryExecution, optionally seeds two extra queries, and
+ * returns the latest logs for verification.
+ */
+export async function GET(request: NextRequest) {
+  try {
+    // Get the query text from URL params or use a default
+    const query = request.nextUrl.searchParams.get('query') || 'How does machine learning work?';
+    // Parse with an explicit radix and fall back to 20 on NaN so a bad
+    // "count" query param cannot poison the Cypher parameters.
+    const rawCount = parseInt(request.nextUrl.searchParams.get('count') || '20', 10);
+    const count = Number.isFinite(rawCount) ? rawCount : 20;
+
+    // Initialize Neo4j
+    if (!neo4jService.isInitialized()) {
+      neo4jService.initialize();
+    }
+
+    // Execute direct Cypher query to create a query log with a high count
+    const session = neo4jService.getSession();
+
+    try {
+      const cypher = `
+        MERGE (q:QueryLog {query: $query})
+        ON CREATE SET
+          q.firstQueried = datetime(),
+          q.count = $count
+        ON MATCH SET
+          q.lastQueried = datetime(),
+          q.count = $count
+
+        CREATE (e:QueryExecution {
+          timestamp: datetime(),
+          queryMode: 'traditional',
+          executionTimeMs: 0,
+          relevanceScore: 0,
+          precision: 0,
+          recall: 0,
+          resultCount: 0
+        })
+
+        CREATE (q)-[:HAS_EXECUTION]->(e)
+
+        RETURN q.query as query, q.count as count
+      `;
+
+      const result = await session.run(cypher, {
+        query,
+        count
+      });
+
+      const addedQuery = result.records.length > 0 ? {
+        query: result.records[0].get('query'),
+        count: result.records[0].get('count').toNumber()
+      } : null;
+
+      // Also add a few more queries
+      if (count >= 10) {
+        await session.run(cypher, {
+          query: 'What are the applications of artificial intelligence?',
+          count: count - 4
+        });
+
+        await session.run(cypher, {
+          query: 'Explain the principles of deep learning',
+          count: count - 8
+        });
+      }
+
+      // Get the current logs to verify
+      const logs = await neo4jService.getQueryLogs(5);
+
+      return NextResponse.json({
+        success: true,
+        message: `Added query log for "${query}" with count ${count}`,
+        addedQuery,
+        logs
+      });
+    } finally {
+      // Session.close() returns a Promise in the neo4j driver; await it so
+      // the connection is actually released before the handler resolves.
+      await session.close();
+    }
+  } catch (error) {
+    console.error('Error adding query log:', error);
+    return NextResponse.json({
+      success: false,
+      error: error instanceof Error ? error.message : String(error)
+    }, { status: 500 });
+  }
+}
\ No newline at end of file
diff --git a/nvidia/txt2kg/assets/frontend/app/api/query-log/route.ts b/nvidia/txt2kg/assets/frontend/app/api/query-log/route.ts
new file mode 100644
index 0000000..c9ed02d
--- /dev/null
+++ b/nvidia/txt2kg/assets/frontend/app/api/query-log/route.ts
+import { NextRequest, NextResponse } from 'next/server';
+import queryLoggerService from '@/lib/query-logger';
+
+/**
+ * API endpoint to log query metrics
+ */
+export async function POST(request: NextRequest) {
+  try {
+    const body = await request.json();
+    console.log('Received query log request:', JSON.stringify(body));
+
+    // Validate required fields
+    if (!body.query) {
+      return NextResponse.json(
+        { error: 'Missing required field: query' },
+        { status: 400 }
+      );
+    }
+
+    if (!body.queryMode) {
+      return NextResponse.json(
+        { error: 'Missing required field: queryMode' },
+        { status: 400 }
+      );
+    }
+
+    if (!body.metrics || typeof body.metrics !== 'object') {
+      return NextResponse.json(
+        { error: 'Missing required field: metrics' },
+        { status: 400 }
+      );
+    }
+
+    // Initialize logger if not already
+    if 
(!queryLoggerService.isInitialized()) { + console.log('Initializing query logger service'); + await queryLoggerService.initialize(); + } + + // Log the query with metrics + console.log(`Logging query "${body.query}" with mode "${body.queryMode}"`); + await queryLoggerService.logQuery( + body.query, + body.queryMode, + { + executionTimeMs: body.metrics.executionTimeMs || 0, + relevanceScore: body.metrics.relevanceScore, + precision: body.metrics.precision, + recall: body.metrics.recall, + resultCount: body.metrics.resultCount || 0 + } + ); + + console.log('Query logged successfully to file'); + return NextResponse.json({ success: true }); + } catch (error) { + console.error('Error logging query:', error); + const errorMessage = error instanceof Error ? error.message : 'Unknown error'; + return NextResponse.json( + { error: errorMessage }, + { status: 500 } + ); + } +} + +/** + * API endpoint to get query logs + */ +export async function GET(request: NextRequest) { + try { + // Initialize logger if not already + if (!queryLoggerService.isInitialized()) { + console.log('Initializing query logger service for retrieving logs'); + await queryLoggerService.initialize(); + } + + // Get limit from query params or default to 25 + const limit = parseInt(request.nextUrl.searchParams.get('limit') || '25'); + console.log(`Retrieving up to ${limit} query logs`); + + // Get query logs + const logs = await queryLoggerService.getQueryLogs(limit); + console.log(`Retrieved ${logs.length} query logs from file`); + + return NextResponse.json({ logs }); + } catch (error) { + console.error('Error getting query logs:', error); + const errorMessage = error instanceof Error ? 
error.message : 'Unknown error'; + return NextResponse.json( + { error: errorMessage }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/query-log/test/route.ts b/nvidia/txt2kg/assets/frontend/app/api/query-log/test/route.ts new file mode 100644 index 0000000..b22b31c --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/query-log/test/route.ts @@ -0,0 +1,55 @@ +import { NextRequest, NextResponse } from 'next/server'; +import neo4jService from '@/lib/neo4j'; + +/** + * API endpoint to create a test query log + * This is for debugging purposes only + */ +export async function GET(request: NextRequest) { + try { + console.log('[Test] Creating test query log'); + + // Initialize Neo4j if not already + if (!neo4jService.isInitialized()) { + console.log('[Test] Initializing Neo4j service'); + neo4jService.initialize(); + } + + // Get query text from URL parameters or use a default + const query = request.nextUrl.searchParams.get('query') || 'Test query for debugging'; + const queryMode = (request.nextUrl.searchParams.get('mode') || 'traditional') as 'traditional' | 'vector-search' | 'pure-rag'; + const executionTime = parseInt(request.nextUrl.searchParams.get('time') || '300'); + const resultCount = parseInt(request.nextUrl.searchParams.get('count') || '5'); + + console.log(`[Test] Adding test query: "${query}" (${queryMode})`); + + // Log the query with some test metrics + await neo4jService.logQuery( + query, + queryMode, + { + executionTimeMs: executionTime, + relevanceScore: 0, + precision: 0, + recall: 0, + resultCount: resultCount + } + ); + + // Get current query logs to verify + const logs = await neo4jService.getQueryLogs(10); + + return NextResponse.json({ + success: true, + message: `Test query "${query}" added successfully`, + logs: logs.slice(0, 3) // Return top 3 logs for verification + }); + } catch (error) { + console.error('[Test] Error creating test query log:', error); + const 
errorMessage = error instanceof Error ? error.message : 'Unknown error'; + return NextResponse.json( + { error: errorMessage }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/query/route.ts b/nvidia/txt2kg/assets/frontend/app/api/query/route.ts new file mode 100644 index 0000000..ede7853 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/query/route.ts @@ -0,0 +1,53 @@ +import { NextRequest, NextResponse } from 'next/server'; +import backendService from '@/lib/backend-service'; +import type { Triple } from '@/types/graph'; +import { getGraphDbType } from '../settings/route'; + +export async function POST(request: NextRequest) { + try { + const { query, triples, kNeighbors, fanout, numHops, useTraditional, queryMode } = await request.json(); + + if (!query) { + return NextResponse.json({ error: 'Query is required' }, { status: 400 }); + } + + // Initialize backend if needed with the selected graph DB type + if (!backendService.isInitialized) { + const graphDbType = getGraphDbType(); + console.log(`Initializing backend with graph DB type: ${graphDbType}`); + await backendService.initialize(graphDbType); + } + + // Process triples if provided + if (triples && Array.isArray(triples) && triples.length > 0) { + await backendService.processTriples(triples); + } + + // Determine if we should use traditional search based on queryMode + // This allows the frontend to explicitly choose traditional search + const shouldUseTraditional = useTraditional || (queryMode === 'traditional'); + + console.log(`Query mode: ${queryMode}, Using traditional search: ${shouldUseTraditional}`); + + // Query the backend + const relevantTriples = await backendService.query( + query, + kNeighbors || 4096, + fanout || 400, + numHops || 2, + shouldUseTraditional // Pass the flag to use traditional search + ); + + // Return results + return NextResponse.json({ + query, + relevantTriples, + count: relevantTriples.length, + message: 
`Found ${relevantTriples.length} relevant triples for query: "${query}"${shouldUseTraditional ? ' using traditional search' : ''}`
+    });
+  } catch (error) {
+    console.error('Error querying backend:', error);
+    const errorMessage = error instanceof Error ? error.message : 'Unknown error';
+    return NextResponse.json({ error: errorMessage }, { status: 500 });
+  }
+}
\ No newline at end of file
diff --git a/nvidia/txt2kg/assets/frontend/app/api/rag-query/route.ts b/nvidia/txt2kg/assets/frontend/app/api/rag-query/route.ts
new file mode 100644
index 0000000..00fec6b
--- /dev/null
+++ b/nvidia/txt2kg/assets/frontend/app/api/rag-query/route.ts
+import { NextRequest, NextResponse } from 'next/server';
+import RAGService from '@/lib/rag';
+
+/**
+ * API endpoint for RAG-based question answering
+ * Uses Pinecone for document retrieval and LangChain for generation
+ * POST /api/rag-query
+ *
+ * Body: { query: string, topK?: number } — topK defaults to 5.
+ */
+export async function POST(req: NextRequest) {
+  try {
+    // Parse request body
+    const body = await req.json();
+    const { query, topK = 5 } = body;
+
+    if (!query || typeof query !== 'string') {
+      return NextResponse.json({ error: 'Query is required' }, { status: 400 });
+    }
+
+    // Clamp topK to a positive integer so a malformed client value
+    // (NaN, 0, negative, fractional, string) cannot reach the retriever.
+    const k = Number.isInteger(topK) && topK > 0 ? topK : 5;
+
+    // Initialize the RAG service (singleton exported by @/lib/rag)
+    await RAGService.initialize();
+
+    console.log(`Processing RAG query: "${query}" with topK=${k}`);
+
+    // Retrieve documents and generate answer
+    const answer = await RAGService.retrievalQA(query, k);
+
+    // Check if this is a fallback response
+    const isGeneralKnowledgeFallback = answer.startsWith('[Note: No specific information was found');
+
+    // Return the results
+    return NextResponse.json({
+      answer,
+      usedFallback: isGeneralKnowledgeFallback,
+      success: true
+    });
+  } catch (error) {
+    console.error('Error in RAG query:', error);
+    const errorMessage = error instanceof Error ? error.message : 'Unknown error';
+    return NextResponse.json(
+      { error: `Failed to execute RAG query: ${errorMessage}` },
+      { status: 500 }
+    );
+  }
+}
\ No newline at end of file
diff --git a/nvidia/txt2kg/assets/frontend/app/api/remote-webgpu-stream/[sessionId]/route.ts b/nvidia/txt2kg/assets/frontend/app/api/remote-webgpu-stream/[sessionId]/route.ts
new file mode 100644
index 0000000..cd57fe0
--- /dev/null
+++ b/nvidia/txt2kg/assets/frontend/app/api/remote-webgpu-stream/[sessionId]/route.ts
+import { NextRequest, NextResponse } from 'next/server'
+
+// Proxy route specifically for WebRTC streaming frames
+// This handles binary image data streaming from the remote WebGPU service
+
+const REMOTE_WEBGPU_SERVICE_URL = process.env.REMOTE_WEBGPU_SERVICE_URL || 'http://txt2kg-remote-webgpu:8083'
+
+export async function GET(
+  request: NextRequest,
+  { params }: { params: { sessionId: string } }
+) {
+  try {
+    const sessionId = params.sessionId
+    const searchParams = request.nextUrl.searchParams.toString()
+    const url = `${REMOTE_WEBGPU_SERVICE_URL}/api/stream/${sessionId}${searchParams ? 
`?${searchParams}` : ''}` + + console.log(`Proxying WebRTC stream request to: ${url}`) + + const response = await fetch(url, { + method: 'GET', + }) + + if (!response.ok) { + throw new Error(`Remote WebGPU service responded with ${response.status}: ${response.statusText}`) + } + + // Get the image data as array buffer + const imageBuffer = await response.arrayBuffer() + const contentType = response.headers.get('content-type') || 'image/png' + + // Return the image with proper headers + return new NextResponse(imageBuffer, { + status: 200, + headers: { + 'Content-Type': contentType, + 'Cache-Control': 'no-cache, no-store, must-revalidate', + 'Pragma': 'no-cache', + 'Expires': '0', + }, + }) + + } catch (error) { + console.error('WebRTC stream proxy error:', error) + return NextResponse.json( + { error: 'Failed to stream from remote WebGPU service', details: String(error) }, + { status: 500 } + ) + } +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/remote-webgpu/[...path]/route.ts b/nvidia/txt2kg/assets/frontend/app/api/remote-webgpu/[...path]/route.ts new file mode 100644 index 0000000..109b0fc --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/remote-webgpu/[...path]/route.ts @@ -0,0 +1,112 @@ +import { NextRequest, NextResponse } from 'next/server' + +// Proxy route for remote WebGPU clustering service +// This allows the frontend to communicate with the clustering service +// even when running in a remote browser environment + +const REMOTE_WEBGPU_SERVICE_URL = process.env.REMOTE_WEBGPU_SERVICE_URL || 'http://txt2kg-remote-webgpu:8083' + +export async function GET( + request: NextRequest, + { params }: { params: { path: string[] } } +) { + try { + const path = params.path.join('/') + const searchParams = request.nextUrl.searchParams.toString() + const url = `${REMOTE_WEBGPU_SERVICE_URL}/${path}${searchParams ? 
`?${searchParams}` : ''}` + + console.log(`Proxying GET request to: ${url}`) + + const response = await fetch(url, { + method: 'GET', + headers: { + 'Content-Type': 'application/json', + }, + }) + + if (!response.ok) { + throw new Error(`Remote WebGPU service responded with ${response.status}: ${response.statusText}`) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Remote WebGPU proxy error:', error) + return NextResponse.json( + { error: 'Failed to communicate with remote WebGPU service', details: String(error) }, + { status: 500 } + ) + } +} + +export async function POST( + request: NextRequest, + { params }: { params: { path: string[] } } +) { + try { + const path = params.path.join('/') + const body = await request.json() + const url = `${REMOTE_WEBGPU_SERVICE_URL}/${path}` + + console.log(`Proxying POST request to: ${url}`) + console.log(`Request body:`, JSON.stringify(body, null, 2)) + console.log(`Using service URL: ${REMOTE_WEBGPU_SERVICE_URL}`) + + const response = await fetch(url, { + method: 'POST', + headers: { + 'Content-Type': 'application/json', + }, + body: JSON.stringify(body), + }) + + if (!response.ok) { + const errorText = await response.text() + throw new Error(`Remote WebGPU service responded with ${response.status}: ${response.statusText} - ${errorText}`) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Remote WebGPU proxy error:', error) + return NextResponse.json( + { error: 'Failed to communicate with remote WebGPU service', details: String(error) }, + { status: 500 } + ) + } +} + +export async function DELETE( + request: NextRequest, + { params }: { params: { path: string[] } } +) { + try { + const path = params.path.join('/') + const url = `${REMOTE_WEBGPU_SERVICE_URL}/${path}` + + console.log(`Proxying DELETE request to: ${url}`) + + const response = await fetch(url, { + method: 'DELETE', + headers: { + 
'Content-Type': 'application/json', + }, + }) + + if (!response.ok) { + throw new Error(`Remote WebGPU service responded with ${response.status}: ${response.statusText}`) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Remote WebGPU proxy error:', error) + return NextResponse.json( + { error: 'Failed to communicate with remote WebGPU service', details: String(error) }, + { status: 500 } + ) + } +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/sentence-embeddings/route.ts b/nvidia/txt2kg/assets/frontend/app/api/sentence-embeddings/route.ts new file mode 100644 index 0000000..c23c0bb --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/sentence-embeddings/route.ts @@ -0,0 +1,86 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { processSentenceEmbeddings, SentenceEmbedding } from '@/lib/text-processor'; +import { PineconeService } from '@/lib/pinecone'; + +/** + * API endpoint for splitting text into sentences and generating embeddings + * POST /api/sentence-embeddings + */ +export async function POST(req: NextRequest) { + try { + // Parse request body + const body = await req.json(); + const { text, documentId } = body; + + if (!text || typeof text !== 'string') { + return NextResponse.json({ error: 'Text is required' }, { status: 400 }); + } + + console.log(`Processing sentence embeddings for document ${documentId || 'unnamed'}`); + console.log(`Text length: ${text.length} characters`); + + // Process sentences and generate embeddings + let sentenceEmbeddings: SentenceEmbedding[] = []; + try { + sentenceEmbeddings = await processSentenceEmbeddings(text, documentId); + console.log(`Generated embeddings for ${sentenceEmbeddings.length} sentences using local sentence-transformers service`); + } catch (embeddingError) { + console.error('Error generating embeddings:', embeddingError); + return NextResponse.json( + { error: `Failed to generate embeddings: ${embeddingError 
instanceof Error ? embeddingError.message : String(embeddingError)}` }, + { status: 500 } + ); + } + + // Optionally store in vector database + if (sentenceEmbeddings.length > 0) { + try { + // Map the embeddings to a format suitable for Pinecone + const embeddingsMap = new Map(); + const textContentMap = new Map(); + const metadataMap = new Map(); + + // Create unique keys for each sentence + sentenceEmbeddings.forEach((item, index) => { + const key = `${documentId || 'doc'}_sentence_${index}`; + embeddingsMap.set(key, item.embedding); + textContentMap.set(key, item.sentence); + metadataMap.set(key, item.metadata); + }); + + // Store in Pinecone + const pineconeService = PineconeService.getInstance(); + await pineconeService.storeEmbeddingsWithMetadata( + embeddingsMap, + textContentMap, + metadataMap + ); + + console.log(`Stored ${sentenceEmbeddings.length} sentence embeddings in vector database`); + } catch (storageError) { + console.error('Error storing sentence embeddings:', storageError); + // Continue even if storage fails - we'll still return the embeddings + } + } + + // Return a summary to avoid large response sizes + return NextResponse.json({ + success: true, + count: sentenceEmbeddings.length, + documentId: documentId || 'unnamed', + // Return only the first few embeddings as samples + samples: sentenceEmbeddings.slice(0, 3).map(item => ({ + sentence: item.sentence, + metadata: item.metadata, + embeddingDimensions: item.embedding.length + })) + }); + } catch (error) { + console.error('Error processing sentence embeddings:', error); + const errorMessage = error instanceof Error ? 
error.message : 'Unknown error';
+    return NextResponse.json(
+      { error: `Failed to process sentence embeddings: ${errorMessage}` },
+      { status: 500 }
+    );
+  }
+}
\ No newline at end of file
diff --git a/nvidia/txt2kg/assets/frontend/app/api/settings/route.ts b/nvidia/txt2kg/assets/frontend/app/api/settings/route.ts
new file mode 100644
index 0000000..e091f38
--- /dev/null
+++ b/nvidia/txt2kg/assets/frontend/app/api/settings/route.ts
+import { NextRequest, NextResponse } from 'next/server';
+import { GraphDBType } from '@/lib/graph-db-service';
+
+// In-memory storage for settings.
+// NOTE(review): module-scope state is per-server-process; it resets on
+// redeploy and is not shared across serverless instances — confirm this
+// matches the deployment model.
+let serverSettings: Record<string, string> = {};
+
+/**
+ * API Route to sync client settings with server environment variables
+ * This allows us to use localStorage settings on the client side
+ * and still access them as environment variables on the server side
+ */
+export async function POST(request: NextRequest) {
+  try {
+    const { settings } = await request.json();
+
+    if (!settings || typeof settings !== 'object') {
+      return NextResponse.json({ error: 'Settings object is required' }, { status: 400 });
+    }
+
+    // Update server settings
+    serverSettings = { ...serverSettings, ...settings };
+
+    // Log some important settings for debugging
+    if (settings.graph_db_type) {
+      console.log(`Setting graph database type to: ${settings.graph_db_type}`);
+    }
+
+    return NextResponse.json({
+      success: true,
+      message: 'Settings updated successfully'
+    });
+  } catch (error) {
+    console.error('Error updating settings:', error);
+    const errorMessage = error instanceof Error ? error.message : 'Unknown error';
+    return NextResponse.json({ error: errorMessage }, { status: 500 });
+  }
+}
+
+/**
+ * GET /api/settings
+ * Retrieve settings from the server side
+ */
+export async function GET(request: NextRequest) {
+  try {
+    const url = new URL(request.url);
+    const key = url.searchParams.get('key');
+
+    if (key) {
+      // ?? instead of ||: an intentionally-empty string setting is returned
+      // as-is rather than being collapsed to null.
+      return NextResponse.json({
+        [key]: serverSettings[key] ?? null
+      });
+    }
+
+    // Return all settings (may want to filter sensitive ones in production)
+    return NextResponse.json({
+      settings: serverSettings
+    });
+  } catch (error) {
+    console.error('Error retrieving settings:', error);
+    const errorMessage = error instanceof Error ? error.message : 'Unknown error';
+    return NextResponse.json({ error: errorMessage }, { status: 500 });
+  }
+}
+
+/**
+ * Helper function to get a setting value
+ * For use in other API routes
+ * NOTE(review): Next.js App Router route files are only supposed to export
+ * HTTP handlers/config; these helper exports should move to a shared lib
+ * module — confirm against the project's Next.js version before building.
+ */
+export function getSetting(key: string): string | null {
+  // ?? keeps a legitimately-empty string value instead of mapping it to null
+  return serverSettings[key] ?? null;
+}
+
+/**
+ * Get the currently selected graph database type
+ * Falls back to 'arangodb' when unset or empty (|| is deliberate here so
+ * that an empty-string setting also falls back to the default).
+ */
+export function getGraphDbType(): GraphDBType {
+  return (serverSettings.graph_db_type as GraphDBType) || 'arangodb';
+}
\ No newline at end of file
diff --git a/nvidia/txt2kg/assets/frontend/app/api/stop-embeddings/route.ts b/nvidia/txt2kg/assets/frontend/app/api/stop-embeddings/route.ts
new file mode 100644
index 0000000..f8b026b
--- /dev/null
+++ b/nvidia/txt2kg/assets/frontend/app/api/stop-embeddings/route.ts
+import { NextRequest, NextResponse } from 'next/server'
+
+// Global flag to track if embeddings generation should be stopped
+let shouldStopEmbeddings = false
+
+// Function to check if embeddings generation should stop
+export function getShouldStopEmbeddings(): boolean {
+  return shouldStopEmbeddings
+}
+
+// Function to reset the stop flag
+export function resetStopEmbeddings(): void {
+  shouldStopEmbeddings = false
+}
+
+// Function to set the stop flag
+export function 
setStopEmbeddings(): void { + shouldStopEmbeddings = true +} + +export async function POST(request: NextRequest) { + try { + console.log('Stop embeddings generation request received') + + // Set the global flag to stop embeddings generation + shouldStopEmbeddings = true + + return NextResponse.json({ + success: true, + message: 'Embeddings generation stop signal sent' + }) + } catch (error) { + console.error('Error stopping embeddings generation:', error) + return NextResponse.json( + { error: 'Failed to stop embeddings generation' }, + { status: 500 } + ) + } +} + +export async function GET(request: NextRequest) { + // Allow checking the current stop status + return NextResponse.json({ + shouldStop: shouldStopEmbeddings + }) +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/stop-processing/route.ts b/nvidia/txt2kg/assets/frontend/app/api/stop-processing/route.ts new file mode 100644 index 0000000..c5b3128 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/stop-processing/route.ts @@ -0,0 +1,49 @@ +import { NextRequest, NextResponse } from 'next/server' + +// Global flag to track if processing should be stopped +let shouldStopProcessing = false + +// Function to check if processing should stop +export function getShouldStopProcessing(): boolean { + return shouldStopProcessing +} + +// Function to reset the stop flag +export function resetStopProcessing(): void { + shouldStopProcessing = false +} + +// Function to set the stop flag +export function setStopProcessing(): void { + shouldStopProcessing = true +} + +export async function POST(request: NextRequest) { + try { + console.log('Stop processing request received') + + // Set the global flag to stop processing + shouldStopProcessing = true + + // You could also implement more sophisticated cancellation here + // such as canceling ongoing HTTP requests, clearing queues, etc. 
+ + return NextResponse.json({ + success: true, + message: 'Processing stop signal sent' + }) + } catch (error) { + console.error('Error stopping processing:', error) + return NextResponse.json( + { error: 'Failed to stop processing' }, + { status: 500 } + ) + } +} + +export async function GET(request: NextRequest) { + // Allow checking the current stop status + return NextResponse.json({ + shouldStop: shouldStopProcessing + }) +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/store-documents/route.ts b/nvidia/txt2kg/assets/frontend/app/api/store-documents/route.ts new file mode 100644 index 0000000..af4a8c3 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/store-documents/route.ts @@ -0,0 +1,49 @@ +import { NextRequest, NextResponse } from 'next/server'; +import RAGService from '@/lib/rag'; + +/** + * API endpoint for storing documents in the RAG system + * POST /api/store-documents + */ +export async function POST(req: NextRequest) { + try { + // Parse request body + const body = await req.json(); + const { documents, metadata } = body; + + if (!documents || !Array.isArray(documents) || documents.length === 0) { + return NextResponse.json({ error: 'Documents array is required' }, { status: 400 }); + } + + // Validate that all documents are strings + const isValid = documents.every(doc => typeof doc === 'string' && doc.trim().length > 0); + if (!isValid) { + return NextResponse.json({ + error: 'All documents must be non-empty strings' + }, { status: 400 }); + } + + // Initialize the RAG service + const ragService = RAGService; + await ragService.initialize(); + + console.log(`Storing ${documents.length} documents in RAG system`); + + // Store the documents + await ragService.storeDocuments(documents, metadata); + + // Return success + return NextResponse.json({ + success: true, + count: documents.length, + message: `Successfully stored ${documents.length} documents` + }); + } catch (error) { + console.error('Error storing documents:', error); + const 
errorMessage = error instanceof Error ? error.message : 'Unknown error'; + return NextResponse.json( + { error: `Failed to store documents: ${errorMessage}` }, + { status: 500 } + ); + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/test-proxy/route.ts b/nvidia/txt2kg/assets/frontend/app/api/test-proxy/route.ts new file mode 100644 index 0000000..5e4c77c --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/test-proxy/route.ts @@ -0,0 +1,32 @@ +import { NextRequest, NextResponse } from 'next/server' + +// Simple test endpoint to verify proxy connectivity +const REMOTE_WEBGPU_SERVICE_URL = process.env.REMOTE_WEBGPU_SERVICE_URL || 'http://txt2kg-remote-webgpu:8083' + +export async function GET() { + try { + console.log(`Testing connection to: ${REMOTE_WEBGPU_SERVICE_URL}`) + + const response = await fetch(`${REMOTE_WEBGPU_SERVICE_URL}/health`) + + if (!response.ok) { + throw new Error(`Service responded with ${response.status}: ${response.statusText}`) + } + + const data = await response.json() + + return NextResponse.json({ + success: true, + service_url: REMOTE_WEBGPU_SERVICE_URL, + service_response: data + }) + + } catch (error) { + console.error('Proxy test failed:', error) + return NextResponse.json({ + success: false, + error: String(error), + service_url: REMOTE_WEBGPU_SERVICE_URL + }, { status: 500 }) + } +} diff --git a/nvidia/txt2kg/assets/frontend/app/api/unified-gpu/capabilities/route.ts b/nvidia/txt2kg/assets/frontend/app/api/unified-gpu/capabilities/route.ts new file mode 100644 index 0000000..ff01d58 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/unified-gpu/capabilities/route.ts @@ -0,0 +1,40 @@ +import { NextRequest, NextResponse } from 'next/server' + +const UNIFIED_GPU_SERVICE_URL = process.env.UNIFIED_GPU_SERVICE_URL || 'http://localhost:8080' + +export async function GET(request: NextRequest) { + try { + // Forward the request to the unified GPU service + const response = await 
fetch(`${UNIFIED_GPU_SERVICE_URL}/api/capabilities`, { + method: 'GET', + headers: { + 'Content-Type': 'application/json', + }, + }) + + if (!response.ok) { + const errorText = await response.text() + console.error('Unified GPU service error:', errorText) + return NextResponse.json( + { + error: 'Unified GPU service error', + details: errorText + }, + { status: response.status } + ) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Error forwarding to unified GPU service:', error) + return NextResponse.json( + { + error: 'Failed to connect to unified GPU service', + details: error instanceof Error ? error.message : 'Unknown error' + }, + { status: 500 } + ) + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/unified-gpu/visualize/route.ts b/nvidia/txt2kg/assets/frontend/app/api/unified-gpu/visualize/route.ts new file mode 100644 index 0000000..fb4fbd1 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/unified-gpu/visualize/route.ts @@ -0,0 +1,43 @@ +import { NextRequest, NextResponse } from 'next/server' + +const UNIFIED_GPU_SERVICE_URL = process.env.UNIFIED_GPU_SERVICE_URL || 'http://localhost:8080' + +export async function POST(request: NextRequest) { + try { + const body = await request.json() + + // Forward the request to the unified GPU service + const response = await fetch(`${UNIFIED_GPU_SERVICE_URL}/api/visualize`, { + method: 'POST', + headers: { + 'Content-Type': 'application/json', + }, + body: JSON.stringify(body), + }) + + if (!response.ok) { + const errorText = await response.text() + console.error('Unified GPU service error:', errorText) + return NextResponse.json( + { + error: 'Unified GPU service error', + details: errorText + }, + { status: response.status } + ) + } + + const data = await response.json() + return NextResponse.json(data) + + } catch (error) { + console.error('Error forwarding to unified GPU service:', error) + return 
NextResponse.json( + { + error: 'Failed to connect to unified GPU service', + details: error instanceof Error ? error.message : 'Unknown error' + }, + { status: 500 } + ) + } +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/api/vllm/route.ts b/nvidia/txt2kg/assets/frontend/app/api/vllm/route.ts new file mode 100644 index 0000000..4301a53 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/api/vllm/route.ts @@ -0,0 +1,184 @@ +import { NextRequest, NextResponse } from 'next/server'; +import { LLMService } from '@/lib/llm-service'; + +const llmService = LLMService.getInstance(); + +/** + * Test vLLM connection and list available models + * GET /api/vllm?action=test-connection + */ +export async function GET(req: NextRequest) { + const { searchParams } = new URL(req.url); + const action = searchParams.get('action'); + + if (action === 'test-connection') { + try { + const vllmBaseUrl = process.env.VLLM_BASE_URL || 'http://localhost:8001/v1'; + + // Test connection to vLLM service using built-in /v1/models endpoint + const response = await fetch(`${vllmBaseUrl}/models`, { + method: 'GET', + headers: { + 'Content-Type': 'application/json', + }, + signal: AbortSignal.timeout(10000), + }); + + if (!response.ok) { + throw new Error(`vLLM service returned ${response.status}: ${response.statusText}`); + } + + const healthData = { + status: "healthy", + service: "vllm", + note: "Using vLLM's built-in OpenAI API server" + }; + + // Get available models (reuse the response from health check) + const modelsData = await response.json(); + const models = modelsData.data?.map((model: any) => model.id) || []; + + return NextResponse.json({ + connected: true, + health: healthData, + models: models, + baseUrl: vllmBaseUrl + }); + + } catch (error) { + console.error('vLLM connection test failed:', error); + return NextResponse.json( + { + connected: false, + error: error instanceof Error ? 
error.message : String(error), + baseUrl: process.env.VLLM_BASE_URL || 'http://localhost:8001/v1' + }, + { status: 503 } + ); + } + } + + return NextResponse.json( + { error: 'Invalid action parameter' }, + { status: 400 } + ); +} + +/** + * Extract triples using vLLM + * POST /api/vllm + */ +export async function POST(req: NextRequest) { + try { + const { text, model = 'meta-llama/Llama-3.2-3B-Instruct', temperature = 0.1, maxTokens = 1024 } = await req.json(); + + if (!text || typeof text !== 'string') { + return NextResponse.json({ error: 'Text is required' }, { status: 400 }); + } + + // Use the LLM service to generate completion with vLLM + const messages = [ + { + role: 'system' as const, + content: `You are a knowledge graph builder that extracts structured information from text. +Extract subject-predicate-object triples from the following text. + +Guidelines: +- Extract only factual triples present in the text +- Normalize entity names to their canonical form +- Return results in JSON format as an array of objects with "subject", "predicate", "object" fields +- Each triple should represent a clear relationship between two entities +- Focus on the most important relationships in the text` + }, + { + role: 'user' as const, + content: `Extract triples from this text:\n\n${text}` + } + ]; + + // Use LLMService for direct chat completion via vLLM's OpenAI API + const response = await llmService.generateVllmCompletion( + model, + messages, + { temperature, maxTokens } + ); + + // Parse the response to extract triples + let triples = []; + try { + // Try to parse as JSON first + const jsonMatch = response.match(/\[[\s\S]*\]/); + if (jsonMatch) { + triples = JSON.parse(jsonMatch[0]); + } else { + // Fallback: parse line by line + triples = parseTriplesFallback(response); + } + } catch (parseError) { + console.warn('Failed to parse JSON response, using fallback parser:', parseError); + triples = parseTriplesFallback(response); + } + + return NextResponse.json({ + 
triples: triples, + model: model, + provider: 'vllm', + rawResponse: response + }); + + } catch (error) { + console.error('Error in vLLM triple extraction:', error); + return NextResponse.json( + { error: error instanceof Error ? error.message : String(error) }, + { status: 500 } + ); + } +} + +/** + * Fallback parser for extracting triples from text response + */ +function parseTriplesFallback(text: string): Array<{ subject: string; predicate: string; object: string }> { + const triples = []; + const lines = text.split('\n'); + + for (const line of lines) { + const trimmedLine = line.trim(); + if (!trimmedLine || trimmedLine.startsWith('#') || trimmedLine.startsWith('//')) { + continue; + } + + // Try to parse different formats + if (trimmedLine.includes(' -> ')) { + const parts = trimmedLine.split(' -> '); + if (parts.length >= 3) { + triples.push({ + subject: parts[0].trim(), + predicate: parts[1].trim(), + object: parts[2].trim() + }); + } + } else if (trimmedLine.includes('|')) { + const parts = trimmedLine.split('|'); + if (parts.length >= 3) { + triples.push({ + subject: parts[0].trim(), + predicate: parts[1].trim(), + object: parts[2].trim() + }); + } + } else if (trimmedLine.includes(',')) { + // Try comma-separated format: "subject, predicate, object" + const parts = trimmedLine.split(','); + if (parts.length >= 3) { + triples.push({ + subject: parts[0].trim().replace(/['"]/g, ''), + predicate: parts[1].trim().replace(/['"]/g, ''), + object: parts[2].trim().replace(/['"]/g, '') + }); + } + } + } + + return triples; +} diff --git a/nvidia/txt2kg/assets/frontend/app/components/documents-list.tsx b/nvidia/txt2kg/assets/frontend/app/components/documents-list.tsx new file mode 100644 index 0000000..e89b83e --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/components/documents-list.tsx @@ -0,0 +1,140 @@ +"use client"; + +import { useState, useEffect } from "react"; +import { Triple } from "@/types/graph"; + +export default function DocumentsList() { + const 
[loading, setLoading] = useState(true); + const [triples, setTriples] = useState([]); + const [entities, setEntities] = useState([]); + const [error, setError] = useState(null); + + useEffect(() => { + async function fetchTriplesAndEntities() { + try { + setLoading(true); + + // Fetch triples from Neo4j + const response = await fetch('/api/neo4j/triples', { + method: 'GET', + headers: { + 'Content-Type': 'application/json' + } + }); + + if (!response.ok) { + throw new Error(`Failed to fetch triples: ${response.statusText}`); + } + + const data = await response.json(); + + // Extract unique entities + const uniqueEntities = new Set(); + data.triples.forEach((triple: Triple) => { + uniqueEntities.add(triple.subject); + uniqueEntities.add(triple.object); + }); + + // Store data in local storage, overwriting previous data + localStorage.setItem("graphTriples", JSON.stringify(data.triples)); + localStorage.setItem("graphDocumentName", "sample_data.csv"); + + // Update state + setTriples(data.triples); + setEntities(Array.from(uniqueEntities)); + setError(null); + } catch (err) { + console.error("Error fetching data:", err); + setError(err instanceof Error ? err.message : "Unknown error occurred"); + } finally { + setLoading(false); + } + } + + fetchTriplesAndEntities(); + }, []); + + return ( +
+

Document Data

+ + {loading && ( +
+
+

Loading data from Neo4j...

+
+ )} + + {error && ( +
+

Error

+

{error}

+
+ )} + + {!loading && !error && ( +
+
+

Entities ({entities.length})

+
+
    + {entities.slice(0, 100).map((entity, index) => ( +
  • {entity}
  • + ))} +
+ {entities.length > 100 && ( +

+ Showing 100 of {entities.length} entities +

+ )} +
+
+ +
+

Triples ({triples.length})

+
+
+ + + + + + + + + + {triples.slice(0, 50).map((triple, index) => ( + + + + + + ))} + +
SubjectPredicateObject
{triple.subject}{triple.predicate}{triple.object}
+
+ {triples.length > 50 && ( +

+ Showing 50 of {triples.length} triples +

+ )} +
+
+ +
+ +
+
+ )} +
+ ); +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/document-data/page.tsx b/nvidia/txt2kg/assets/frontend/app/document-data/page.tsx new file mode 100644 index 0000000..b17fcff --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/document-data/page.tsx @@ -0,0 +1,30 @@ +import { DocumentsTable } from "@/components/documents-table"; +import { DocumentProcessor } from "@/components/document-processor"; + +export default function DocumentDataPage() { + return ( +
+

Document Knowledge Graph Builder

+

+ Process documents to extract knowledge triples and generate embeddings using LangChain. +

+ +
+
+ +
+
+
+
+

Documents

+

+ Manage your documents and generate embeddings directly from the table +

+
+ +
+
+
+
+ ); +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/globals.css b/nvidia/txt2kg/assets/frontend/app/globals.css new file mode 100644 index 0000000..74dc189 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/globals.css @@ -0,0 +1,399 @@ +@tailwind base; +@tailwind components; +@tailwind utilities; + +/* Import NVIDIA Build typography patterns with Inter font */ +@import url('../styles/nvidia-build-typography.css'); + +:root { + --foreground-rgb: 255, 255, 255; + --background-rgb: 0, 0, 0; +} + +body { + color: rgb(var(--foreground-rgb)); + background: rgb(var(--background-rgb)); + scroll-behavior: smooth; +} + +/* Base page container styles */ +.container { + max-width: 90rem; + width: 100%; + margin-left: auto; + margin-right: auto; +} + +@media (min-width: 1024px) { + .container { + padding-left: 2rem; + padding-right: 2rem; + } +} + +@layer utilities { + .text-balance { + text-wrap: balance; + } + + .animate-fadeIn { + animation: fadeIn 0.3s ease-in-out; + } + + /* NVIDIA green utility text color */ + .text-nvidia-green { + color: hsl(var(--nvidia-green)); + } + + /* Accent color for native form controls (checkbox/radio) */ + .selection-accent { + accent-color: hsl(var(--nvidia-green)); + } + + @keyframes fadeIn { + from { + opacity: 0; + transform: translateY(-10px); + } + to { + opacity: 1; + transform: translateY(0); + } + } + + /* Venture-backed startup modern tab styles */ + .startup-tabs { + @apply relative max-w-4xl mx-auto p-1.5 bg-black/5 dark:bg-white/5 + backdrop-blur-lg border border-white/10 dark:border-white/5 + rounded-full shadow-xl overflow-hidden; + } + + .startup-tab-indicator { + @apply absolute h-full bg-primary/5 dark:bg-primary/10 rounded-full + transition-all duration-300 ease-out; + margin: 4px; + height: calc(100% - 8px); + } + + .startup-tab { + @apply flex-1 flex items-center justify-center gap-3 py-3 font-medium + z-10 text-sm transition-all duration-200 hover:text-primary/90; + letter-spacing: 
0.01em; + } + + .startup-tab-active { + @apply text-primary font-semibold; + position: relative; + } + + .startup-tab-active::after { + content: ""; + position: absolute; + bottom: -2px; + left: 25%; + width: 50%; + height: 2px; + @apply bg-primary rounded-full; + } + + .startup-tab-icon { + @apply flex items-center justify-center w-8 h-8 rounded-full + bg-gradient-to-br from-primary/20 to-primary/5 text-primary; + box-shadow: 0 2px 4px rgba(0, 0, 0, 0.04); + } + + .startup-tab-active .startup-tab-icon { + @apply from-primary/30 to-primary/15; + transform: scale(1.05); + } +} + +@layer base { + :root { + --background: 0 0% 7%; + --foreground: 0 0% 98%; + + --card: 0 0% 9%; + --card-foreground: 0 0% 98%; + + --popover: 0 0% 7%; + --popover-foreground: 0 0% 98%; + + /* Use NVIDIA green as the brand primary in dark mode */ + --primary: 81 100% 36%; + --primary-foreground: 0 0% 98%; + + --secondary: 0 0% 12%; + --secondary-foreground: 0 0% 98%; + + --muted: 0 0% 12%; + --muted-foreground: 0 0% 65%; + + --accent: 0 0% 12%; + --accent-foreground: 0 0% 98%; + + --destructive: 0 62% 30%; + --destructive-foreground: 0 0% 98%; + + --border: 0 0% 18%; + --input: 0 0% 18%; + --ring: 81 100% 36%; + + --radius: 0.75rem; + + --nvidia-green: 81 100% 36%; + --nvidia-green-dark: 81 100% 30%; + --nvidia-green-light: 81 100% 42%; + } + + .light { + --background: 0 0% 100%; + --foreground: 0 0% 9%; + + --card: 0 0% 98%; + --card-foreground: 0 0% 9%; + + --popover: 0 0% 100%; + --popover-foreground: 0 0% 9%; + + /* Use a more subtle primary in light mode */ + --primary: 81 60% 45%; + --primary-foreground: 0 0% 98%; + + --secondary: 210 40% 96%; + --secondary-foreground: 0 0% 9%; + + --muted: 210 40% 96%; + --muted-foreground: 215 16% 47%; + + --accent: 210 40% 96%; + --accent-foreground: 0 0% 9%; + + --destructive: 0 84% 60%; + --destructive-foreground: 210 40% 98%; + + --border: 214 32% 91%; + --input: 214 32% 91%; + --ring: 214 32% 80%; + + --radius: 0.75rem; + + --nvidia-green: 81 
100% 36%; + --nvidia-green-dark: 81 100% 30%; + --nvidia-green-light: 81 100% 42%; + } +} + +@layer base { + * { + @apply border-border; + } + body { + @apply bg-background text-foreground; + } +} + +/* Modern UI Elements */ +.glass-card { + @apply bg-card/80 backdrop-blur-md shadow-xl; + border: 1px solid; + border-color: rgba(255, 255, 255, 0.1); + transition: all 0.3s ease; +} + +.light .glass-card { + border-color: rgba(0, 0, 0, 0.1); + box-shadow: 0 6px 14px -6px rgba(0, 0, 0, 0.10); +} + +.glass-card:hover { + border-color: rgba(255, 255, 255, 0.15); + box-shadow: 0 16px 20px -6px rgba(0, 0, 0, 0.10), 0 8px 10px -6px rgba(0, 0, 0, 0.04); +} + +.light .glass-card:hover { + border-color: rgba(0, 0, 0, 0.15); +} + +.glass-card h2 { + letter-spacing: 0.01em; +} + +.glass-card p { + line-height: 1.6; +} + +.gradient-border { + position: relative; + border-radius: var(--radius); + overflow: hidden; +} + +.gradient-border::before { + content: ""; + position: absolute; + inset: 0; + border-radius: var(--radius); + padding: 1px; + background: linear-gradient( + to right, + hsl(var(--nvidia-green)), + hsl(var(--nvidia-green-light)), + hsl(var(--nvidia-green-dark)) + ); + -webkit-mask: linear-gradient(#fff 0 0) content-box, linear-gradient(#fff 0 0); + -webkit-mask-composite: xor; + mask-composite: exclude; + pointer-events: none; +} + +.gradient-text { + background: linear-gradient(to right, hsl(var(--nvidia-green)), hsl(var(--nvidia-green-light))); + -webkit-background-clip: text; + -webkit-text-fill-color: transparent; + background-clip: text; + color: transparent; +} + +.gradient-bg { + background: linear-gradient( + 135deg, + hsl(var(--nvidia-green-dark)) 0%, + hsl(var(--nvidia-green)) 50%, + hsl(var(--nvidia-green-light)) 100% + ); +} + +.hover-lift { + transition: transform 0.2s ease, box-shadow 0.2s ease; +} + +.hover-lift:hover { + transform: translateY(-2px); + box-shadow: 0 8px 18px -6px rgba(0, 0, 0, 0.10), 0 6px 8px -6px rgba(0, 0, 0, 0.08); +} + 
+.animate-glow { + animation: glow 2s ease-in-out infinite alternate; +} + +@keyframes glow { + from { + box-shadow: 0 0 5px rgba(118, 185, 0, 0.2), 0 0 10px rgba(118, 185, 0, 0.1); + } + to { + box-shadow: 0 0 10px rgba(118, 185, 0, 0.3), 0 0 20px rgba(118, 185, 0, 0.2); + } +} + +/* Modern Table Styles */ +.modern-table { + @apply w-full border-collapse; +} + +.modern-table th { + @apply p-2 text-left font-medium text-muted-foreground text-xs md:text-sm tracking-wide; +} + +.modern-table td { + @apply p-2 text-foreground text-sm; +} + +.modern-table tr { + @apply border-b border-border/50 transition-colors; +} + +.modern-table tbody tr:hover { + @apply bg-muted/30; +} + +/* Light mode row selection highlight using NVIDIA green */ +.light .row-selected { + background-color: hsl(var(--nvidia-green) / 0.08); +} + +.light .row-selected:hover { + background-color: hsl(var(--nvidia-green) / 0.12); +} + +/* Slider Styles */ +.slider-thumb::-webkit-slider-thumb { + appearance: none; + height: 18px; + width: 18px; + border-radius: 50%; + background: hsl(var(--nvidia-green)); + border: 2px solid hsl(var(--background)); + box-shadow: 0 2px 4px rgba(0, 0, 0, 0.1); + cursor: pointer; + transition: all 0.2s ease; +} + +.slider-thumb::-webkit-slider-thumb:hover { + transform: scale(1.1); + box-shadow: 0 4px 8px rgba(0, 0, 0, 0.15); +} + +.slider-thumb::-moz-range-thumb { + height: 18px; + width: 18px; + border-radius: 50%; + background: hsl(var(--nvidia-green)); + border: 2px solid hsl(var(--background)); + box-shadow: 0 2px 4px rgba(0, 0, 0, 0.1); + cursor: pointer; + transition: all 0.2s ease; +} + +.slider-thumb::-moz-range-thumb:hover { + transform: scale(1.1); + box-shadow: 0 4px 8px rgba(0, 0, 0, 0.15); +} + +/* Button Styles */ +.btn-primary { + @apply px-4 py-2 bg-primary text-primary-foreground font-medium rounded-md text-sm + hover:bg-primary/90 transition-colors flex items-center gap-2 shadow-md; +} + +.btn-secondary { + @apply px-4 py-2 bg-secondary 
text-secondary-foreground font-medium rounded-md text-sm + hover:bg-secondary/90 transition-colors flex items-center gap-2; +} + +.btn-outline { + @apply px-4 py-2 bg-transparent border border-border text-foreground font-medium rounded-md text-sm + hover:bg-muted transition-colors flex items-center gap-2; +} + +.btn-destructive { + @apply px-4 py-2 bg-destructive/20 text-destructive font-medium rounded-md text-sm + hover:bg-destructive/30 transition-colors flex items-center gap-2; +} + +.btn-icon { + @apply p-2 rounded-full hover:bg-muted text-muted-foreground hover:text-foreground transition-colors; +} + +/* Dropdown Menu Fixes */ +.dropdown-menu { + max-height: 80vh; + overflow-y: auto; + z-index: 50; +} + +.dropdown-container { + position: relative; +} + +/* Light mode: soften elevation globally to reduce heavy shadows */ +.light .shadow-sm { box-shadow: 0 1px 2px rgba(0,0,0,0.06) !important; } +.light .shadow, +.light .shadow-md { box-shadow: 0 2px 6px rgba(0,0,0,0.08) !important; } +.light .shadow-lg { box-shadow: 0 6px 14px -6px rgba(0,0,0,0.10) !important; } +.light .shadow-xl { box-shadow: 0 10px 18px -8px rgba(0,0,0,0.12) !important; } +.light .shadow-2xl { box-shadow: 0 12px 22px -10px rgba(0,0,0,0.14) !important; } + +/* Light mode: tune specific custom elements */ +.light .glass-card:hover { box-shadow: 0 10px 18px -8px rgba(0,0,0,0.12) !important; } +.light .startup-tab-icon { box-shadow: 0 1px 3px rgba(0,0,0,0.06) !important; } diff --git a/nvidia/txt2kg/assets/frontend/app/graph/page.tsx b/nvidia/txt2kg/assets/frontend/app/graph/page.tsx new file mode 100644 index 0000000..1458303 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/graph/page.tsx @@ -0,0 +1,330 @@ +"use client" + +import { useEffect, useState } from "react" +import type { Triple } from "@/utils/text-processing" +import { GraphVisualization } from "@/components/graph-visualization" +import { NvidiaIcon } from "@/components/nvidia-icon" +import { ArrowLeft, AlertCircle, Network } 
from "lucide-react" +import { GraphDataForm } from "@/components/graph-data-form" +import { useRouter } from "next/navigation" + +export default function GraphPage() { + const router = useRouter() + const [triples, setTriples] = useState([]) + const [documentName, setDocumentName] = useState("") + const [loading, setLoading] = useState(true) + const [error, setError] = useState(null) + const [dataSource, setDataSource] = useState<"neo4j" | "api" | "local" | "none">("none") + + useEffect(() => { + const loadGraphData = async () => { + try { + setLoading(true) + setError(null) + + // Check URL parameters + const params = new URLSearchParams(window.location.search) + const graphId = params.get("id") + const source = params.get("source") + + console.log("Loading graph with params:", { graphId, source, url: window.location.href }) + + // First try to load from Neo4j if available + if (source !== "local") { + try { + console.log("Attempting to load graph data from Neo4j") + const neo4jResponse = await fetch('/api/neo4j') + + if (neo4jResponse.ok) { + const neo4jData = await neo4jResponse.json() + if (neo4jData.triples && Array.isArray(neo4jData.triples) && neo4jData.triples.length > 0) { + console.log("Successfully loaded graph data from Neo4j") + setTriples(neo4jData.triples) + setDocumentName(neo4jData.documentName || "Neo4j Graph") + setDataSource("neo4j") + setLoading(false) + return + } else { + console.warn("Neo4j returned empty or invalid data, trying other methods") + } + } else { + console.warn(`Neo4j returned status ${neo4jResponse.status}, trying other methods`) + } + } catch (neo4jError) { + console.error("Error accessing Neo4j:", neo4jError) + console.log("Continuing with other data sources") + } + } + + // If source=local is specified, use localStorage directly + if (source === "local") { + console.log("Using localStorage as specified in URL") + return loadFromLocalStorage() + } + + // If we have a graph ID, try to load from API + if (graphId) { + try { + 
console.log("Loading graph data from API with ID:", graphId) + console.log("Fetching from /api/graph-data?id=" + graphId) + const response = await fetch(`/api/graph-data?id=${graphId}`) + + if (response.ok) { + const data = await response.json() + console.log("Successfully loaded graph data from API") + setTriples(data.triples) + setDocumentName(data.documentName) + setDataSource("api") + return + } else { + // Check if this is our special response indicating localStorage should be used + let errorData; + try { + errorData = await response.json(); + console.log("Error response data:", errorData); + + if (errorData.useLocalStorage) { + console.log("Server indicated to use localStorage fallback") + loadFromLocalStorage(); + return; + } + } catch (jsonError) { + // Response wasn't JSON, try to get text + const errorText = await response.text(); + console.error(`Failed to load graph data from API (${response.status}):`, errorText) + } + + // Always try localStorage as a fallback + console.log("Attempting to fall back to localStorage") + try { + loadFromLocalStorage() + return + } catch (localStorageError) { + // If both API and localStorage fail, throw a more descriptive error + throw new Error(`Graph data not found (ID: ${graphId}). The data may have been lost due to server restart or session expiration.`) + } + } + } catch (apiError) { + console.error("Error accessing API:", apiError) + // Fall back to localStorage + console.log("Falling back to localStorage due to API error") + try { + loadFromLocalStorage() + return + } catch (localStorageError) { + throw new Error(`Could not load graph data: ${apiError}. Local storage fallback also failed.`) + } + } + } else { + // No graph ID, try localStorage + console.log("No graph ID found, trying localStorage") + loadFromLocalStorage() + return + } + } catch (error) { + console.error("Error loading graph data:", error) + setError(error instanceof Error ? 
error.message : "Unknown error loading graph data") + setTriples([]) + } finally { + setLoading(false) + } + } + + const loadFromLocalStorage = () => { + try { + // Check if localStorage is available + if (typeof window === 'undefined' || !window.localStorage) { + throw new Error("LocalStorage is not available in this browser") + } + + // Check URL parameters + const params = new URLSearchParams(window.location.search) + const timestamp = params.get("ts") + + // Try timestamped version first if timestamp is provided + let storedTriples = null + let storedDocName = null + + if (timestamp) { + console.log(`Looking for timestamped data with ts=${timestamp}`) + storedTriples = localStorage.getItem(`graphTriples_${timestamp}`) + storedDocName = localStorage.getItem(`graphDocumentName_${timestamp}`) + } + + // Fall back to non-timestamped version if timestamped version not found + if (!storedTriples) { + console.log("Timestamped data not found or no timestamp provided, falling back to default keys") + storedTriples = localStorage.getItem("graphTriples") + storedDocName = localStorage.getItem("graphDocumentName") + } + + if (!storedTriples) { + console.warn("No triples data found in localStorage") + setTriples([]) + setError("No graph data found in localStorage. Please return to the application and create a new graph.") + return + } + + try { + const parsedTriples = JSON.parse(storedTriples) + + if (!Array.isArray(parsedTriples)) { + setTriples([]) + throw new Error("Invalid graph data format in localStorage") + } + + console.log(`Successfully parsed triples from localStorage: ${parsedTriples.length} items`) + setTriples(parsedTriples) + setDataSource("local") + + if (storedDocName) { + setDocumentName(storedDocName) + } else { + setDocumentName("Unnamed Document") + } + } catch (parseError) { + console.error("Error parsing JSON from localStorage:", parseError) + setTriples([]) + throw new Error("The stored graph data appears to be corrupted. 
Please return to the application and create a new graph.") + } + } catch (localStorageError) { + console.error("Error loading from localStorage:", localStorageError) + setTriples([]) + throw localStorageError instanceof Error + ? localStorageError + : new Error("Failed to load graph data from localStorage") + } + } + + loadGraphData().catch((err) => { + console.error("Unhandled error in loadGraphData:", err) + setError(err instanceof Error ? err.message : "Unknown error loading graph data") + setLoading(false) + }) + }, []) + + const handleBackClick = () => { + router.push("/") + } + + if (loading) { + return ( +
+
+
+ +
+

Loading graph data...

+
+
+ ) + } + + if (error || triples.length === 0) { + return ( +
+
+
+
+ +
+ txt2kg + + Knowledge Graph Visualization + +
+
+ +
+
+ +
+
+
+
+ +
+
+

Error Loading Graph Data

+

{error || "No graph data found"}

+

+ This could be due to browser storage limitations or a missing graph ID. +

+
+
+
+ +
+

Alternative Methods

+ +
+
+

+ + Method 1: Return to Main App +

+

Go back to the main application and try opening the graph again.

+ +
+ +
+

+ + Method 2: Manual Data Input +

+ +
+
+
+
+
+ ) + } + + return ( +
+
+
+
+ +
+ txt2kg + + Knowledge Graph Visualization + +
+
+ +
+
+ +
+
+

+ + Knowledge Graph: + {documentName} +

+ +
+ {dataSource === "neo4j" ? "Data from Neo4j" : + dataSource === "api" ? "Data from API" : "Data from Local Storage"} +
+
+ +
+ +
+
+
+ ) +} + diff --git a/nvidia/txt2kg/assets/frontend/app/graph3d/page.tsx b/nvidia/txt2kg/assets/frontend/app/graph3d/page.tsx new file mode 100644 index 0000000..2e6dfc2 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/graph3d/page.tsx @@ -0,0 +1,701 @@ +"use client" + +/** + * 3D Graph Visualization Page + * + * This page provides a 3D visualization of knowledge graphs with multiple data source options: + * + * Usage: + * 1. Stored triples: /graph3d?source=stored - Uses triples from the graph database (ArangoDB/Neo4j) + * 2. URL triples: /graph3d?triples=[...] - Uses triples passed directly in URL parameters + * 3. localStorage: /graph3d?storageId=xyz - Uses triples from browser localStorage + * 4. Sample data: /graph3d - Uses built-in sample data when no other source is available + * + * Additional parameters: + * - layout: force|hierarchical|radial - Sets the graph layout type + * - highlightedNodes: JSON array of node names to highlight + * + * Examples: + * - /graph3d?source=stored&layout=force + * - /graph3d?source=local&triples=[{"subject":"A","predicate":"relates_to","object":"B"}] + */ + +import { useEffect, useState, useCallback } from "react" +import dynamic from "next/dynamic" +import { Button } from "@/components/ui/button" +import { Card, CardContent, CardDescription, CardHeader, CardTitle } from "@/components/ui/card" +import { Badge } from "@/components/ui/badge" +import { Switch } from "@/components/ui/switch" +import { Alert, AlertDescription } from "@/components/ui/alert" +import { Select, SelectContent, SelectItem, SelectTrigger, SelectValue } from "@/components/ui/select" +import { Input } from "@/components/ui/input" +import { Label } from "@/components/ui/label" +import { Slider } from "@/components/ui/slider" +import { Separator } from "@/components/ui/separator" +import { Collapsible, CollapsibleContent, CollapsibleTrigger } from "@/components/ui/collapsible" +import { Loader2, Cpu, Monitor, Settings, Brain, Layers, Zap, ChevronDown, 
ChevronRight } from "lucide-react" +import { useToast } from "@/hooks/use-toast" + +// Dynamically import the ForceGraphWrapper component with SSR disabled +const ForceGraphWrapper = dynamic( + () => import("@/components/force-graph-wrapper").then(mod => mod.ForceGraphWrapper), + { ssr: false } +) + +// Dynamically import the WebGPU 3D Viewer component with SSR disabled +const WebGPU3DViewer = dynamic( + () => import("@/components/webgpu-3d-viewer").then(mod => mod.WebGPU3DViewer), + { ssr: false } +) + +interface PerformanceMetrics { + renderingTime: number + clusteringTime?: number + totalNodes: number + totalLinks: number + memoryUsage?: number +} + +export default function Graph3DPage() { + const [graphData, setGraphData] = useState(null) + const [isLoading, setIsLoading] = useState(true) + const [error, setError] = useState(null) + const [debugInfo, setDebugInfo] = useState("") + const [highlightedNodes, setHighlightedNodes] = useState([]) + const [layoutType, setLayoutType] = useState("3d") + const [useEnhancedWebGPU, setUseEnhancedWebGPU] = useState(false) + const [enableClustering, setEnableClustering] = useState(true) + const [enableClusterColors, setEnableClusterColors] = useState(false) + const [performanceMetrics, setPerformanceMetrics] = useState(null) + const [showClusteringControls, setShowClusteringControls] = useState(false) + const [clusteringOptionsExpanded, setClusteringOptionsExpanded] = useState(false) + + // Semantic clustering options + const [clusteringMethod, setClusteringMethod] = useState("hybrid") + const [semanticAlgorithm, setSemanticAlgorithm] = useState("hierarchical") + const [numberOfClusters, setNumberOfClusters] = useState(null) + const [similarityThreshold, setSimilarityThreshold] = useState(0.7) + const [nameWeight, setNameWeight] = useState(0.6) + const [contentWeight, setContentWeight] = useState(0.3) + const [spatialWeight, setSpatialWeight] = useState(0.1) + + const { toast } = useToast() + + // Handle clustering 
performance updates + const handleClusteringUpdate = useCallback((metrics: PerformanceMetrics) => { + setPerformanceMetrics(metrics) + if (metrics.clusteringTime && !useEnhancedWebGPU) { + toast({ + title: "Hybrid GPU/CPU Clustering Complete", + description: `Server GPU processed ${metrics.totalNodes.toLocaleString()} nodes in ${metrics.clusteringTime.toFixed(0)}ms`, + }) + } + }, [toast, useEnhancedWebGPU]) + + // Update performance metrics when graph data changes + useEffect(() => { + if (graphData) { + const nodeCount = graphData.nodes?.length || 0 + const linkCount = graphData.links?.length || 0 + const tripleCount = graphData.triples?.length || 0 + + if (nodeCount > 0 || tripleCount > 0) { + setPerformanceMetrics({ + renderingTime: 0, + totalNodes: nodeCount || Math.ceil(tripleCount * 0.6), // Estimate nodes from triples + totalLinks: linkCount || Math.ceil(tripleCount * 0.8), // Estimate links from triples + }) + } + } + }, [graphData]) + + useEffect(() => { + // Fetch graph data + const fetchGraphData = async () => { + try { + setIsLoading(true) + + // Check URL parameters + const params = new URLSearchParams(window.location.search) + const graphId = params.get("id") + const triplesParam = params.get("triples") + const layoutParam = params.get("layout") + const highlightedNodesParam = params.get("highlightedNodes") + const storageId = params.get("storageId") + const source = params.get("source") + + // Set layout type from URL parameter + if (layoutParam) { + setLayoutType(layoutParam) + console.log("Layout type set from URL:", layoutParam) + } + + // Set highlighted nodes from URL parameter + if (highlightedNodesParam) { + try { + const parsedHighlightedNodes = JSON.parse(decodeURIComponent(highlightedNodesParam)) + if (Array.isArray(parsedHighlightedNodes)) { + setHighlightedNodes(parsedHighlightedNodes) + console.log("Highlighted nodes set from URL:", parsedHighlightedNodes) + } + } catch (parseError) { + console.error("Failed to parse highlightedNodes 
from URL:", parseError) + } + } + + console.log("URL parameters:", { + graphId: graphId || "not provided", + hasTriples: !!triplesParam, + hasStorageId: !!storageId, + layout: layoutParam || "default", + highlightedNodes: highlightedNodesParam ? "provided" : "not provided", + source: source || "auto", + allParams: Object.fromEntries(params.entries()) + }); + + // Try to load from localStorage if storageId is provided + if (storageId) { + try { + console.log("Found storageId in URL, attempting to retrieve data from localStorage:", storageId); + const storedData = localStorage.getItem(storageId); + + if (!storedData) { + console.error("No data found in localStorage for storageId:", storageId); + setError("Could not find the graph data in your browser storage. It may have expired."); + setIsLoading(false); + return; + } + + const triples = JSON.parse(storedData); + console.log("Successfully retrieved triples from localStorage:", { + count: triples.length, + sample: triples.slice(0, 2) + }); + + setGraphData({ triples }); + // setDebugInfo(`Using ${triples.length} triples from browser storage (ID: ${storageId})`); + setIsLoading(false); + + // Clean up localStorage after retrieval to prevent buildup + // Only do this for older IDs to prevent issues with multiple tabs/windows + const currentTime = Date.now(); + const idTimestamp = parseInt(storageId.split('_')[1] || '0', 10); + + // If the ID is older than 5 minutes, clean it up + if (currentTime - idTimestamp > 5 * 60 * 1000) { + console.log("Cleaning up old localStorage entry:", storageId); + localStorage.removeItem(storageId); + } + + return; + } catch (storageError) { + console.error("Error retrieving data from localStorage:", storageError); + setDebugInfo("Failed to retrieve triples from browser storage, falling back to API"); + // Continue to other methods if parsing fails + } + } + + // If we have triples passed directly in the URL param + if (triplesParam) { + try { + console.log("Found triples data in URL 
parameter, attempting to parse") + const triples = JSON.parse(decodeURIComponent(triplesParam)) + console.log("Successfully parsed triples from URL:", { + count: triples.length, + sample: triples.slice(0, 2) + }); + setGraphData({ triples }) + setDebugInfo("Using triples data from URL parameter") + setIsLoading(false) + return + } catch (parseError) { + console.error("Error parsing triples from URL:", parseError) + setDebugInfo("Failed to parse triples from URL, falling back to API") + // Continue to other methods if parsing fails + } + } + + // Determine data source based on URL parameters + let endpoint: string; + let useStoredTriples = false; + + if (graphId) { + endpoint = `/api/graph-data?id=${graphId}`; + } else if (source === 'stored' || (!triplesParam && !storageId)) { + // Use stored triples if explicitly requested or if no other data source is available + endpoint = '/api/graph-db/triples'; + useStoredTriples = true; + } else { + // Fall back to sample data + endpoint = '/api/graph-data'; + } + + console.log(`Fetching graph data from API: ${endpoint}`); + setDebugInfo(`Fetching from ${endpoint}`) + + const response = await fetch(endpoint) + + if (!response.ok) { + console.error(`API responded with status ${response.status}: ${response.statusText}`) + + // If we were trying to fetch stored triples and it failed, fall back to sample data + if (useStoredTriples) { + console.log("Stored triples failed, falling back to sample graph data"); + setDebugInfo("No stored triples available, using sample data"); + const fallbackResponse = await fetch('/api/graph-data'); + if (fallbackResponse.ok) { + const fallbackData = await fallbackResponse.json(); + setGraphData(fallbackData); + setIsLoading(false); + return; + } + } + + setDebugInfo(`API error: ${response.status} ${response.statusText}`) + throw new Error(`Error fetching graph data: ${response.statusText}`) + } + + const data = await response.json() + console.log("API response received:", { + dataExists: !!data, 
+ hasNodes: data && Array.isArray(data.nodes), + hasLinks: data && Array.isArray(data.links), + hasTriples: data && Array.isArray(data.triples), + nodeCount: data && Array.isArray(data.nodes) ? data.nodes.length : 0, + linkCount: data && Array.isArray(data.links) ? data.links.length : 0, + tripleCount: data && Array.isArray(data.triples) ? data.triples.length : 0, + dataType: typeof data, + keys: data ? Object.keys(data) : [], + rawData: JSON.stringify(data).substring(0, 200) + "..." + }); + + // Validate the data structure - can be either nodes/links or triples format + if (!data) { + setDebugInfo("API returned empty data") + throw new Error('No data received from API'); + } + + // Handle stored triples response format + if (useStoredTriples && data.triples && Array.isArray(data.triples)) { + console.log("Processing stored triples from graph database"); + setGraphData({ triples: data.triples }); + setDebugInfo(`Using ${data.triples.length} stored triples from ${data.databaseType || 'graph database'}`); + setIsLoading(false); + return; + } + + if ((!Array.isArray(data.nodes) || !Array.isArray(data.links)) && + !Array.isArray(data.triples)) { + setDebugInfo(`Invalid data format: ${Object.keys(data).join(", ")}`) + throw new Error('Invalid graph data structure: missing required data arrays'); + } + + if (Array.isArray(data.triples)) { + setDebugInfo(`Using triples data (${data.triples.length} triples) from API`) + } else if (Array.isArray(data.nodes) && Array.isArray(data.links)) { + setDebugInfo(`Using nodes/links data (${data.nodes.length} nodes, ${data.links.length} links) from API`) + } + + console.log("Setting graph data in state..."); + setGraphData(data) + setIsLoading(false) + } catch (err) { + console.error('Failed to load graph data:', err) + setError(`Failed to load graph data: ${err instanceof Error ? 
err.message : String(err)}`) + setIsLoading(false) + } + } + + fetchGraphData() + }, []) + + useEffect(() => { + // Add overflow: hidden to the body element when the component mounts + document.body.style.overflow = "hidden" + + // Clean up the effect when the component unmounts + return () => { + document.body.style.overflow = "auto" + } + }, []) + + // Display error or loading state + if (error || isLoading) { + return ( +
+ {isLoading && ( +
+

Loading graph data...

+
+ {debugInfo && ( +

{debugInfo}

+ )} +
+ )} + + {error && ( +
+

{error}

+ {debugInfo && ( +

{debugInfo}

+ )} +
+ + +
+
+ )} +
+ ) + } + + // Only render the graph when data is ready + return ( +
+ {graphData && ( + <> + {/* Controls Panel */} +
+ {/* Main Controls Row */} +
+ {/*
+ {graphData.nodes && graphData.links ? ( + `Rendering graph with ${graphData.nodes.length || 0} nodes and ${graphData.links.length || 0} links` + ) : graphData.triples ? ( + `Rendering graph from ${graphData.triples.length || 0} triples` + ) : ( + "Rendering graph data" + )} +
*/} + + {/* WebGPU Mode Toggle */} + {/* */} + + {/* Clustering Controls Toggle */} + +
+ + {/* Debug Info */} + {debugInfo && ( +
{debugInfo}
+ )} + + {/* Enhanced Clustering Controls Panel */} + {showClusteringControls && ( + + + + + Smart Clustering Controls + + + Advanced semantic and spatial clustering options + + + + {/* Enable Clustering Toggle */} +
+
+ +

+ GPU-accelerated graph clustering +

+
+ +
+ + {enableClustering && ( + <> + + + {/* Collapsible Clustering Method Options */} + + +
+ + Clustering Options +
+ {clusteringOptionsExpanded ? ( + + ) : ( + + )} +
+ + + {/* Clustering Method Selection */} +
+ + +

+ {clusteringMethod === "spatial" && "Groups nodes by 3D coordinates"} + {clusteringMethod === "semantic" && "Groups nodes by name/content similarity"} + {clusteringMethod === "hybrid" && "Combines semantic and spatial features"} +

+
+ + {/* Algorithm Selection */} +
+ + +
+ + {/* Number of Clusters (for K-means and Hierarchical) */} + {(semanticAlgorithm === "kmeans" || semanticAlgorithm === "hierarchical") && ( +
+ + setNumberOfClusters(e.target.value ? parseInt(e.target.value) : null)} + placeholder="Auto" + className="bg-gray-800 border-gray-600 text-white" + min="2" + max="50" + /> +

Leave empty for automatic selection

+
+ )} + + {/* Similarity Threshold (for DBSCAN) */} + {semanticAlgorithm === "dbscan" && ( +
+ + setSimilarityThreshold(value[0])} + min={0.1} + max={1.0} + step={0.05} + className="w-full" + /> +

+ {similarityThreshold.toFixed(2)} - Higher values create fewer, tighter clusters +

+
+ )} + + {/* Hybrid Weights (for hybrid method) */} + {clusteringMethod === "hybrid" && ( +
+ + +
+
+ Name Similarity + {nameWeight.toFixed(1)} +
+ setNameWeight(value[0])} + min={0} + max={1} + step={0.1} + className="w-full" + /> +
+ +
+
+ Content Similarity + {contentWeight.toFixed(1)} +
+ setContentWeight(value[0])} + min={0} + max={1} + step={0.1} + className="w-full" + /> +
+ +
+
+ Spatial Distance + {spatialWeight.toFixed(1)} +
+ setSpatialWeight(value[0])} + min={0} + max={1} + step={0.1} + className="w-full" + /> +
+ +

+ Total: {(nameWeight + contentWeight + spatialWeight).toFixed(1)} +

+
+ )} +
+
+ + + + {/* Cluster Colors Toggle */} +
+
+ +

+ Color nodes by cluster assignment +

+
+ +
+ + )} + + {/* Performance Metrics */} + {performanceMetrics && ( + <> + +
+ +
+ + {performanceMetrics.totalNodes.toLocaleString()} nodes + + + {performanceMetrics.totalLinks.toLocaleString()} links + + {performanceMetrics.clusteringTime && ( + + {performanceMetrics.clusteringTime.toFixed(0)}ms cluster + + )} + + {performanceMetrics.renderingTime.toFixed(0)}ms render + +
+
+ + )} + + {/* Clustering Status */} + {enableClustering && ( + + + + {clusteringMethod === "spatial" && "Using spatial coordinate clustering"} + {clusteringMethod === "semantic" && "Using semantic name/content clustering"} + {clusteringMethod === "hybrid" && "Using hybrid semantic + spatial clustering"} + {" with "} + {semanticAlgorithm} algorithm + + + )} +
+
+ )} +
+ {((graphData.nodes && graphData.links && graphData.nodes.length > 0) || + (graphData.triples && graphData.triples.length > 0)) ? ( + useEnhancedWebGPU ? ( + { + console.error("Error from WebGPU3DViewer:", err); + setError(`Error in enhanced 3D renderer: ${err}`); + setDebugInfo(`Enhanced renderer error: ${err}`); + }} + /> + ) : ( + { + console.error("Error from ForceGraphWrapper:", err); + setError(`Error in graph renderer: ${err.message}`); + setDebugInfo(`Renderer error: ${err.message}`); + }} + /> + ) + ) : ( +
+
+

Unable to render graph - invalid data structure

+

+ The graph data must contain either nodes and links arrays or a triples array +

+
+
+ )} + + )} +
+ ) +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/layout.tsx b/nvidia/txt2kg/assets/frontend/app/layout.tsx new file mode 100644 index 0000000..1de1542 --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/layout.tsx @@ -0,0 +1,72 @@ +import type React from "react" +import type { Metadata } from "next" +import { Inter } from "next/font/google" +import "./globals.css" +import { ThemeProvider } from "@/components/theme-provider" +import { DocumentProvider } from "@/contexts/document-context" +import { ClientInitializer } from "@/components/client-init" +import Link from "next/link" +import { Search as SearchIcon } from "lucide-react" +import { NvidiaIcon } from "@/components/nvidia-icon" +import { ThemeToggle } from "@/components/theme-toggle" +import { InfoModal } from "@/components/info-modal" +import { SettingsModal } from "@/components/settings-modal" +import { Toaster } from "@/components/ui/toaster" + +const inter = Inter({ + subsets: ["latin"], + variable: "--font-inter", + display: "swap", +}) + +export const metadata: Metadata = { + title: "txt2kg | NVIDIA Knowledge Graph Builder", + description: "Convert text documents to knowledge graphs using NVIDIA AI", + generator: 'v0.dev' +} + +export default function RootLayout({ + children, +}: Readonly<{ + children: React.ReactNode +}>) { + return ( + + + + + + {/* Modern Gradient Header */} +
+
+
+ +
+ txt2kg + + Powered by NVIDIA AI + +
+
+
+ + + RAG Search + + + + +
+
+
+ {children} + +
+
+ + + ) +} \ No newline at end of file diff --git a/nvidia/txt2kg/assets/frontend/app/page.tsx b/nvidia/txt2kg/assets/frontend/app/page.tsx new file mode 100644 index 0000000..741415e --- /dev/null +++ b/nvidia/txt2kg/assets/frontend/app/page.tsx @@ -0,0 +1,118 @@ +"use client" + +import { useState, useEffect } from "react" +import { ApiKeyPrompt } from "@/components/api-key-prompt" +import { Upload, Zap, Edit, Network } from "lucide-react" +import { Tabs, TabsContent, TabsList, TabsTrigger } from "@/components/ui/tabs" +import React from "react" +import { UploadTab } from "@/components/tabs/UploadTab" +import { ConfigureTab } from "@/components/tabs/ConfigureTab" +import { EditTab } from "@/components/tabs/EditTab" +import { VisualizeTab } from "@/components/tabs/VisualizeTab" + +// Add global styles for dropdown visibility +const globalStyles = ` + .model-dropdown { + position: relative; + z-index: 9999; + } + .model-dropdown-menu { + z-index: 9999; + } +`; + +export default function Home() { + // Track active tab for animation + const [activeTab, setActiveTab] = useState("upload"); + const steps = [ + { value: "upload", label: "Upload", Icon: Upload }, + { value: "configure", label: "Process Documents", Icon: Zap }, + { value: "edit", label: "Edit Knowledge Graph", Icon: Edit }, + { value: "visualize", label: "Visualize Graph", Icon: Network }, + ] as const; + const activeIndex = Math.max(0, steps.findIndex(s => s.value === activeTab)); + + // Updated to use callback reference + const handleTabChange = React.useCallback((tab: string) => { + const tabElement = document.querySelector(`[data-value="${tab}"]`) + if (tabElement && 'click' in tabElement) { + (tabElement as HTMLElement).click() + } + }, []); + + // Handle tab selection based on URL hash + useEffect(() => { + const handleHashChange = () => { + const hash = window.location.hash.replace('#', ''); + if (['upload', 'configure', 'edit', 'visualize'].includes(hash)) { + handleTabChange(hash); + 
setActiveTab(hash); + } + } + + // Set hash on initial load if not present + if (!window.location.hash) { + window.location.hash = 'upload'; + } else { + handleHashChange(); + } + + window.addEventListener('hashchange', handleHashChange); + return () => window.removeEventListener('hashchange', handleHashChange); + }, [handleTabChange]); + + return ( +
+ {/* Add style element for global styles */} +