mirror of
https://github.com/NVIDIA/dgx-spark-playbooks.git
synced 2026-06-24 07:09:31 +00:00
Compare commits
No commits in common. "6a749bdcb0399d6c78a27f193bb902a28ea73006" and "b8cc262bede8d74a3c27fe5c07221b4a60b3540d" have entirely different histories.
6a749bdcb0
...
b8cc262bed
@ -125,23 +125,12 @@ Write a short README checklist for a Python project.
|
|||||||
|
|
||||||
Expected output should show the model responding in the terminal. When you are done, type `/bye` or press `Ctrl+D` to exit the interactive session before continuing.
|
Expected output should show the model responding in the terminal. When you are done, type `/bye` or press `Ctrl+D` to exit the interactive session before continuing.
|
||||||
|
|
||||||
## Step 5. Install and launch Claude Code with Ollama
|
## Step 5. Launch Claude Code with Ollama
|
||||||
|
|
||||||
**Description**: Install [Claude Code](https://docs.claude.com/en/docs/claude-code), then use Ollama's built-in [launch method](https://ollama.com/blog/launch) to start Claude Code against your local model. No environment variables or config files are required.
|
**Description**: Use Ollama's built-in [launch method](https://ollama.com/blog/launch) to start [Claude Code](https://docs.claude.com/en/docs/claude-code) against your local model. No environment variables or config files are required.
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
curl -fsSL https://claude.ai/install.sh | bash
|
ollama launch claude
|
||||||
claude --version
|
|
||||||
```
|
|
||||||
|
|
||||||
If Claude Code is already installed, just verify the version:
|
|
||||||
|
|
||||||
```bash
|
|
||||||
claude --version
|
|
||||||
```
|
|
||||||
|
|
||||||
```bash
|
|
||||||
ollama launch claude --model qwen3.6
|
|
||||||
```
|
```
|
||||||
|
|
||||||
Expected output should show Claude Code starting and using the local Qwen3.6 model. Qwen3.6 ships with a 256K context window by default; adjust context length through Ollama's settings if you need to tune it further.
|
Expected output should show Claude Code starting and using the local Qwen3.6 model. Qwen3.6 ships with a 256K context window by default; adjust context length through Ollama's settings if you need to tune it further.
|
||||||
@ -161,8 +150,6 @@ printf 'import math_utils\n\n\ndef test_add():\n assert math_utils.add(1, 2)
|
|||||||
If you do not already have pytest installed:
|
If you do not already have pytest installed:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python3 -m venv .venv
|
|
||||||
source .venv/bin/activate
|
|
||||||
python3 -m pip install -U pytest
|
python3 -m pip install -U pytest
|
||||||
```
|
```
|
||||||
|
|
||||||
@ -178,7 +165,7 @@ Run the test:
|
|||||||
python3 -m pytest -q
|
python3 -m pytest -q
|
||||||
```
|
```
|
||||||
|
|
||||||
Expected output should show the test passing. When you are done, run `deactivate` to exit the virtual environment.
|
Expected output should show the test passing.
|
||||||
|
|
||||||
## Step 7. Cleanup and rollback
|
## Step 7. Cleanup and rollback
|
||||||
|
|
||||||
@ -272,7 +259,7 @@ Expected output should show the model responding. When you are done, type `/bye`
|
|||||||
**Description**: Use Ollama's built-in [launch method](https://ollama.com/blog/launch) to start [OpenCode](https://opencode.ai) against your local model. No [`opencode.json`](https://opencode.ai/docs/config/) provider configuration is required.
|
**Description**: Use Ollama's built-in [launch method](https://ollama.com/blog/launch) to start [OpenCode](https://opencode.ai) against your local model. No [`opencode.json`](https://opencode.ai/docs/config/) provider configuration is required.
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
ollama launch opencode --model qwen3.6
|
ollama launch opencode
|
||||||
```
|
```
|
||||||
|
|
||||||
If you want to pre-configure OpenCode without launching immediately:
|
If you want to pre-configure OpenCode without launching immediately:
|
||||||
@ -298,8 +285,6 @@ printf 'import math_utils\n\n\ndef test_add():\n assert math_utils.add(1, 2)
|
|||||||
If you do not already have pytest installed:
|
If you do not already have pytest installed:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python3 -m venv .venv
|
|
||||||
source .venv/bin/activate
|
|
||||||
python3 -m pip install -U pytest
|
python3 -m pip install -U pytest
|
||||||
```
|
```
|
||||||
|
|
||||||
@ -315,7 +300,7 @@ Run the test:
|
|||||||
python3 -m pytest -q
|
python3 -m pytest -q
|
||||||
```
|
```
|
||||||
|
|
||||||
Expected output should show the test passing. When you are done, run `deactivate` to exit the virtual environment.
|
Expected output should show the test passing.
|
||||||
|
|
||||||
## Step 7. Cleanup and rollback
|
## Step 7. Cleanup and rollback
|
||||||
|
|
||||||
@ -409,7 +394,7 @@ Expected output should show the model responding. When you are done, type `/bye`
|
|||||||
**Description**: Use Ollama's built-in [launch method](https://ollama.com/blog/launch) to start [Codex CLI](https://github.com/openai/codex) against your local model. No `~/.codex/config.toml` and no manual `npm install -g @openai/codex` are required — Ollama handles the Codex integration.
|
**Description**: Use Ollama's built-in [launch method](https://ollama.com/blog/launch) to start [Codex CLI](https://github.com/openai/codex) against your local model. No `~/.codex/config.toml` and no manual `npm install -g @openai/codex` are required — Ollama handles the Codex integration.
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
ollama launch codex --model qwen3.6
|
ollama launch codex
|
||||||
```
|
```
|
||||||
|
|
||||||
Expected output should show Codex CLI starting with Ollama as the provider and Qwen3.6 as the model. Qwen3.6 ships with a 256K context window by default, which is well suited to Codex's agentic workflows.
|
Expected output should show Codex CLI starting with Ollama as the provider and Qwen3.6 as the model. Qwen3.6 ships with a 256K context window by default, which is well suited to Codex's agentic workflows.
|
||||||
@ -429,8 +414,6 @@ printf 'import math_utils\n\n\ndef test_add():\n assert math_utils.add(1, 2)
|
|||||||
If you do not already have pytest installed:
|
If you do not already have pytest installed:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python3 -m venv .venv
|
|
||||||
source .venv/bin/activate
|
|
||||||
python3 -m pip install -U pytest
|
python3 -m pip install -U pytest
|
||||||
```
|
```
|
||||||
|
|
||||||
@ -446,7 +429,7 @@ Run the test:
|
|||||||
python3 -m pytest -q
|
python3 -m pytest -q
|
||||||
```
|
```
|
||||||
|
|
||||||
Expected output should show the test passing. When you are done, run `deactivate` to exit the virtual environment.
|
Expected output should show the test passing.
|
||||||
|
|
||||||
## Step 7. Cleanup and rollback
|
## Step 7. Cleanup and rollback
|
||||||
|
|
||||||
@ -482,10 +465,6 @@ ollama rm qwen3.6
|
|||||||
| `connection refused` to localhost:11434 | Ollama service not running | Start with `ollama serve` or `sudo systemctl start ollama` |
|
| `connection refused` to localhost:11434 | Ollama service not running | Start with `ollama serve` or `sudo systemctl start ollama` |
|
||||||
| `ollama launch <agent>` exits immediately | Agent integration failed to initialize | Re-run `ollama launch <agent>`; if it persists, check `journalctl -u ollama` |
|
| `ollama launch <agent>` exits immediately | Agent integration failed to initialize | Re-run `ollama launch <agent>`; if it persists, check `journalctl -u ollama` |
|
||||||
| Slow responses or OOM errors | Model variant too large for GPU memory | Switch to `qwen3.6:35b-a3b-nvfp4` or close other GPU workloads |
|
| Slow responses or OOM errors | Model variant too large for GPU memory | Switch to `qwen3.6:35b-a3b-nvfp4` or close other GPU workloads |
|
||||||
| `python3 -m pip install -U pytest` reports `externally-managed-environment` | Ubuntu 24.04 protects the system Python environment | Create and activate a virtual environment first: `python3 -m venv .venv && source .venv/bin/activate` |
|
|
||||||
| `ollama pull` reports that a model tag is a sharded GGUF | The selected model tag is not supported by Ollama | Use the Qwen3.6 commands in Step 3 instead of sharded GGUF tags |
|
|
||||||
| `ollama run` fails with `CUDA error: context is destroyed` on a multi-GPU system | Ollama is initializing across a mixed-GPU topology | Pin Ollama to one GPU. For a foreground test, run `CUDA_VISIBLE_DEVICES=0 ollama serve`; for a system service, add `Environment="CUDA_VISIBLE_DEVICES=0"` to an Ollama systemd drop-in and restart Ollama |
|
|
||||||
| A direct Claude Code setup using an Anthropic-compatible Ollama endpoint produces prose but does not edit files | Some model/server combinations do not emit tool calls reliably | Use `ollama launch claude` with Qwen3.6 as shown in this playbook |
|
|
||||||
|
|
||||||
> [!NOTE]
|
> [!NOTE]
|
||||||
> DGX Spark uses a Unified Memory Architecture (UMA), which enables dynamic memory sharing
|
> DGX Spark uses a Unified Memory Architecture (UMA), which enables dynamic memory sharing
|
||||||
|
|||||||
@ -241,9 +241,6 @@ sudo netplan apply
|
|||||||
|
|
||||||
#### Option 3: Manual IP Assignment with the netplan configuration file
|
#### Option 3: Manual IP Assignment with the netplan configuration file
|
||||||
|
|
||||||
> [!NOTE]
|
|
||||||
> `enp1s0f1np1` and `enP2p1s0f1np1` are assigned to **different subnets** (`192.168.100.x/24` and `192.168.101.x/24` respectively). This is required — assigning two distinct network interfaces to the same subnet causes networking and software conflicts (e.g., routing ambiguity and NCCL communication failures).
|
|
||||||
|
|
||||||
On node 1:
|
On node 1:
|
||||||
```bash
|
```bash
|
||||||
## Create the netplan configuration file
|
## Create the netplan configuration file
|
||||||
@ -253,11 +250,11 @@ network:
|
|||||||
ethernets:
|
ethernets:
|
||||||
enp1s0f1np1:
|
enp1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.100.1/24
|
- 192.168.100.10/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
enP2p1s0f1np1:
|
enP2p1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.101.1/24
|
- 192.168.100.11/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
EOF
|
EOF
|
||||||
|
|
||||||
@ -277,11 +274,11 @@ network:
|
|||||||
ethernets:
|
ethernets:
|
||||||
enp1s0f1np1:
|
enp1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.100.2/24
|
- 192.168.100.12/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
enP2p1s0f1np1:
|
enP2p1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.101.2/24
|
- 192.168.100.13/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
EOF
|
EOF
|
||||||
|
|
||||||
@ -301,11 +298,11 @@ network:
|
|||||||
ethernets:
|
ethernets:
|
||||||
enp1s0f1np1:
|
enp1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.100.3/24
|
- 192.168.100.14/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
enP2p1s0f1np1:
|
enP2p1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.101.3/24
|
- 192.168.100.15/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
EOF
|
EOF
|
||||||
|
|
||||||
@ -325,11 +322,11 @@ network:
|
|||||||
ethernets:
|
ethernets:
|
||||||
enp1s0f1np1:
|
enp1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.100.4/24
|
- 192.168.100.16/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
enP2p1s0f1np1:
|
enP2p1s0f1np1:
|
||||||
addresses:
|
addresses:
|
||||||
- 192.168.101.4/24
|
- 192.168.100.17/24
|
||||||
dhcp4: no
|
dhcp4: no
|
||||||
EOF
|
EOF
|
||||||
|
|
||||||
|
|||||||
@ -51,10 +51,10 @@ You will deploy NVIDIA's VSS AI Blueprint on NVIDIA Spark hardware with Blackwel
|
|||||||
* Container startup can be resource-intensive and time-consuming with large model downloads
|
* Container startup can be resource-intensive and time-consuming with large model downloads
|
||||||
* Network configuration conflicts if shared network already exists
|
* Network configuration conflicts if shared network already exists
|
||||||
* Remote API endpoints may have rate limits or connectivity issues (hybrid deployment)
|
* Remote API endpoints may have rate limits or connectivity issues (hybrid deployment)
|
||||||
* **Rollback:** Stop all containers with `deploy/docker/scripts/dev-profile.sh down`
|
* **Rollback:** Stop all containers with `scripts/dev-profile.sh down`
|
||||||
* **Last Updated:** 06/17/2026
|
* **Last Updated:** 3/16/2026
|
||||||
* Update required OS and Driver versions
|
* Update required OS and Driver versions
|
||||||
* Support for VSS 3.2.0 with Cosmos Reason 2 VLM
|
* Support for VSS 3.1.0 with Cosmos Reason 2 VLM
|
||||||
|
|
||||||
## Instructions
|
## Instructions
|
||||||
|
|
||||||
@ -65,7 +65,7 @@ Check that your system meets the hardware and software [prerequisites](https://d
|
|||||||
```bash
|
```bash
|
||||||
## Verify driver version
|
## Verify driver version
|
||||||
nvidia-smi | grep "Driver Version"
|
nvidia-smi | grep "Driver Version"
|
||||||
## Expected output: Driver Version: 580.95.05 or higher
|
## Expected output: Driver Version: 580.126.09 or higher
|
||||||
|
|
||||||
## Verify CUDA version
|
## Verify CUDA version
|
||||||
nvcc --version
|
nvcc --version
|
||||||
@ -106,19 +106,10 @@ sudo docker run --rm --runtime=nvidia --gpus all ubuntu nvidia-smi
|
|||||||
|
|
||||||
Clone the Video Search and Summarization repository from NVIDIA's public GitHub.
|
Clone the Video Search and Summarization repository from NVIDIA's public GitHub.
|
||||||
|
|
||||||
**Note** Install Git LFS if not already present on the system
|
|
||||||
|
|
||||||
```bash
|
|
||||||
sudo apt-get install -y git-lfs && git lfs install
|
|
||||||
```
|
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
## Clone the VSS AI Blueprint repository
|
## Clone the VSS AI Blueprint repository
|
||||||
git clone https://github.com/NVIDIA-AI-Blueprints/video-search-and-summarization.git
|
git clone https://github.com/NVIDIA-AI-Blueprints/video-search-and-summarization.git
|
||||||
cd video-search-and-summarization
|
cd video-search-and-summarization
|
||||||
git checkout tags/v3.2.0
|
|
||||||
git lfs install
|
|
||||||
git lfs pull
|
|
||||||
```
|
```
|
||||||
|
|
||||||
## Step 4. Run the cache cleaner script
|
## Step 4. Run the cache cleaner script
|
||||||
@ -157,11 +148,12 @@ sudo -b /usr/local/bin/sys-cache-cleaner.sh
|
|||||||
```
|
```
|
||||||
|
|
||||||
> [!NOTE]
|
> [!NOTE]
|
||||||
The above runs the cache cleaner in the current session only; it does not persist across reboots. To have the cache cleaner run across reboots, create a systemd service instead.
|
+> The above runs the cache cleaner in the current session only; it does not persist across reboots. To have the cache cleaner run across reboots, create a systemd service instead.
|
||||||
To stop the background cache cleaner:
|
+>
|
||||||
```bash
|
+> To stop the background cache cleaner:
|
||||||
sudo pkill -f sys-cache-cleaner.sh
|
+> ```bash
|
||||||
```
|
+> sudo pkill -f sys-cache-cleaner.sh
|
||||||
|
+> ```
|
||||||
|
|
||||||
|
|
||||||
## Step 5. Authenticate with NVIDIA Container Registry
|
## Step 5. Authenticate with NVIDIA Container Registry
|
||||||
@ -180,7 +172,7 @@ docker login nvcr.io
|
|||||||
|
|
||||||
## Step 6. Choose deployment scenario
|
## Step 6. Choose deployment scenario
|
||||||
|
|
||||||
Choose the deployment options based on your requirements:
|
Choose between two deployment options based on your requirements:
|
||||||
|
|
||||||
| Deployment Scenario | VLM (Cosmos-Reason2-8B)| LLM |
|
| Deployment Scenario | VLM (Cosmos-Reason2-8B)| LLM |
|
||||||
|-------------------------------------------|------------------------|-------------------------------|
|
|-------------------------------------------|------------------------|-------------------------------|
|
||||||
@ -210,21 +202,19 @@ In this hybrid deployment, we would use NIMs from [build.nvidia.com](https://bui
|
|||||||
|
|
||||||
```bash
|
```bash
|
||||||
## Start Standard VSS (Base)
|
## Start Standard VSS (Base)
|
||||||
## Set NGC CLI API key and Hugging Face token (required for VA-MCP)
|
|
||||||
export NGC_CLI_API_KEY='your_ngc_api_key'
|
export NGC_CLI_API_KEY='your_ngc_api_key'
|
||||||
export HF_TOKEN='hf_your_token_here'
|
|
||||||
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
|
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
|
||||||
deploy/docker/scripts/dev-profile.sh up -p base -H DGX-SPARK --use-remote-llm --llm <REMOTE LLM MODEL NAME>
|
scripts/dev-profile.sh up -p base -H DGX-SPARK --use-remote-llm --llm <REMOTE LLM MODEL NAME>
|
||||||
|
|
||||||
## Start Standard VSS (Alert Verification)
|
## Start Standard VSS (Alert Verification)
|
||||||
export NGC_CLI_API_KEY='your_ngc_api_key'
|
export NGC_CLI_API_KEY='your_ngc_api_key'
|
||||||
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
|
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
|
||||||
deploy/docker/scripts/dev-profile.sh up -p alerts -m verification -H DGX-SPARK --use-remote-llm --llm <REMOTE LLM MODEL NAME>
|
scripts/dev-profile.sh up -p alerts -m verification -H DGX-SPARK --use-remote-llm --llm <REMOTE LLM MODEL NAME>
|
||||||
|
|
||||||
## Start Standard VSS (Real-Time Alerts)
|
## Start Standard VSS (Real-Time Alerts)
|
||||||
export NGC_CLI_API_KEY='your_ngc_api_key'
|
export NGC_CLI_API_KEY='your_ngc_api_key'
|
||||||
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
|
export LLM_ENDPOINT_URL=https://your-llm-endpoint.com
|
||||||
deploy/docker/scripts/dev-profile.sh up -p alerts -m real-time -H DGX-SPARK --use-remote-llm --llm <REMOTE LLM MODEL NAME>
|
scripts/dev-profile.sh up -p alerts -m real-time -H DGX-SPARK --use-remote-llm --llm <REMOTE LLM MODEL NAME>
|
||||||
```
|
```
|
||||||
|
|
||||||
> [!NOTE]
|
> [!NOTE]
|
||||||
@ -236,11 +226,11 @@ deploy/docker/scripts/dev-profile.sh up -p alerts -m real-time -H DGX-SPARK --us
|
|||||||
> • **OPENAI_API_KEY** — (optional) For remote LLM/VLM endpoints that require it
|
> • **OPENAI_API_KEY** — (optional) For remote LLM/VLM endpoints that require it
|
||||||
> • **VLM_CUSTOM_WEIGHTS** — (optional) Absolute path to a custom weights directory
|
> • **VLM_CUSTOM_WEIGHTS** — (optional) Absolute path to a custom weights directory
|
||||||
>
|
>
|
||||||
> Pass these additional flags to **`deploy/docker/scripts/dev-profile.sh`** for remote LLM mode:
|
> Pass these additional flags to **`scripts/dev-profile.sh`** for remote LLM mode:
|
||||||
> • **`--use-remote-llm`** — (required) Use a remote LLM, the base URL is read from **`LLM_ENDPOINT_URL`** in the environment
|
> • **`--use-remote-llm`** — (required) Use a remote LLM, the base URL is read from **`LLM_ENDPOINT_URL`** in the environment
|
||||||
> • **`--llm`** — (required) Remote LLM model name (for example: `nvidia/nvidia-nemotron-nano-9b-v2`). **Strongly recommended** for alert workflows (verification and real-time): use `nvidia/nvidia-nemotron-nano-9b-v2`. Omitting `--llm` may cause the script to use whatever model is returned by the remote endpoint.
|
> • **`--llm`** — (required) Remote LLM model name (for example: `nvidia/nvidia-nemotron-nano-9b-v2`). **Strongly recommended** for alert workflows (verification and real-time): use `nvidia/nvidia-nemotron-nano-9b-v2`. Omitting `--llm` may cause the script to use whatever model is returned by the remote endpoint.
|
||||||
>
|
>
|
||||||
> Run **`deploy/docker/scripts/dev-profile.sh --help`** for a full list of supported arguments.
|
> Run **`scripts/dev-profile.sh -h`** for a full list of supported arguments.
|
||||||
|
|
||||||
|
|
||||||
**7.3 Validate Standard VSS deployment**
|
**7.3 Validate Standard VSS deployment**
|
||||||
@ -251,7 +241,7 @@ Access the VSS UI to confirm successful deployment.
|
|||||||
```bash
|
```bash
|
||||||
## Test Agent UI accessibility
|
## Test Agent UI accessibility
|
||||||
## If running locally on your Spark device, use localhost:
|
## If running locally on your Spark device, use localhost:
|
||||||
curl -I http://localhost:7777
|
curl -I http://localhost:3000
|
||||||
## Expected: HTTP 200 response
|
## Expected: HTTP 200 response
|
||||||
|
|
||||||
## If your Spark is running in Remote/Accessory mode, replace 'localhost' with the IP address or hostname of your Spark device.
|
## If your Spark is running in Remote/Accessory mode, replace 'localhost' with the IP address or hostname of your Spark device.
|
||||||
@ -260,23 +250,20 @@ hostname -I
|
|||||||
## Or to get the hostname:
|
## Or to get the hostname:
|
||||||
hostname
|
hostname
|
||||||
## Then test accessibility (replace <SPARK_IP_OR_HOSTNAME> with the actual value):
|
## Then test accessibility (replace <SPARK_IP_OR_HOSTNAME> with the actual value):
|
||||||
curl -I http://<SPARK_IP_OR_HOSTNAME>:7777
|
curl -I http://<SPARK_IP_OR_HOSTNAME>:3000
|
||||||
```
|
```
|
||||||
|
|
||||||
Open `http://localhost:7777` or `http://<SPARK_IP_OR_HOSTNAME>:7777` in your browser to access the Agent interface.
|
Open `http://localhost:3000` or `http://<SPARK_IP_OR_HOSTNAME>:3000` in your browser to access the Agent interface.
|
||||||
|
|
||||||
## Step 8. Test video processing workflow
|
## Step 8. Test video processing workflow
|
||||||
|
|
||||||
Run a basic test to verify the video analysis pipeline is functioning based on your deployment.
|
Run a basic test to verify the video analysis pipeline is functioning based on your deployment. The UI comes with a few example videos pre-populated for uploading and testing
|
||||||
|
|
||||||
**For Standard VSS deployment**
|
**For Standard VSS deployment**
|
||||||
|
|
||||||
Follow the steps [here](https://docs.nvidia.com/vss/latest/quickstart.html#deploy) to navigate VSS Agent UI.
|
Follow the steps [here](https://docs.nvidia.com/vss/latest/quickstart.html#deploy) to navigate VSS Agent UI.
|
||||||
- Access VSS Agent interface at `http://localhost:7777`
|
- Access VSS Agent interface at `http://localhost:3000`
|
||||||
- Download the sample data from NGC [here](https://docs.nvidia.com/vss/latest/quickstart.html#download-sample-data-from-ngc) and upload videos and test features
|
- Download the sample data from NGC [here](https://docs.nvidia.com/vss/latest/quickstart.html#download-sample-data-from-ngc) and upload videos and test features [here](https://docs.nvidia.com/vss/latest/quickstart.html#download-sample-data-from-ngc)
|
||||||
- Test Standard VSS deployment (Base) [here](https://docs.nvidia.com/vss/latest/quickstart.html#step-2-upload-a-video)
|
|
||||||
- Test Standard VSS deployment (Alert Verification) [here](https://docs.nvidia.com/vss/latest/agent-workflow-alert-verification.html#step-2-add-a-video-stream)
|
|
||||||
- Test Standard VSS deployment (Real-Time Alerts) [here](https://docs.nvidia.com/vss/latest/agent-workflow-rt-alert.html#step-2-add-a-video-stream)
|
|
||||||
|
|
||||||
|
|
||||||
## Step 9. Cleanup and rollback
|
## Step 9. Cleanup and rollback
|
||||||
@ -288,7 +275,7 @@ To completely remove the VSS deployment and free up system resources [Follow](ht
|
|||||||
|
|
||||||
```bash
|
```bash
|
||||||
## For Standard VSS deployment
|
## For Standard VSS deployment
|
||||||
deploy/docker/scripts/dev-profile.sh down
|
scripts/dev-profile.sh down
|
||||||
```
|
```
|
||||||
|
|
||||||
## Step 10. Next steps
|
## Step 10. Next steps
|
||||||
@ -296,8 +283,8 @@ deploy/docker/scripts/dev-profile.sh down
|
|||||||
With VSS deployed, you can now:
|
With VSS deployed, you can now:
|
||||||
|
|
||||||
**Standard VSS deployment:**
|
**Standard VSS deployment:**
|
||||||
- Access full VSS capabilities at port 7777
|
- Access full VSS capabilities at port 3000
|
||||||
- Test video and Q&A features
|
- Test video summarization and Q&A features
|
||||||
- Configure knowledge graphs and graph databases
|
- Configure knowledge graphs and graph databases
|
||||||
- Integrate with existing video processing workflows
|
- Integrate with existing video processing workflows
|
||||||
|
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user