
NemoClaw with Nemotron 3 Super and Telegram on DGX Spark

Install NemoClaw on DGX Spark with local Ollama inference and Telegram bot integration


Overview

Basic idea

NVIDIA NemoClaw is an open-source reference stack that makes running OpenClaw always-on assistants simpler and safer. It installs the NVIDIA OpenShell runtime -- an environment designed for executing agents with additional security -- and open-source models like NVIDIA Nemotron. A single installer command handles Node.js, OpenShell, and the NemoClaw CLI, then walks you through an onboard wizard to create a sandboxed agent on your DGX Spark using Ollama with Nemotron 3 Super.

By the end of this playbook you will have a working AI agent inside an OpenShell sandbox, accessible via a web dashboard and a Telegram bot, with inference routed to a local Nemotron 3 Super 120B model on your Spark -- all without exposing your host filesystem or network to the agent.

What you'll accomplish

  • Configure Docker and the NVIDIA container runtime for OpenShell on DGX Spark
  • Install Ollama, pull Nemotron 3 Super 120B, and configure it for sandbox access
  • Install NemoClaw with a single command (handles Node.js, OpenShell, and the CLI)
  • Run the onboard wizard to create a sandbox and configure local inference
  • Chat with the agent via the CLI, TUI, and web UI
  • Set up a Telegram bot that forwards messages to your sandboxed agent

Notice and disclaimers

The following sections describe safety, risks, and your responsibilities when running this demo.

Quick start safety check

Use only a clean environment. Run this demo on a fresh device or VM with no personal data, confidential information, or sensitive credentials, and keep it isolated as you would a sandbox.

By installing this demo, you accept responsibility for all third-party components, including reviewing their licenses, terms, and security posture. Review and accept those terms before you install or use the demo.

What you're getting

This experience is provided "AS IS" for demonstration purposes only -- no warranties, no guarantees. This is a demo, not a production-ready solution. You will need to implement appropriate security controls for your environment and use case.

Key risks with AI agents

  • Data leakage -- Any materials the agent accesses could be exposed, leaked, or stolen.
  • Malicious code execution -- The agent or its connected tools could expose your system to malicious code or cyber-attacks.
  • Unintended actions -- The agent might modify or delete files, send messages, or access services without explicit approval.
  • Prompt injection and manipulation -- External inputs or connected content could hijack the agent's behavior in unexpected ways.

Participant acknowledgement

By participating in this demo, you acknowledge that you are solely responsible for your configuration and for any data, accounts, and tools you connect. To the maximum extent permitted by law, NVIDIA is not responsible for any loss of data, device damage, security incidents, or other harm arising from your configuration or use of NemoClaw demo materials, including OpenClaw or any connected tools or services.

Isolation layers (OpenShell)

Layer What it protects When it applies
Filesystem Prevents reads/writes outside allowed paths. Locked at sandbox creation.
Network Blocks unauthorized outbound connections. Hot-reloadable at runtime.
Process Blocks privilege escalation and dangerous syscalls. Locked at sandbox creation.
Inference Reroutes model API calls to controlled backends. Hot-reloadable at runtime.

What to know before starting

  • Basic use of the Linux terminal and SSH
  • Familiarity with Docker (permissions, docker run)
  • Awareness of the security and risk sections above

Prerequisites

Hardware and access:

  • A DGX Spark (GB10) with keyboard and monitor, or SSH access
  • An NVIDIA API key from build.nvidia.com (needed for the Telegram bridge)
  • A Telegram bot token from @BotFather (create one with /newbot)

Software:

  • Fresh install of DGX OS with latest updates

Verify your system before starting:

head -n 2 /etc/os-release
nvidia-smi
docker info --format '{{.ServerVersion}}'

Expected: Ubuntu 24.04, NVIDIA GB10 GPU, Docker 28.x+.

Have ready before you begin

Item Where to get it
NVIDIA API key build.nvidia.com/settings/api-keys
Telegram bot token @BotFather on Telegram -- create with /newbot

Ancillary files

All required assets are handled by the NemoClaw installer. No manual cloning is needed.

Time and risk

  • Estimated time: 20--30 minutes (with Ollama and model already downloaded). First-time model download adds ~15--30 minutes depending on network speed.
  • Risk level: Medium -- you are running an AI agent in a sandbox; risks are reduced by isolation but not eliminated. Use a clean environment and do not connect sensitive data or production accounts.
  • Last Updated: 03/31/2026
    • First Publication

Instructions

Phase 1: Prerequisites

These steps prepare a fresh DGX Spark for NemoClaw. If Docker, the NVIDIA runtime, and Ollama are already configured, skip to Phase 2.

Step 1. Configure Docker and the NVIDIA container runtime

OpenShell's gateway runs k3s inside Docker. On DGX Spark (Ubuntu 24.04, cgroup v2), Docker must be configured with the NVIDIA runtime and host cgroup namespace mode.

Configure the NVIDIA container runtime for Docker:

sudo nvidia-ctk runtime configure --runtime=docker

Set the cgroup namespace mode required by OpenShell on DGX Spark:

sudo python3 -c "
import json, os
path = '/etc/docker/daemon.json'
d = json.load(open(path)) if os.path.exists(path) else {}
d['default-cgroupns-mode'] = 'host'
json.dump(d, open(path, 'w'), indent=2)
"

Restart Docker:

sudo systemctl restart docker

Verify the NVIDIA runtime works:

docker run --rm --runtime=nvidia --gpus all ubuntu nvidia-smi

If you get a permission denied error on docker, add your user to the Docker group and activate the new group in your current session:

sudo usermod -aG docker $USER
newgrp docker

This applies the group change immediately. Alternatively, you can log out and back in instead of running newgrp docker.
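
To confirm the group change took effect in your current shell:

id -nG | grep -w docker

If docker is printed, the group is active and plain docker commands will work without sudo.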

Note

DGX Spark uses cgroup v2. OpenShell's gateway embeds k3s inside Docker and needs host cgroup namespace access. Without default-cgroupns-mode: host, the gateway can fail with "Failed to start ContainerManager" errors.

Step 2. Install Ollama

Install Ollama:

curl -fsSL https://ollama.com/install.sh | sh

Configure Ollama to listen on all interfaces so the sandbox container can reach it:

sudo mkdir -p /etc/systemd/system/ollama.service.d
printf '[Service]\nEnvironment="OLLAMA_HOST=0.0.0.0"\n' | sudo tee /etc/systemd/system/ollama.service.d/override.conf
sudo systemctl daemon-reload
sudo systemctl restart ollama

Verify it is running and reachable on all interfaces:

curl http://0.0.0.0:11434

Expected: Ollama is running. If not, start it with sudo systemctl start ollama.
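
To double-check that the OLLAMA_HOST override took effect, confirm the listener is bound to all interfaces rather than loopback only:

ss -tln | grep 11434

Expected: a LISTEN entry on 0.0.0.0:11434 (or *:11434), not 127.0.0.1:11434.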

Important

Always start Ollama via systemd (sudo systemctl restart ollama) -- do not use ollama serve &. A manually started Ollama process does not pick up the OLLAMA_HOST=0.0.0.0 setting above, and the NemoClaw sandbox will not be able to reach the inference server.

Step 3. Pull the Nemotron 3 Super model

Download Nemotron 3 Super 120B (~87 GB; may take 15--30 minutes depending on network speed):

ollama pull nemotron-3-super:120b

Run it briefly to pre-load weights into memory (type /bye to exit):

ollama run nemotron-3-super:120b

Verify the model is available:

ollama list

You should see nemotron-3-super:120b in the output.
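
As an optional smoke test, you can hit Ollama's REST API directly with a one-off generation request (the first call may be slow while weights load):

curl http://localhost:11434/api/generate -d '{
  "model": "nemotron-3-super:120b",
  "prompt": "Say hello in one sentence.",
  "stream": false
}'

Expected: a JSON object whose response field contains the model's reply.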


Phase 2: Install and Run NemoClaw

Step 4. Install NemoClaw

This single command handles everything: installs Node.js (if needed), installs OpenShell, clones the latest stable NemoClaw release, builds the CLI, and runs the onboard wizard to create a sandbox.

curl -fsSL https://www.nvidia.com/nemoclaw.sh | bash

The onboard wizard walks you through setup:

  1. Sandbox name -- Pick a name (e.g. my-assistant). Names must be lowercase alphanumeric with hyphens only.
  2. Inference provider -- Select Local Ollama (option 7).
  3. Model -- Select nemotron-3-super:120b (option 1).
  4. Policy presets -- Accept the suggested presets when prompted (hit Y).

When complete you will see output like:

──────────────────────────────────────────────────
Dashboard    http://localhost:18789/
Sandbox      my-assistant (Landlock + seccomp + netns)
Model        nemotron-3-super:120b (Local Ollama)
──────────────────────────────────────────────────
Run:         nemoclaw my-assistant connect
Status:      nemoclaw my-assistant status
Logs:        nemoclaw my-assistant logs --follow
──────────────────────────────────────────────────

Important

Save the tokenized Web UI URL printed at the end -- you will need it in Step 8. It looks like: http://127.0.0.1:18789/#token=<long-token-here>

Note

If nemoclaw is not found after install, run source ~/.bashrc to reload your shell path.

Step 5. Connect to the sandbox and verify inference

Connect to the sandbox:

nemoclaw my-assistant connect

You will see sandbox@my-assistant:~$ -- you are now inside the sandboxed environment.

Verify that the inference route is working:

curl -sf https://inference.local/v1/models

Expected: JSON listing nemotron-3-super:120b.
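
Since the route appears to expose an OpenAI-compatible API (as the /v1/models path suggests), a chat completion request should exercise the full inference path. A quick sketch, assuming the standard endpoint shape; expect a slow first response on a 120B model:

curl -s https://inference.local/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{"model": "nemotron-3-super:120b", "messages": [{"role": "user", "content": "Say hello."}]}'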

Step 6. Talk to the agent (CLI)

Still inside the sandbox, send a test message:

openclaw agent --agent main --local -m "hello" --session-id test

The agent will respond using Nemotron 3 Super. First responses may take 30--90 seconds for a 120B parameter model running locally.
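
Because replies are keyed to a session ID, you can continue the same conversation by reusing it -- for example, a follow-up in the test session (assuming the agent persists session history, as the flag suggests):

openclaw agent --agent main --local -m "summarize what I said so far" --session-id test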

Step 7. Interactive TUI

Launch the terminal UI for an interactive chat session:

openclaw tui

Press Ctrl+C to exit the TUI.

Step 8. Exit the sandbox and access the Web UI

Exit the sandbox to return to the host:

exit

If accessing the Web UI directly on the Spark (keyboard and monitor attached), open a browser and navigate to the tokenized URL from Step 4:

http://127.0.0.1:18789/#token=<long-token-here>

If accessing the Web UI from a remote machine, you need to set up port forwarding.

First, find your Spark's IP address. On the Spark, run:

hostname -I | awk '{print $1}'

This prints the primary IP address (e.g. 192.168.1.42). You can also find it in Settings > Wi-Fi or Settings > Network on the Spark's desktop, or check your router's connected-devices list.

Start the port forward on the Spark host:

openshell forward start 18789 my-assistant --background

Then from your remote machine, create an SSH tunnel to the Spark (replace <your-spark-ip> with the IP address from above):

ssh -L 18789:127.0.0.1:18789 <your-user>@<your-spark-ip>
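
If you prefer to keep the tunnel running in the background without an interactive shell, the standard -N -f flags work:

ssh -N -f -L 18789:127.0.0.1:18789 <your-user>@<your-spark-ip>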

Now open the tokenized URL in your remote machine's browser:

http://127.0.0.1:18789/#token=<long-token-here>

Important

Use 127.0.0.1, not localhost -- the gateway origin check requires an exact match.


Phase 3: Telegram Bot

Note

If you already configured Telegram during the NemoClaw onboard wizard (step 5/8), you can skip this phase. These steps cover adding Telegram after the initial setup.

Step 9. Create a Telegram bot

Open Telegram, find @BotFather, send /newbot, and follow the prompts. Copy the bot token it gives you.
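
To confirm the token is valid before wiring it into NemoClaw, you can query the Telegram Bot API directly (substitute your real token):

curl -s "https://api.telegram.org/bot<your-bot-token>/getMe"

Expected: a JSON response with "ok": true and your bot's username.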

Step 10. Configure and start the Telegram bridge

Make sure you are on the host (not inside the sandbox). If you are inside the sandbox, run exit first.

Set the required environment variables. Replace the placeholders with your actual values. SANDBOX_NAME must match the sandbox name you chose during the onboard wizard:

export TELEGRAM_BOT_TOKEN=<your-bot-token>
export SANDBOX_NAME=my-assistant
export NVIDIA_API_KEY=<your-nvidia-api-key>

Add the Telegram network policy to the sandbox:

nemoclaw my-assistant policy-add

When prompted, select telegram and hit Y to confirm.

Start the Telegram bridge in the same shell session where you exported the variables:

nemoclaw start

The Telegram bridge starts only when the TELEGRAM_BOT_TOKEN environment variable is set. Verify the services are running:

nemoclaw status

Open Telegram, find your bot, and send it a message. The bot forwards it to the agent and replies.

Note

The first response may take 30--90 seconds for a 120B parameter model running locally.

Note

If the bridge does not appear in nemoclaw status, make sure TELEGRAM_BOT_TOKEN is exported in the same shell session where you run nemoclaw start. You can also try stopping and restarting:

nemoclaw stop
export TELEGRAM_BOT_TOKEN=<your-bot-token>
nemoclaw start

Note

For details on restricting which Telegram chats can interact with the agent, see the NemoClaw Telegram bridge documentation.


Phase 4: Cleanup and Uninstall

Step 11. Stop services

Stop any running auxiliary services (Telegram bridge, cloudflared tunnel):

nemoclaw stop

Stop the port forward:

openshell forward list          # find active forwards
openshell forward stop 18789    # stop the dashboard forward

Step 12. Uninstall NemoClaw

Run the uninstaller from the cloned source directory. It removes all sandboxes, the OpenShell gateway, Docker containers/images/volumes, the CLI, and all state files. Docker, Node.js, npm, and Ollama are preserved.

cd ~/.nemoclaw/source
./uninstall.sh

Uninstaller flags:

Flag Effect
--yes Skip the confirmation prompt
--keep-openshell Leave the openshell binary in place
--delete-models Also remove the Ollama models pulled by NemoClaw

To remove everything including the Ollama model:

./uninstall.sh --yes --delete-models

The uninstaller runs six steps (a post-uninstall spot-check follows the list):

  1. Stop NemoClaw helper services and port-forward processes
  2. Delete all OpenShell sandboxes, the NemoClaw gateway, and providers
  3. Remove the global nemoclaw npm package
  4. Remove NemoClaw/OpenShell Docker containers, images, and volumes
  5. Remove Ollama models (only with --delete-models)
  6. Remove state directories (~/.nemoclaw, ~/.config/openshell, ~/.config/nemoclaw) and the OpenShell binary
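
After the uninstaller finishes, a quick spot-check (a sketch; adjust to your setup) confirms the cleanup:

command -v nemoclaw || echo "nemoclaw CLI removed"
ls ~/.nemoclaw 2>/dev/null || echo "state directory removed"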

Note

The source clone at ~/.nemoclaw/source is removed as part of state cleanup in step 6. If you want to keep a local copy, move or back it up before running the uninstaller.

Useful commands

Command Description
nemoclaw my-assistant connect Shell into the sandbox
nemoclaw my-assistant status Show sandbox status and inference config
nemoclaw my-assistant logs --follow Stream sandbox logs in real time
nemoclaw list List all registered sandboxes
nemoclaw start Start auxiliary services (Telegram bridge, cloudflared)
nemoclaw stop Stop auxiliary services
openshell term Open the monitoring TUI on the host
openshell forward list List active port forwards
openshell forward start 18789 my-assistant --background Restart port forwarding for Web UI
cd ~/.nemoclaw/source && ./uninstall.sh Remove NemoClaw (preserves Docker, Node.js, Ollama)
cd ~/.nemoclaw/source && ./uninstall.sh --delete-models Remove NemoClaw and Ollama models

Troubleshooting

Symptom Cause Fix
nemoclaw: command not found after install Shell PATH not updated Run source ~/.bashrc (or source ~/.zshrc for zsh), or open a new terminal window.
Installer fails with Node.js version error Node.js version below 20 Install Node.js 20+: curl -fsSL https://deb.nodesource.com/setup_22.x | sudo -E bash - && sudo apt-get install -y nodejs then re-run the installer.
npm install fails with EACCES permission error npm global directory not writable mkdir -p ~/.npm-global && npm config set prefix ~/.npm-global && export PATH=~/.npm-global/bin:$PATH then re-run the installer. Add the export line to ~/.bashrc to make it permanent.
Docker permission denied User not in docker group sudo usermod -aG docker $USER, then log out and back in.
Gateway fails with cgroup / "Failed to start ContainerManager" errors Docker not configured for host cgroup namespace on DGX Spark Run the cgroup fix: sudo python3 -c "import json, os; path='/etc/docker/daemon.json'; d=json.load(open(path)) if os.path.exists(path) else {}; d['default-cgroupns-mode']='host'; json.dump(d, open(path,'w'), indent=2)" then sudo systemctl restart docker. Alternatively, run sudo nemoclaw setup-spark which applies this fix automatically.
Gateway fails with "port 8080 is held by container..." Another OpenShell gateway or container is using port 8080 Stop the conflicting container: openshell gateway destroy -g <old-gateway-name> or docker stop <container-name> && docker rm <container-name>, then retry nemoclaw onboard.
Sandbox creation fails Stale gateway state or DNS not propagated Run openshell gateway destroy && openshell gateway start, then re-run the installer or nemoclaw onboard.
CoreDNS crash loop Known issue on some DGX Spark configurations Run sudo ./scripts/fix-coredns.sh from the NemoClaw repo directory.
"No GPU detected" during onboard DGX Spark GB10 reports unified memory differently Expected on DGX Spark. The wizard still works and uses Ollama for inference.
Inference timeout or hangs Ollama not running or not reachable Check Ollama: curl http://localhost:11434. If not running: sudo systemctl start ollama (do not use ollama serve &; see Step 2). If running but unreachable from sandbox, ensure Ollama is configured to listen on 0.0.0.0 (see Step 2 in Instructions).
Agent gives no response or is very slow Normal for 120B model running locally Nemotron 3 Super 120B can take 30--90 seconds per response. Verify inference route: nemoclaw my-assistant status.
Port 18789 already in use Another process is bound to the port lsof -i :18789 then kill <PID>. If needed, kill -9 <PID> to force-terminate.
Web UI port forward dies or dashboard unreachable Port forward not active openshell forward stop 18789 my-assistant then openshell forward start 18789 my-assistant --background.
Web UI shows origin not allowed Accessing via localhost instead of 127.0.0.1 Use http://127.0.0.1:18789/#token=... in the browser. The gateway origin check requires 127.0.0.1 exactly.
Telegram bridge does not start Missing environment variables Ensure TELEGRAM_BOT_TOKEN and SANDBOX_NAME are set on the host. SANDBOX_NAME must match the sandbox name from onboarding.
Telegram bridge needs restart but nemoclaw stop does not work Known bug in nemoclaw stop Find the PID from the nemoclaw start output, force-kill with kill -9 <PID>, then run nemoclaw start again.
Telegram bot receives messages but does not reply Telegram policy not added to sandbox Run nemoclaw my-assistant policy-add, type telegram, hit Y. Then restart the bridge with nemoclaw start.

Note

DGX Spark uses a Unified Memory Architecture (UMA), which enables dynamic memory sharing between the GPU and CPU. Because many applications are still being updated to take advantage of UMA, you may encounter memory issues even when you are within the memory capacity of DGX Spark. If that happens, manually flush the buffer cache with:

sudo sh -c 'sync; echo 3 > /proc/sys/vm/drop_caches'
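
To see how much memory the flush released, compare usage before and after:

free -h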

For the latest known issues, please review the DGX Spark User Guide.