mirror of
https://github.com/NVIDIA/dgx-spark-playbooks.git
synced 2026-04-23 18:33:54 +00:00
DGX Spark MCP Server Playbook
This playbook installs and configures the DGX Spark MCP Server, a tool that provides hardware-aware Apache Spark optimization for NVIDIA DGX systems via the Model Context Protocol (MCP).
Overview
The DGX Spark MCP Server enables MCP clients (like Claude Desktop or Claude Code) to:
- Detect Hardware: Automatically read DGX GPU topology, memory, and CPU specs.
- Optimize Spark: Generate tuned Spark configurations (spark-submit args) based on detected hardware and workload type (ETL, ML Training, Inference).
- Monitor: Check real-time GPU availability before submitting jobs.
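To illustrate what hardware-aware tuning can look like, the sketch below maps a hardware profile and a workload type to spark-submit arguments. The HardwareProfile fields, the WORKLOAD_MEMORY_FRACTION table, and the sizing rules are assumptions made for this example and are not the server's actual heuristics; only the spark-submit flags and Spark property names are standard.

```python
from dataclasses import dataclass


@dataclass
class HardwareProfile:
    """Hypothetical summary of what hardware detection might report."""
    gpu_count: int
    cpu_cores: int
    memory_gb: int


# Assumed share of system memory handed to executors per workload type.
WORKLOAD_MEMORY_FRACTION = {"etl": 0.6, "ml_training": 0.75, "inference": 0.5}


def build_spark_submit_args(hw: HardwareProfile, workload: str) -> list[str]:
    """Return spark-submit arguments sized from the hardware profile."""
    if workload not in WORKLOAD_MEMORY_FRACTION:
        raise ValueError(f"unknown workload type: {workload}")
    # One executor per GPU; fall back to a single executor on CPU-only hosts.
    executors = max(hw.gpu_count, 1)
    cores = max(hw.cpu_cores // executors, 1)
    usable_gb = int(hw.memory_gb * WORKLOAD_MEMORY_FRACTION[workload])
    memory_gb = max(usable_gb // executors, 1)
    args = [
        "--num-executors", str(executors),
        "--executor-cores", str(cores),
        "--executor-memory", f"{memory_gb}g",
    ]
    if hw.gpu_count > 0:
        # Standard Spark 3 resource-scheduling properties for GPUs.
        args += [
            "--conf", "spark.executor.resource.gpu.amount=1",
            "--conf", "spark.task.resource.gpu.amount=1",
        ]
    return args


if __name__ == "__main__":
    profile = HardwareProfile(gpu_count=8, cpu_cores=128, memory_gb=1024)
    print(" ".join(build_spark_submit_args(profile, "etl")))
```

Assigning one executor per GPU keeps the example simple; a real tuner would also account for memory overhead, shuffle settings, and the cluster manager in use.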
Prerequisites
- NVIDIA DGX System (or compatible GPU server)
- NVIDIA Drivers installed (nvidia-smi available)
- Node.js 18+
- Root access (for systemd service installation)
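The prerequisites above can be checked before running the installer. The following is a minimal sketch, not part of the playbook: the function names are hypothetical, and it only verifies that nvidia-smi and node are on PATH and that the Node.js major version is at least 18.

```python
import re
import shutil
import subprocess


def node_major_version(version_output: str) -> int:
    """Extract the major version from `node --version` output such as 'v18.17.0'."""
    match = re.match(r"v(\d+)\.", version_output.strip())
    if match is None:
        raise ValueError(f"unrecognized Node.js version string: {version_output!r}")
    return int(match.group(1))


def check_prerequisites(min_node_major: int = 18) -> list[str]:
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if shutil.which("nvidia-smi") is None:
        problems.append("nvidia-smi not found on PATH (are the NVIDIA drivers installed?)")
    node = shutil.which("node")
    if node is None:
        problems.append("node not found on PATH")
    else:
        out = subprocess.run([node, "--version"], capture_output=True, text=True).stdout
        try:
            if node_major_version(out) < min_node_major:
                problems.append(f"Node.js {out.strip()} is older than {min_node_major}")
        except ValueError as exc:
            problems.append(str(exc))
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("MISSING:", problem)
```

Returning a list of problems rather than failing on the first one lets a user fix everything in a single pass.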
Directory Structure
.
├── config/
│ └── default.json # Default configuration
├── deploy/
│ └── dgx-spark-mcp.service # Systemd service file
└── scripts/
└── install.sh # Automated installer
Installation
1. Run the installer:
   sudo ./scripts/install.sh
   This script will:
   - Install dgx-spark-mcp globally via npm.
   - Create a dedicated system user (dgx).
   - Set up the logging directory /var/log/dgx-spark-mcp.
   - Install and start the systemd service.
2. Verify Installation:
   systemctl status dgx-spark-mcp
Configuration
The configuration file is located at /etc/dgx-spark-mcp/config.json.
Key Settings
- mcp.transport: stdio (default) or sse.
- hardware.enableGpuMonitoring: Set to true to enable real-time nvidia-smi queries.
- logging.level: info or debug.
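As a rough illustration of the settings above, the sketch below loads a sample configuration and checks the three keys. It assumes that the dotted names map to nested JSON objects, which this README does not state explicitly, and it treats the values listed above as the only allowed ones; the helper names are hypothetical.

```python
import json

# Values taken from the settings listed above; the real server may accept more.
ALLOWED_TRANSPORTS = ("stdio", "sse")
ALLOWED_LOG_LEVELS = ("info", "debug")

# Assumed file layout: dotted keys expressed as nested objects.
EXAMPLE_CONFIG = json.loads("""
{
  "mcp": {"transport": "stdio"},
  "hardware": {"enableGpuMonitoring": true},
  "logging": {"level": "info"}
}
""")


def lookup(config: dict, dotted_key: str):
    """Walk a dotted key such as 'mcp.transport'; return None if any part is missing."""
    node = config
    for part in dotted_key.split("."):
        if not isinstance(node, dict) or part not in node:
            return None
        node = node[part]
    return node


def validate_config(config: dict) -> list[str]:
    """Return a list of error messages; keys that are absent are not checked."""
    errors = []
    transport = lookup(config, "mcp.transport")
    if transport is not None and transport not in ALLOWED_TRANSPORTS:
        errors.append(f"mcp.transport must be one of {ALLOWED_TRANSPORTS}")
    gpu_monitoring = lookup(config, "hardware.enableGpuMonitoring")
    if gpu_monitoring is not None and not isinstance(gpu_monitoring, bool):
        errors.append("hardware.enableGpuMonitoring must be true or false")
    level = lookup(config, "logging.level")
    if level is not None and level not in ALLOWED_LOG_LEVELS:
        errors.append(f"logging.level must be one of {ALLOWED_LOG_LEVELS}")
    return errors


if __name__ == "__main__":
    print(validate_config(EXAMPLE_CONFIG) or "config looks valid")
```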
Usage with Claude Desktop
Add the following to your claude_desktop_config.json:
{
  "mcpServers": {
    "dgx-spark": {
      "command": "dgx-spark-mcp"
    }
  }
}
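If claude_desktop_config.json already lists other MCP servers, the entry should be merged in rather than pasted over the file. The sketch below shows one way to do that; the helper names are hypothetical, and the path to the config file is left to the caller because its location depends on the operating system.

```python
import json
from pathlib import Path


def add_mcp_server(config: dict, name: str, command: str) -> dict:
    """Return a copy of config with one mcpServers entry added or replaced."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = {"command": command}
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path) -> None:
    """Merge the dgx-spark entry into the file at path, creating it if needed."""
    existing = json.loads(path.read_text()) if path.exists() else {}
    updated = add_mcp_server(existing, "dgx-spark", "dgx-spark-mcp")
    path.write_text(json.dumps(updated, indent=2) + "\n")


if __name__ == "__main__":
    # Show the result of merging into an empty config without touching any file.
    print(json.dumps(add_mcp_server({}, "dgx-spark", "dgx-spark-mcp"), indent=2))
```

Working on copies leaves the caller's dictionary untouched, so a failed write cannot leave a half-modified in-memory config behind.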
Troubleshooting
Service fails to start? Check logs:
journalctl -u dgx-spark-mcp -f
Permission denied?
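Ensure the dgx user has permission to run nvidia-smi. You may need to add the user to the video group, then restart the service so the new group membership takes effect:
sudo usermod -a -G video dgx
sudo systemctl restart dgx-spark-mcp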
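To confirm that nvidia-smi is callable and returns usable data for a given user, a short script can run a query and parse the result. This is a sketch only: the function names and the 10% idle threshold are assumptions, and it is not how the server itself performs GPU monitoring. Fields that nvidia-smi reports as non-numeric (for example [N/A]) are stored as None.

```python
import subprocess

QUERY_FIELDS = "index,name,memory.used,memory.total,utilization.gpu"


def to_int(value: str):
    """Convert a numeric field to int; return None for values such as '[N/A]'."""
    try:
        return int(value)
    except ValueError:
        return None


def parse_gpu_csv(text: str) -> list[dict]:
    """Parse `--format=csv,noheader,nounits` output into one dict per GPU."""
    gpus = []
    for line in text.strip().splitlines():
        parts = [part.strip() for part in line.split(",")]
        used, total, util = parts[-3:]
        gpus.append({
            "index": int(parts[0]),
            "name": ", ".join(parts[1:-3]),
            "memory_used_mib": to_int(used),
            "memory_total_mib": to_int(total),
            "utilization_pct": to_int(util),
        })
    return gpus


def query_gpus() -> list[dict]:
    """Run nvidia-smi; raises if the binary is missing or exits with an error."""
    result = subprocess.run(
        ["nvidia-smi", f"--query-gpu={QUERY_FIELDS}", "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return parse_gpu_csv(result.stdout)


def idle_gpus(gpus: list[dict], max_utilization_pct: int = 10) -> list[int]:
    """Return indices of GPUs whose reported utilization is at or below the threshold."""
    return [
        gpu["index"] for gpu in gpus
        if gpu["utilization_pct"] is not None
        and gpu["utilization_pct"] <= max_utilization_pct
    ]


if __name__ == "__main__":
    try:
        print("idle GPUs:", idle_gpus(query_gpus()))
    except (OSError, subprocess.CalledProcessError) as exc:
        print("nvidia-smi query failed:", exc)
```

Running the script as the service account (for example with sudo -u dgx) exercises the same permissions the service would need.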