This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| wiki:ai:dgx-spark-monitoring [2026/04/17 11:28] – swilson | wiki:ai:dgx-spark-monitoring [2026/04/17 11:36] (current) – [Step 1: SSH into the DGX Spark] swilson | ||
|---|---|---|---|
| Line 12: | Line 12: | ||
| =====Step 1: SSH into the DGX Spark===== | =====Step 1: SSH into the DGX Spark===== | ||
| - | From your Mac terminal, SSH into the Spark: | + | From your Local terminal, SSH into the Spark: |
| ssh YOUR_USERNAME@YOUR_SPARK_IP | ssh YOUR_USERNAME@YOUR_SPARK_IP | ||
| Line 39: | Line 38: | ||
| Verify it works by launching the interactive TUI: | Verify it works by launching the interactive TUI: | ||
| - | |||
| ./ | ./ | ||
| Line 55: | Line 53: | ||
| Start nv-monitor in headless mode with a Bearer token: | Start nv-monitor in headless mode with a Bearer token: | ||
| - | |||
| cd ~/ | cd ~/ | ||
| ./ | ./ | ||
| Line 72: | Line 69: | ||
| Verify it is working: | Verify it is working: | ||
| - | |||
| curl -s -H " | curl -s -H " | ||
| Line 95: | Line 91: | ||
| global: | global: | ||
| scrape_interval: | scrape_interval: | ||
| - | |||
| scrape_configs: | scrape_configs: | ||
| - job_name: ' | - job_name: ' | ||
| Line 127: | Line 122: | ||
| Connect both containers to a shared Docker network so Grafana can reach Prometheus by name: | Connect both containers to a shared Docker network so Grafana can reach Prometheus by name: | ||
| - | |||
| docker network create monitoring | docker network create monitoring | ||
| docker network connect monitoring prometheus | docker network connect monitoring prometheus | ||
| Line 133: | Line 127: | ||
| Verify both are healthy: | Verify both are healthy: | ||
| - | |||
| docker ps | docker ps | ||
| curl -s localhost: | curl -s localhost: | ||
| Line 148: | Line 141: | ||
| **Note:** The DGX Spark does not have UFW installed. Use iptables directly: | **Note:** The DGX Spark does not have UFW installed. Use iptables directly: | ||
| - | |||
| sudo iptables -I INPUT -s 172.17.0.0/ | sudo iptables -I INPUT -s 172.17.0.0/ | ||
| Line 162: | Line 154: | ||
| On your **Mac**, open a **new local terminal** (not an SSH session to the Spark — the prompt must show your Mac hostname): | On your **Mac**, open a **new local terminal** (not an SSH session to the Spark — the prompt must show your Mac hostname): | ||
| - | |||
| ssh -L 9090: | ssh -L 9090: | ||
| Line 298: | Line 289: | ||
| **Fix 1** — Use the correct target IP in '' | **Fix 1** — Use the correct target IP in '' | ||
| - | |||
| targets: [' | targets: [' | ||
| Line 306: | Line 296: | ||
| **Fix 2** — Allow Docker bridge through the firewall: | **Fix 2** — Allow Docker bridge through the firewall: | ||
| - | |||
| sudo iptables -I INPUT -s 172.17.0.0/ | sudo iptables -I INPUT -s 172.17.0.0/ | ||
| Line 317: | Line 306: | ||
| ====Grafana cannot connect to Prometheus — " | ====Grafana cannot connect to Prometheus — " | ||
| The containers are not on the same Docker network. Run: | The containers are not on the same Docker network. Run: | ||
| - | |||
| docker network create monitoring | docker network create monitoring | ||
| docker network connect monitoring prometheus | docker network connect monitoring prometheus | ||