Published: 2026-05-13

Run a 24/7 Private Hermes Agent on Your NVIDIA DGX Spark

Alex Finn demos setting up Hermes Agent on an NVIDIA DGX Spark powered entirely by a local model—no cloud API, no subscription fees, completely private. The DGX Spark runs headless (no monitor needed) and connects to your main machine via Tailscale, letting Hermes manage the device, install models, and run agentic tasks around the clock. The entire setup—including Tailscale installation—is handled by giving Hermes a single plain-English prompt.

Source video

"Hermes Agent powered by local models on the DGX Spark is basically magic" by Alex FinnWatch on YouTube →

Key Takeaways

  • DGX Spark runs in headless mode—plug it in, and Hermes Agent on your main machine can control it via Tailscale from anywhere in the world.
  • Prompt to get started: "I purchased a new DGX Spark and want to set it up. I want to run it headless and I want you to be able to control it. Walk through setup, then install Tailscale."
  • Local models are free once the hardware is paid for—no per-token billing, fully private, all data stays on device.
  • LoRA adapters let you customize the local model's voice and output style to match your own.
  • Use Hermes connected to a cloud model first to set up the Spark, then switch to the local model once it's running.