How to Build and Run a Self-Improving AI Agent with Hermes on NVIDIA Hardware

By • min read

Introduction

Agentic AI is transforming how we work, and the open-source community has embraced frameworks that make self-improving agents a reality. Hermes Agent, developed by Nous Research, has quickly become the most used agent worldwide on OpenRouter, with over 140,000 GitHub stars in under three months. It’s designed for reliability and continuous learning—capabilities that were historically difficult to achieve. By running locally on NVIDIA RTX PCs, RTX PRO workstations, or DGX Spark, you get always-on, high-speed performance. This guide walks you through setting up Hermes and leveraging its unique features, such as self-evolving skills and contained sub-agents, while pairing it with Qwen 3.6 models for a powerful, local AI assistant.

How to Build and Run a Self-Improving AI Agent with Hermes on NVIDIA Hardware
Source: blogs.nvidia.com

What You Need

Step-by-Step Guide

Step 1: Verify Your Hardware Environment

Ensure your system meets the requirements for running local AI agents. Hermes is optimized for always-on use on NVIDIA hardware. Check that your GPU has sufficient VRAM (e.g., at least 8GB for smaller models, 20GB+ for Qwen 3.6 35B). For the best experience, use an NVIDIA RTX GPU or a DGX Spark, which provides the computational power needed for 24/7 operation without cloud dependencies. Update your NVIDIA drivers to the latest version to ensure compatibility with CUDA and PyTorch.

Step 2: Install Hermes Agent Framework

Clone the official Hermes repository from GitHub. Open a terminal and run:

git clone https://github.com/NousResearch/Hermes.git
cd Hermes

Create a Python virtual environment to avoid conflicts:

python -m venv herm-env
source herm-env/bin/activate  # On Windows: herm-env\Scripts\activate

Install the required dependencies:

pip install -r requirements.txt

This pulls in libraries for model loading, tool integration, and GPU acceleration.

Step 3: Configure Hermes for Local Execution

Hermes is provider- and model-agnostic, but for local use you’ll need to set it to run on your NVIDIA GPU. Edit the configuration file (usually config.yaml or .env) to specify:

Example snippet:

model:
  path: "/path/to/qwen3.6-35b"
  device: "cuda"
agent:
  persistent: true

Save the file and test the configuration by running a simple command like python herm.py --check.

Step 4: (Optional) Download and Integrate Qwen 3.6 Models

For the best performance with Hermes, use the Qwen 3.6 series from Alibaba. These open-weight LLMs are designed for local agents. The 35B model runs on ~20GB VRAM and outperforms previous 120B models. Download the model from Hugging Face or official repository:

pip install huggingface-hub
huggingface-cli download Qwen/Qwen3.6-35B

Then update your Hermes configuration to point to the downloaded model folder. The 27B variant is also available and delivers accuracy matching 400B-parameter predecessors, making it ideal for lower-memory systems.

How to Build and Run a Self-Improving AI Agent with Hermes on NVIDIA Hardware
Source: blogs.nvidia.com

Step 5: Launch Hermes and Explore Core Capabilities

Start the agent with:

python herm.py

Once running, you can interact via command line, integrate with messaging apps, or allow file access. Hermes’s unique features become active automatically:

Try giving it a challenging task like “Organize my documents by project and summarize each folder” and observe how it refines its approach.

Step 6: Enable Self-Improvement Through Feedback

The real power of Hermes is its ability to improve itself. After each interaction, provide explicit feedback (e.g., “That worked well, save the method” or “Please try a different approach”). Hermes records these learnings as new skills. Over time, it becomes more efficient and accurate without manual reprogramming. You can also review the skill library by calling herm.skills.list() to see what it has learned.

Tips for Success

By following these steps, you turn your NVIDIA-powered PC into a self-improving AI assistant that works locally, privately, and reliably. Whether you’re automating workflows, managing files, or exploring agentic AI, Hermes with Qwen 3.6 unlocks a new level of productivity.

Recommended

Discover More

Acoustic Harmony: Soundproofing Your Home Theater Without Sacrificing StyleCritical RCE Vulnerability Discovered in xrdp Remote Desktop Server – CVE-2025-68670Bohmian Mechanics: A Radical Quantum Reality CheckMegaETH ‘MEGA’ Token Launches at $2 Billion Valuation, Real-Time Trading Begins Across 13 ExchangesRobinhood Opens Venture Fund to Retail Investors: 150,000+ Join Early Access to Private Tech Giants