Run Local LLMs on Mac
This guide walks you through running a local LLM server on macOS with an OpenAI compatible API. You get full privacy, zero API costs, and surprisingly good perf
This guide walks you through running a local LLM server on macOS with an OpenAI-compatible API. You get full privacy, zero API costs, and surprisingly good performance on Apple Silicon. We cover two backends: Both expose an OpenAI-compatible endpoint. Hermes works with either one — just point it at or . …
What this page covers
- Choosing a model
- Option A: llama.cpp
- Install
- Download the model
- Start the server
- Memory optimization for constrained systems
- Test it
- Get the model name
- Option B: MLX via omlx
- Install
- Download the model
- Start the server
- Test it
- List available models
- Benchmarks: llama.cpp vs MLX
- Results
- What this means
- Which one should you pick?
- Connect to Hermes
- Timeouts
Section outline mirrored from the official Hermes Agent documentation. Follow any heading to read the complete text on the source site.
More in Guides & Tutorials
Tips & Best Practices
A quick wins collection of practical tips that make you immediately more effective with Hermes Agent. Each section targets a different aspect — scan the headers
Tutorial: Daily Briefing Bot
Tutorial: Build a Daily Briefing Bot In this tutorial, you'll build a personal briefing bot that wakes up every morning, researches topics you care about, summa
Tutorial: Team Telegram Assistant
Set Up a Team Telegram Assistant This tutorial walks you through setting up a Telegram bot powered by Hermes Agent that multiple team members can use. By the en
Using Hermes as a Python Library
Hermes isn't just a CLI tool. You can import directly and use it programmatically in your own Python scripts, web applications, or automation pipelines. This gu
Use MCP with Hermes
This guide shows how to actually use MCP with Hermes Agent in day to day workflows. If the feature page explains what MCP is, this guide is about how to get val
Use Voice Mode with Hermes
This guide is the practical companion to the Voice Mode feature reference. If the feature page explains what voice mode can do, this guide shows how to actually