Run Hermes Free Forever: Gemma 4 MLX Update Makes It Up to 90% Faster (Ollama + Apple Silicon)
Julian demonstrates how to run Hermes for free using a new Gemma 4 update with Ollama on Apple Silicon via MLX, claiming up to 90โ95% faster local performance compared to before. He shows Gemma 4 set up as a free profile inside an agent OS, with examples like quickly responding to prompts, building a simple toโdo list app, and using Hermesโs /learn command to turn a tutorial into a reusable skill that runs in the background. For nonโMac users, he notes an alternative: using a free OpenRouter API option for the 31B model. He explains that local models keep workflows private, offline, and suitable for long-running agent loops without token or subscription costs, and he briefly outlines setup by updating Ollama and selecting the newer MOX Gemma 4 models.
00:00 Run Hermes Free Faster
00:33 Agent OS Demo Setup
01:14 Free API Alternative
02:08 Hermes Learn Skill
03:40 Install Ollama Gemma
05:20 Loops and Subagents
06:55 Local Loop Engine
08:05 Easy Commands Objections
08:58 Wrap Up and Offer
09:20 Boardroom Walkthrough
10:35 Final Thanks