this post was submitted on 15 Jun 2026

3 points (66.7% liked)

Free Open-Source Artificial Intelligence

4739 readers

4 users here now

Welcome to Free Open-Source Artificial Intelligence!

We are a community dedicated to forwarding the availability and access to:

Free Open Source Artificial Intelligence (F.O.S.A.I.)

More AI Communities

LLM Leaderboards

Developer Resources

GitHub Projects

GitHub Stars

FOSAI Time Capsule

founded 3 years ago

MODERATORS

Blaed@lemmy.world

fosai@lemmy.world

Thoughts on Hermes? (lemmy.world)

submitted 2 weeks ago by damnthefilibuster@lemmy.world to c/fosai@lemmy.world

12 comments fedilink hide all child comments

Folks,

I’m setting up Hermes Agent on my Mac with Ollama hosting a local model. But I’m on the fence on whether I should go with Hermes or OpenClaw. Hermes makes some pretty bold claims about “growing with you” and “self improvement”.

Anyone have any insight into whether it’s as good as promised?

top 12 comments

sorted by: hot top controversial new old

[–] lunarwingorg@lemmy.world 2 points 1 week ago (1 children)

hermes has some built in agent orchestration layer which seems cool on paper. never tried it. other small nice things that are unique to hermes which other agents really don't have, which I have actually tried, include: switchable agent personalities, pretty decent thread suspension mechanism, decent webhook subscriptions, and human delay mode. The biggest thing, at least in my opinion, is certainly: Self-improving skills with patching - with an entire slew of caveats... In my opinion, this is useful but I strongly recommend using a manual review process. Otherwise, the agent has the potential to "teach itself wrong". Human review.

[–] damnthefilibuster@lemmy.world 1 points 1 week ago (1 children)

Yeah I’ve been trying for hours to get it to make a simple token counting skill. It keeps getting it wrong.

It has webhook subscriptions? That’s cool I guess. How do you use the switchable agent personalities?

[–] lunarwingorg@lemmy.world 2 points 1 week ago

you can try to ask the agent to talk like a pirate for example

[–] lefaucet@slrpnk.net 2 points 1 week ago (2 children)

IMO it's too eager to change its own code.

On multiple occasions I've just asked questions like, "what checks do you do on work from delegated workers," and it'll decide to rewrite some of its code and break itself.

[–] damnthefilibuster@lemmy.world 2 points 1 week ago

it just happened to me! I've been using telegram to chat with it all this time. Finally downloaded their desktop app and tried it out. It worked for one day and now it's throwing a "No LLM provider configured" error for Ollama local after an update.

I asked the telegram chat to figure this out and it's currently reading it's own internal code to "figure out when No LLM provider configured is called". 😂😂

[–] damnthefilibuster@lemmy.world 2 points 1 week ago

😂😂😂😂

[–] obelisk_complex@piefed.ca 1 points 2 weeks ago (1 children)

I've been using it with Opencode Go, Ollama, and Claude Code (it can delegate tasks to models through all those, so you can have Claude plan and Deepseek Flash build); I really like it.

I ran into that problem with the agent reporting that subagents succeeded, or work had been done, where it hadn't ("I said I tested that, but I didn't. That's on me. Won't happen again"), so I built a self-check enforcement system for it. You or your agent can set up the system by reading this: https://github.com/obelisk-complex/hermes-agent/blob/main/self-check-enforcement-system-v15.md

It includes the source patch which adds a hook on_output; this allows you to intercept text sent directly from the LLM to the user, which in vanilla is unblockable. So, this system ensures that if something remains unfinished, the LLM can't say it's done; it has to acknowledge what it didn't do before it can send you a message to close the conversation loop. I've built the fork to automatically merge upstream changes around this patch daily at 0400 Pacific time, so I should stay up to date (ish).

I also put in a feature request to get this added upstream. Feature request here: https://github.com/NousResearch/hermes-agent/issues/45881

[–] damnthefilibuster@lemmy.world 2 points 1 week ago (1 children)

what's opencode go and how is it different than opencode?

I'll check out the subagent reporting issue. I did run into it with Gemma-4 but Qwen3.5 and 3.6 both work well in completing tasks. Local models aren't perfect, but they're damn close!

[–] obelisk_complex@piefed.ca 2 points 1 week ago* (last edited 1 week ago) (1 children)

The harness helps a lot even with local models. In fact, I just found this this morning and cherrypicked it: https://github.com/DietrichGebert/ponytail

Recommend doing the same, and for superpowers if you don't have 'em already: https://github.com/obra/superpowers

Opencode Go is the $10/month cloud model subscription from the same group maintaining the OpenCode software. Opencode Zen is a pay-as-you-go version which gives you access to Claude models as well. Keeping pay-as-you-go to subagents only (e.g. telling your agent to launch an opus subagent via your opencode zen key) is actually surprisingly economical - when you're not going turn after turn with hundreds of thousands of tokens of context, claude is pretty reasonably priced.

What I'm doing is spreading out my usage over multiple cheap subscriptions, and augmenting with the occasional pay-as-you-go frontier agent, to get quality in line with what you get out of Claude, at usage that would require the $200/month level, for a lot less money than that.

[–] damnthefilibuster@lemmy.world 2 points 1 week ago (1 children)

I’m a little surprised to hear you say PAYG for Opus sub agents is economical. Maybe the superpowers and ponytail really do have a massive impact on things. I’ll send these to three people I know building heavy production apps right now. And integrate them into my own Hermes setup.

Thank you for the recommendations!

[–] obelisk_complex@piefed.ca 1 points 1 week ago* (last edited 1 week ago) (1 children)

Any time, I hope they're helpful! (☞ﾟヮﾟ)☞

I’m a little surprised to hear you say PAYG for Opus sub agents is economical

I did say it was surprising! 😂 To give you an idea what I mean by "economical", it's never more than a few bucks a day, even on days of heavy use and development with "loop until clean" instructions on QA (for which I use Opus). I accidentally blew through my opencode go quota really early in the first month, so I ended up on PAYG; here's the usage graph:

And here's the numbers breakdown for the highest day (I was evaluating GLM5.1 for general tasks - don't use it for that, it's really token hungry)

That includes a lot of experimentation too while I figured out which models were best for what. I hid Fable because it crushed the rest of the table - really expensive, but worth it for one-shotting very long tasks on the Anthropic subscription is what I found.

[–] damnthefilibuster@lemmy.world 2 points 1 week ago

Damn those are good numbers!