Activity
Mon
Wed
Fri
Sun
Aug
Sep
Oct
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
What is this?
Less
More

Memberships

UPAYA | Skillful Healer Skool

134 members • Free

AI Automation Society

418.8k members • Free

Constellations

89 members • Free

Clief Notes

41.2k members • Free

153 contributions to Clief Notes
🔥 Follow-up: Fable found the leaks. I fixed them. 48 hours later, I re-measured... and one of them is already growing back. 📏
📝Note: This post is not about ICM, it's about tuning the AI systems you are using with ICM, if you're new to ICM, you can start Here: https://www.skool.com/cliefnotes/welcome-to-clief-notes-heres-where-to-start-2?p=f8f85a09 @Mira Bradshaw also made a great post to get you started! https://www.skool.com/cliefnotes/clief-notes-gems-for-starting-in-icm?p=0385cbf9 Let's dive into the results just 48 hours later! 👇 Two days ago, I posted about pointing Fable at my own environment and finding out where my tokens were really going. 📝If you need to catch up with my first post: https://www.skool.com/cliefnotes/i-just-spent-the-last-few-hours-asking-fable-to-make-itself-cheaper?p=cc666986 A lot of you went hunting in your own setups after that post (and found GOLD 👏). So, here's the part two: what happened AFTER the audit. 📌 TL;DR - The fixes shipped, the savings are real, and the most important lesson came from re-measuring two days later: optimization is not an event. Drift never sleeps. The audit is worth nothing without a cadence behind it. 🧾 First, the receipts. Every fix from the audit is now live: ✅ The file that lied. My startup menu file claimed "450 tokens" in its own header and was actually ~7,000. It got rebuilt down to ~750, and the detail it was hoarding moved to where it already lived anyway (each project's own handoff file). 💡One fact, one place, and everything else that needs it points to it by design. ✅ The silent tax. My safety hook was injecting ~800 tokens into EVERY prompt, repeating rules that already load once at session start. It's now ~85 tokens: a short pointer to the rules instead of a full copy of them. 💡The enforcement never lived in the repetition; it lives in the hooks that physically block bad actions. Cut the prose, keep the mechanism. That one saves on every prompt, in every session, forever.
🔥 Follow-up: Fable found the leaks. I fixed them. 48 hours later, I re-measured... and one of them is already growing back. 📏
niiiceeeeeee excited about the local set up
@Bas Rosario Lmk I’m available this weekend for calls once u have lm studio up and running reliably I can show you how to have it work as an inference server so you can use it from other machines / phone etc
Fable… let’s see if this works
My weekly limits roll over in about 9 hours. I have a bunch of Fable tokens left that I do not want to use. But I’m also at ~96% of my 5 hour usage window and I need to go to bed. Doesn’t reset for an hour. So, I did what anyone would do, I asked Fable what my options are. Apparently it has a timer. It has set one for 80 minutes and that will wake it so it can fire what we have queued up… I guess I’ll find out what does or doesn’t happen to that branch come morning…!!! Fable has immense confidence that all the results will be ready for me to review over a coffee… ☕️
I’d love to hear if this worked for you. Mine has not been successful. And only after I ping it, it comes back and is like: “ oh yes everything was done like an hour ago …”
😅 I just spent the last few hours asking Fable to make itself cheaper.
💪 It did not disappoint. 👉TL:DR - It made all AI use cheaper in my environment, made me more efficient, and it increased my memory system usage by almost half. It should be no secret to anyone here: I don't count tokens. 🪙 I'm on a Pro Max plan and I go where the work, and the passion takes me. But I have used Fable before, and I know about the token burn! So I came up with a plan! I pointed Fable at my own setup and pulled the report. 😅 I'm glad I did. Turns out it could've been cheaper and cleaner the whole time. I didn't ask Fable what it thinks. I pointed it at the real thing. My hooks, my handoff files, my subagent config, the actual token counts on disk. Two questions: where does the money go? & how do we spend less? 💡The answers. A silent forgotten tax on every message, A safety hook was injecting around 800 tokens into every prompt I sent. Repeated rules I already load once at the top. And a file that runs at every session start called itself " 450 tokens" in its own header. It was not... it was 7,000. My actual face when I saw this--->🤬 a few moments later--->😆 (Apparently, it's not only AI that is bad at counting sometimes!) My own file lied to me, and I'd read past that number a hundred times with a smile on my face! My long sessions. The ones I'm proudest of. Turns out that hour six is my most expensive and also least impactful at the same time. 💩 I know that models follow instructions worse at higher context. And because the quality was still there, the marathon I read as momentum was being billed to me at top rates for the weakest output of the day. 🙄 My helper agents were all running on the flagship (Most expensive model). These are the agents Claude spins up to send off to do tasks to get more accomplished in a shorter amount of time! This I knew about, and it was by choice, it's my environment and I never hit my 5 hour or weekly cap, so I did not care about this for myself, bigger = better right? Use Fable 5 on UltraCode in my environment, and we find out that logic was wrong....
😅 I just spent the last few hours asking Fable to make itself cheaper.
@Bas Rosario yesss!!!! Also one more thing gpt5.6 sol when it comes out will be similar quality and cheaper than fable … that’s my plan after July 7
@Alex Brown tbh for me I had to make myself just go to bed at the same time every day. Regardless of hyperfocus (which these things are built to milk btw it’s a literal slot machine regarding the effects on your dopamine etc.) Also it’s about self awareness knowing yourself and your body. Sometimes you just need a walk or a drink of water or a nap . In the more tactical side of things I literally had a post mortem and forensic data deep dive with my agent . It has organically slowly improved but the next step should be to turn that forensic data analysis into guardrails.
Fable 5 is back…. 🥳🥳🥳
Is it real? Edit: How much better is Claude Fable 5 vs. Opus 4.8? (Anthropic’s launch benchmarks, June 9, 2026) • SWE-Bench Pro (agentic coding): 80.3% vs. 69.2% for Opus 4.8 • FrontierCode Diamond: 29.3% vs. 13.4% — more than double • Core pattern: the longer and more complex the task, the larger Fable 5’s lead; on short, well-scoped tasks the two are much closer In practice: noticeably better on long multi-step work (migrations, feature builds, complex pipelines); barely different for quick, simple tasks. Caveat: these are Anthropic’s own benchmarks — directionally correct, but vendor numbers.
@Andre Cordero oof. Best Buy has a good return policy no? Also remember I’m not running a 35b dense model. It’s a mixture of experts. Different requirements. Also make sure to check all your parameters and maybe ask a cloud model to help you optimize your setup
@Andre Cordero no lol you just need to find the right models. That took me a lot of trial and error . U try gemma4 12b? Or the moe one?
Sneak Peak at the platform we are building
This is a devlog from David! A lot of people are saying they want the ability to scale and deploy their ICM and their workflows. We looked at all the possible problems security issues and we have been spending a lot of time building something for all of you! It's almost ready for release, but here's a little developers log to kind of check out some things that David has been doing to build it up. It's far from perfect, but for those technical folks out there you may enjoy it!
Thanks for the update !
1-10 of 153
Simon Gonzalez De Cruz
6
1,447points to level up
@simon-gonzalez-de-cruz-3638
Perpetual learner and builder.

Active 17h ago
Joined Mar 10, 2026
INTP
Long Beach, CA
Powered by