Clief Notes

Write something

Afternoon Tea is happening in 7 days

Pinned

Curtis Hays

May 3 •

🛠️ Show Your Work

The Folder System Became My Agency

Twenty-four days ago I posted about Jake's folder system video. This is what happened next. Same foundation — markdown files, orchestration prompts, clear roles. I just kept building. Fifteen named specialists. Each one with a soul file, guardrails, and a playbook. Duke orchestrates. Cash writes. Trace pulls the data. Hank runs the financials. Clint handles the MCP integrations. Behind each one is either a human counterpart doing the real work alongside them — or a role I can't afford to hire yet. Katie who's been with me for 18 years, now has her own orchestrator running the same system. Twenty-seven client folders. Twelve live MCP integrations. One shared repo. The folder system isn't replacing my agency. It becoming my agency. Jake gave me the unlock. This is how it's going.

124

180

New comment 14m ago

Pinned

Matthew Creamer

Mar 15 •

📢 Announcements

Welcome to Clief Notes. Here's where to start.

1. Watch the intro video and introduce yourself in the intro post here 2. Start with The Foundation (free course). Concepts, folder architecture, prompting framework. Everything else builds on this. 3. Check in at the bottom of each lesson. Polls, discussion posts, other members working through the same stuff. Use them. 4. When you're ready to build real things, move to Implementation Playbooks (Level 2). When you're ready to build your own tools, Building Your Stack (Level 3). 5. Post your work. Ask questions. Help others when you can. What are you here to build?

Poll

6606 members have voted

3.2k

New comment 6m ago

Pinned

Matthew Creamer

6d •

💰 Competitions

🏆 WEEKLY COMP #7: THE OPERATOR 🏆

🎟️ PRIZE: FREE SEAT IN THE LYCEUM 🎟️ Pick your cohort. Technical, Business, or Creator. Your call. ---- 🇬🇧 We're back. Good morning from London. 👋 Thanks for the patience last week. Jake and I needed a few days to breathe before London Tech Week kicked off, and you all responded with nothing but support. We don't take that for granted. Now let's get back to building. ---- 📋 THE CHALLENGE Build a folder-based AI operator that handles ONE operational workflow end-to-end. You pick the workflow. This week's deliverable is one operator folder that someone could drop into a Claude project and use to handle a real business workflow without babysitting. ---- 🎯 PICK YOUR WORKFLOW The workflow is yours. Pick something specific. Pick something you'd actually use. A few sparks to get you thinking: - 🎫 Customer support triage (which tier handles this ticket?) - ✅ Content review and approval - 📨 Lead intake and qualification - 💸 Refund request handler - 🤝 Partnership pitch evaluator - 🎙️ Podcast guest pitch sorter - 💼 Freelance project intake - 📄 Resume screen for one specific role - 📅 Meeting request triage (book, decline, delegate) The more specific, the better. "Customer support" is too broad. "Refund request triage for an ecommerce store doing under 200 orders per month" is right. 📎 If you want a fully written client brief as a reference, the attached PDF walks through one example. Don't build the example. Use it as a template for how to think about scoping your own operator. ---- 🗂️ THE METHODOLOGY If this is your first comp, welcome. Here's what you need to know: This week (and every week) you're learning interpretable context methodology. Folders as architecture. Each file does one job well. Your operator is a folder with five things: - 📄 identity.md (who the operator is and what workflow they own) - 📐 rules.md (the decision logic: criteria, edge cases, escalation rules) - 💬 examples.md (decisions in action, including at least one edge case) - 📚 reference/ (checklists, templates, rubrics) - 📖 README.md (how to use it)

New comment 4m ago

Curtis Hays

1d •

📚 Resources & Finds

SkillOpt — Has Anyone Looked at This?

A CEO of an MSP that I've been one on one consulting on ICM sent this article to me this morning. Microsoft's SkillOpt automatically upgrades AI agent skills without touching model weights Repo: https://github.com/microsoft/SkillOpt The short version: instead of fine-tuning model weights, it treats your markdown skill files as the trainable parameter. It runs your agent against benchmark tasks, analyzes what went wrong, proposes bounded edits to the skill doc, and only accepts changes if held-out validation strictly improves. The deployed artifact is a single best_skill.md file — no extra model calls at inference. They're reporting +19.1 points on Claude Code benchmarks. That's output quality — the agent getting the right answer more often — not token savings. When I first started building skills in my ICM, they were bloated. Long, unstructured, burning tokens. If you're letting Claude build your skills for you (which is the natural thing to do), you're not necessarily getting an optimized artifact — you're getting whatever Claude thought was thorough at the time. The question SkillOpt is trying to answer: is the skill actually performing, or is it just big? Where I'm less sure it translates: most of the benchmarks are coding tasks, where "correct" is binary. I have three copywriters — Cash, Clyde, and Wradley. Scoring whether a piece of copy is better is a different problem. Harder to define, harder to gate automatically. Is this a layer worth putting on top of nuanced specialist work? Has anyone here dug into it?

New comment 14m ago

Simon Gonzalez De Cruz

24m •

💬 General discussion

When Fable got pulled, only 10% of my stack noticed

I sort every task by leverage — what changes the shape of the whole build if it's right or wrong. On an expensive model, that sort is the whole game. The top of the list gets the frontier. Everything under the line gets offloaded to cheaper and local models through Pushing Dispatch from @Ari Evergreen credit where it's due. So Fable was never running my stack. It sat at the top doing the one slice worth frontier money: deep diagnosis, strategic synthesis, finding the gaps and the false "fixed" claims. The 90% underneath — building, refactors, the delegated volume — ran on MiniMax M3, GLM, Kimi, Codex and local models. When you see Fable delegating or spinning out subagents, that's the offload. Fable decides; the cheaper lanes do the work. Concrete from yesterday: I had Fable audit CheckYourself, my open production-readiness checker. Deep review plus hands-on testing, three real bugs found, fixes shipped. (It also kept bouncing me to Opus 4.8 on anything it decided was "sensitive." Annoying then. Funny now.) So when access got pulled, what broke was that top slice. That's it. The 90% never ran on Fable in the first place. I swapped which model sits at the top, and the machine kept moving. That's the part I keep coming back to. Sort by leverage, and the expensive model becomes a specialist you can fire and replace — not the foundation everything sits on. If losing one model stops your work, the model was never the problem. The wiring was. I don't know if Fable comes back the same, dumbed down, or at all. Doesn't change what I ship Monday. When it got pulled — did your work stop, or did you swap one model and keep going?

New comment 22m ago