Question?
how are you structuring your AI coding setup? (sharing mine, want to compare notes)
been deep in the weeds optimizing my coding stack lately and want to see how you guys are actually running yours.
here's where I'm at:
what I pay for
  • Ollama
  • ChatGPT / Codex
  • Claude Code ($100 plan)
how I split it
  • manual coding sessions → Claude Code
  • my orchestration layer (I call it Hermes) → runs on Codex with an Ollama fallback
the issue: I cap out on the $100 Claude plan fast. so lately I've been running fcc-claude (free-claude-code) — it keeps the Claude Code terminal workflow but lets you point it at whatever API you want. feels like working in Opus, but the model underneath is swappable.
I rotate the backend between:
  • DeepSeek
  • MiniMax M3
  • and lately Qwen3:30b fully local
right now I'm routing local Qwen through Ollama into fcc-claude, so I'm basically running the Claude Code experience on my own hardware for free.
for the workflow layer I lean on Superpowers + a few MCP connections, and that's pretty much it.
so what I'm trying to figure out:
  • how are you structuring sessions — one main agent, or orchestration with subagents?
  • what's your model routing look like? mixing local + API like this, or all-in on one provider?
  • what are you running for the "framework" layer — superpowers, custom skills, raw prompts?
  • and real talk: what am I missing? where am I leaving gains on the table?
drop your setup below, even a rough sketch. trying to make sure I'm actually maxing this out and not just stacking tools for the sake of it.
3
1 comment
Carlos Jimenez
2
Question?
powered by
Ai Titus
skool.com/aititus-2906
Build your AI business before AGI changes everything. Real income, real freedom. I'll show you what actually works. Let's doooo this!
Build your own community
Bring people together around your passion and get paid.
Powered by