RecapFlow : May 26th Coaching call analysis
πŸ“ SUMMARY
This week's call featured deep dives into agentic AI workflows, production-grade PDF extraction strategies, and real-world client monetization plays. Patrick unveiled his pre-release Community Brain distribution and detailed his migration from OpenClaw to Hermes, while Brandon shared his overnight "deep work" parallelization system using Codex. Scott broke down multiple shipping projects including a parking app that generated $5K in its first weekend and a video generation pipeline, while Morgan, Juan, Alex, and Bastian presented updates on lobby displays, AI photo booths, multi-tenant architecture decisions, and healthcare partnership negotiations.
πŸ’‘ KEY INSIGHTS
Spec-driven development only works after you have clarity on infrastructure, framework, and database choices. Without this foundation, it loops endlessly without forward progress.
For high-frequency simple tasks like single-word lookups, use smaller models like Gemini Flash or GPT-4o Mini instead of frontier models. The latency and cost drop dramatically with no quality loss.
When extracting content from complex PDFs, convert each page to an image and feed it to a cheap multimodal model like Gemini Flash with a markdown output prompt. This handles tables, flowcharts, and mixed content far better than text-only extractors like Dockling.
Hermes offers significant advantages over OpenClaw including cleaner API key management, built-in identity isolation, a Kanban dashboard, recursive self-updating skills, and native Codex integration.
AI engagements often serve as a foot in the door for larger consulting relationships. Clients present surface-level problems that reveal deeper systemic work once untangled. Focus on solving high-margin problems for businesses with significant revenue where ROI is obvious.
When building multi-tenant applications, start a brand-new project with organizations as a first-class concept rather than retrofitting an existing single-tenant app to avoid breaking live customers.
Adversarial code review using the Codex plugin for Claude Code consistently finds issues that Claude alone misses due to different model biases.
Queue up multiple parallel Codex work-tree sessions overnight, each running self-critiquing iterations on specific goals. Review and merge in the morning to effectively double productive hours.
Hermes recursively builds and prunes its own skills based on usage, keeping the system prompt within character limits and removing stale skills automatically.
❓ KEY Q&A
Question: Should I use Recap Flow to scan large PDFs and retrieve data?
Answer: No. Recap Flow is specifically built not to digest documents. Use Ollama for embeddings and handle PDF extraction separately beforehand.
Question: I built a vocabulary app prototype but lost momentum when shifting to spec-driven development. When is it worth it?
Answer: Spec-driven development requires a clear infrastructure foundation first. Since you already have mocks and specs, go back through ShipKit's master plan phases, point to your existing work, and let it fill gaps to create a roadmap. Then execute phase by phase.
Question: The vocabulary app is slow. How do I fix latency?
Answer: Use a smaller, faster model like Gemini Flash or GPT-4o Mini for single-word lookups. Large frontier models cost more and add unnecessary latency for simple tasks.
Question: Is Hermes productive outside of coding?
Answer: Yes. Every morning it emails asking for your daily goal, then follows up an hour later with research, prepped materials, and scheduled jobs. It interacts via Discord channels to keep context separated by topic.
Question: Any cons to Hermes?
Answer: Like any independent agent, you must isolate it properly for security. However, it is more organized than OpenClaw with single-place API key management and clear warnings about key safety.
Question: For OCRing scanned PDFs, should I use Tesseract?
Answer: No. Convert each page to an image and pass it to Gemini with a prompt to produce markdown. This works far better for complex layouts, tables, and graphics than Tesseract.
Question: Do you use specific prompting to convert flowchart images into Mermaid diagrams?
Answer: For medical protocols, convert decision trees into nested markdown bullet lists rather than Mermaid diagrams, as the output needs to be readable plain text for end users.
Question: Do you spawn parallel agents to process each PDF page?
Answer: It is simpler than that. Convert the page to image, base64 encode it, make one LLM call per page, and get markdown back. The complexity lies in handling edge cases like tables split across pages.
Question: I want to make my studio booking app multi-tenant. Any recommendations?
Answer: Build a brand-new project with organizations as a first-class concept from the start. Once it works with dummy orgs, port the existing customer over. Retrofitting risks breaking your current live customer.
Question: Have you found it necessary to add an additional memory layer to Hermes?
Answer: Yes. Honcho was added as an additional memory layer.
πŸ› οΈ TOOLS AND CONCEPTS MENTIONED
Hermes β€” Agentic AI framework replacing OpenClaw with cleaner API management, identity isolation, and recursive skill building. Patrick connects it to Codex, Discord, Tavily, Firecrawl, and Honcho.
Codex (OpenAI) β€” Used as a sub-agent tool within Hermes for coding tasks and for overnight deep-work sessions with adversarial code review.
Claude Code β€” Primary coding environment used by multiple members for all development work.
Gemini Flash β€” Cheap multimodal model recommended for converting PDF page images to markdown and for high-frequency simple tasks.
Honcho β€” Memory layer service added to Hermes for enhanced context retention.
ShipKit β€” Brandon's course/framework including master plan phases and prep templates used for structured development.
Ollama β€” Local model runner recommended for embeddings in PDF and RAG pipelines.
Firecrawl and Tavily β€” Web scraping and search APIs integrated with agentic workflows.
Remotion, ElevenLabs, and Higgs Field β€” Video generation stack using React-based framework, voice synthesis, and AI scene generation.
ORGO.ai β€” Cloud full-computer-use machines with API recommended for running Hermes agents in the cloud.
Fieldy β€” AI meeting note-taker with API that supports wake-word and termination-word patterns for agent commands.
πŸ“Ž SHARED RESOURCES
Patrick's Community Brain distribution repository
Hermes autonomous GitHub profile
OpenAI Codex plugin for Claude Code
Scott's video generation repository using Remotion and ElevenLabs
Honcho memory layer service
Wavespeed AI inference dashboard
Anthropic Claude Code in Action course
πŸ”„ FOLLOW-UPS WORTH EXPLORING
Hiral will restart her vocabulary app using ShipKit's prep phases with sequential and parallel LLM calls, switching to Gemini Flash or GPT-4o Mini for latency reduction.
Patrick will test the Community Brain distribution repo before wider release and extract his ChatGPT chat logs for integration into Hermes memory.
Brandon will add Codex adversarial code review to his overnight deep-work sessions and test Scott's video-generator for personalized cold-outreach clips.
Morgan will complete setting up Hermes on a VPS and evaluate ORGO.ai as a cloud alternative.
Scott will share his developer guide documents and n8n Claude Code skill via email to interested members, and will research newsletter monetization playbooks from Milk Road and The Hustle.
Alex will build a brand-new multi-tenant version of his studio booking app with organizations as a first-class concept rather than retrofitting the existing codebase.
Bastian will report back on the healthcare center partnership negotiation outcome.
Tom will investigate the Association of Old Crows as a potential customer for his military membership database application.
Juan will implement Vercel and AWS log drain with alerting for his photo booth application.
0
0 comments
Patrick Chouinard
5
RecapFlow : May 26th Coaching call analysis
AI Developer Accelerator
skool.com/ai-developer-accelerator
Master AI & software development to build apps and unlock new income streams. Transform ideas into profits. πŸ’‘βž•πŸ€–βž•πŸ‘¨β€πŸ’»πŸŸ°πŸ’°
Leaderboard (30-day)
Powered by