🍌🎨 G👀gle PaperBanana
Paper Banana: AI Multi-Agent System for Publication-Quality Diagrams.
Unlike single-model approaches, Paper Banana uses 5 specialized AI agents working together to create, critique, and refine technical diagrams with stunning accuracy.
How It Works:
PaperBanana implements a two-phase multi-agent pipeline with 5 specialized agents:
  • Phase 1 -- Linear Planning:
- Retriever selects the most relevant reference examples from a curated set of 13 methodology diagrams spanning agent/reasoning, vision/perception, generative/learning, and science/applications domains
-Planner generates a detailed textual description of the target diagram via in-context learning from the retrieved examples
-Stylist refines the description for visual aesthetics using NeurIPS-style guidelines (color palette, layout, typography)
  • Phase 2 -- Iterative Refinement (3 rounds):
Visualizer renders the description into an image (Gemini 3 Pro for diagrams, Matplotlib code for plots)
Critic evaluates the generated image against the source context and provides a revised description addressing any issues
  • Steps 4-5 repeat for up to 3 iterations
Open source ❤️ implementation and extension of Google Research’s PaperBanana
***Official code coming soon...
3
0 comments
Mišel Čupković
5
🍌🎨 G👀gle PaperBanana
powered by
Ai Titus
skool.com/aititus-2906
Build your AI business before AGI changes everything. Real income, real freedom. I'll show you what actually works. Let's doooo this!
Build your own community
Bring people together around your passion and get paid.
Powered by