Paper Banana: AI Multi-Agent System for Publication-Quality Diagrams.
Unlike single-model approaches, Paper Banana uses 5 specialized AI agents working together to create, critique, and refine technical diagrams with stunning accuracy.
How It Works:
PaperBanana implements a two-phase multi-agent pipeline with 5 specialized agents:
- Phase 1 -- Linear Planning:
- Retriever selects the most relevant reference examples from a curated set of 13 methodology diagrams spanning agent/reasoning, vision/perception, generative/learning, and science/applications domains
-Planner generates a detailed textual description of the target diagram via in-context learning from the retrieved examples
-Stylist refines the description for visual aesthetics using NeurIPS-style guidelines (color palette, layout, typography)
- Phase 2 -- Iterative Refinement (3 rounds):
Visualizer renders the description into an image (Gemini 3 Pro for diagrams, Matplotlib code for plots)
Critic evaluates the generated image against the source context and provides a revised description addressing any issues
- Steps 4-5 repeat for up to 3 iterations
***Official code coming soon...