I’m trying to get this n8n flow consistently fast (aiming for sub-10s per run).
Current setup:
- Webhook → Code (preprocess)
- AI Agent (OpenAI Chat) with Pinecone Vector Store + OpenAI Embeddings
- BuildPayload → ElevenLabs (TTS) → rename → Supabase Storage upload
- In parallel: Supabase DB update + Analytics (Mixpanel)
- Final Respond to Webhook returning audio URL
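To figure out where the 10s actually goes, I’ve been sketching a little stage stopwatch to drop into the Code nodes (names like `llm`/`tts`/`upload` are just my labels, and the fake clock is only there so the example is deterministic):

```javascript
// Minimal stage stopwatch (sketch). In the real flow each mark() would live
// in a Code node right after the stage it measures; stage names are mine.
function createStopwatch(now = Date.now) {
  const start = now();
  let last = start;
  const stages = {};
  return {
    mark(stage) {
      const t = now();
      stages[stage] = t - last; // ms spent since the previous mark
      last = t;
    },
    report() {
      return { total: last - start, stages };
    },
  };
}

// Usage with a fake clock so the numbers are reproducible:
let t = 0;
const sw = createStopwatch(() => t);
t = 1200; sw.mark('llm');
t = 4200; sw.mark('tts');
t = 5000; sw.mark('upload');
console.log(sw.report());
// → { total: 5000, stages: { llm: 1200, tts: 3000, upload: 800 } }
```

In my runs so far the LLM + TTS legs dominate, which is why the questions below focus on overlapping them.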
Questions for the pros:
- Best way to parallelize the TTS/upload/analytics branch: Queue Mode with workers, or sub-workflows?
- Any wins from HTTP keep-alive, batching, or reducing hops (e.g. replacing several Set/If nodes with a single Code node)?
- Tips for caching or pre-warming (auth tokens, repeated Pinecone lookups), and for passing binary data by reference to avoid base64 bloat?
- Anyone using streaming (LLM partials → start TTS early) to overlap steps?
- Execution-data/DB settings you tweak to lower overhead?
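To make the Set/If consolidation question concrete, here’s the kind of single Code-node body I have in mind (field names are placeholders for my actual payload, and in n8n this would loop over `$input.all()` — it’s a plain function here so it runs standalone):

```javascript
// One Code node replacing a chain of Set/If hops (sketch; field names are
// placeholders). Takes one item's json, returns the payload for TTS/upload.
function buildPayload(json) {
  const text = (json.answer || '').trim();
  return {
    text,
    voiceId: json.lang === 'es' ? 'voice-es' : 'voice-en', // was an If node
    fileName: `reply-${json.requestId}.mp3`,               // was a Set node
    skipTts: text.length === 0,                            // was another If
  };
}

console.log(buildPayload({ answer: ' Hola ', lang: 'es', requestId: '42' }));
// → { text: 'Hola', voiceId: 'voice-es', fileName: 'reply-42.mp3', skipTts: false }
```

One node instead of four hops; curious whether the per-node overhead is even measurable or if I’m optimizing noise.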
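For the caching question, what I’m picturing is a small TTL cache for tokens and repeated embedding/Pinecone lookups; my assumption is that in a Code node the `store` object could be `$getWorkflowStaticData('global')` so entries persist between runs, but correct me if that’s wrong:

```javascript
// Tiny TTL cache (sketch). In an n8n Code node the `store` object could be
// $getWorkflowStaticData('global') so entries survive between executions.
function ttlCache(store, ttlMs, now = Date.now) {
  return {
    get(key) {
      const hit = store[key];
      if (hit && now() - hit.at < ttlMs) return hit.value;
      delete store[key]; // expired or missing
      return undefined;
    },
    set(key, value) {
      store[key] = { value, at: now() };
    },
  };
}

// Usage with a fake clock: cache an embedding result for 5 minutes.
let clock = 0;
const cache = ttlCache({}, 5 * 60 * 1000, () => clock);
cache.set('emb:hello', [0.1, 0.2]);
console.log(cache.get('emb:hello')); // → [ 0.1, 0.2 ]
clock = 6 * 60 * 1000;
console.log(cache.get('emb:hello')); // → undefined (expired)
```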
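On streaming, the idea I keep circling is buffering LLM deltas and flushing complete sentences to ElevenLabs as they arrive, so TTS starts before the full answer exists. A standalone sketch of the chunker (the delta strings are fake):

```javascript
// Buffer LLM partials and emit complete sentences so TTS can start early
// (sketch). Splits on sentence-ending punctuation followed by whitespace.
function sentenceChunker(onSentence) {
  let buf = '';
  return {
    push(delta) {
      buf += delta;
      let m;
      while ((m = buf.match(/^(.*?[.!?])\s+/s))) {
        onSentence(m[1].trim()); // a full sentence → send to TTS now
        buf = buf.slice(m[0].length);
      }
    },
    flush() {
      if (buf.trim()) onSentence(buf.trim()); // whatever remains at the end
      buf = '';
    },
  };
}

const out = [];
const c = sentenceChunker((s) => out.push(s));
['Hi there. This is', ' a partial answer! And the tail'].forEach((d) => c.push(d));
c.flush();
console.log(out);
// → [ 'Hi there.', 'This is a partial answer!', 'And the tail' ]
```

Has anyone wired something like this into an n8n flow, or does it only really work outside the Agent node in custom code?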
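On the execution-data side, these are the n8n env vars I’ve been eyeing; filesystem binary mode would also keep the MP3s out of the DB, which touches the base64 question too. Please correct me if any of these are off or have changed:

```shell
# Skip persisting full execution data for successful runs, keep errors.
EXECUTIONS_DATA_SAVE_ON_SUCCESS=none
EXECUTIONS_DATA_SAVE_ON_ERROR=all
# Prune old execution data so the DB stays small (max age in hours).
EXECUTIONS_DATA_PRUNE=true
EXECUTIONS_DATA_MAX_AGE=168
# Store binary data (the audio) on disk instead of base64 in the DB.
N8N_DEFAULT_BINARY_DATA_MODE=filesystem
```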
Short node patterns or screenshots would be awesome—thanks! 🙏