AI reliability is a hot topic. And something I figured out a long time ago while building in n8n is having the right harness. In fact, IBM's Tejas Kumar talked about the same thing. To increase the reliability of AI output, focus on the harness. Which is everything around it that can help it be more deterministic instead of random. Here's what I mean:
When AI agents first came out on n8n, everybody was building AI brains with 10 tools. However, if you ever tried doing this, you quickly realized how unreliable it was. At least I did. So what I did was go from the AI is the brain to the AI is part of the system.
This increased reliability by 10x, because now the AI could only take some input and output that. And I controlled that data, what nodes to fire, and how the output from the nodes should look.
This yielded far better results than trying to prompt the AI agent to do this and do that. Because as you know, trying to prompt yourself out of a problem never works. But controlling everything around the AI and only letting it do a few things? That works quite well.