John Johns

AI & QA Accelerator

Activity

Mon

Wed

Fri

Sun

Jun

Jul

Aug

Sep

Oct

Nov

Dec

Jan

Feb

Mar

Apr

What is this?

Less

Memberships

AI & QA Accelerator

604 members • Free

AI Automation Society

351.3k members • Free

1 contribution to AI & QA Accelerator

Matviy Cherniavski

Feb 28 •

AI&QA

AI Coding Agents for QA: Part 1 — What They Are and Why It Matters

AI is everywhere, and it's easy to feel overwhelmed. Codex. Claude Code. Cursor. Windsurf. Copilot. New names every week, new hype every day. But they all describe the same concept: AI coding agents. ──────────────────────────────────────── 𝐖𝐡𝐚𝐭 𝐈𝐬 𝐚𝐧 𝐀𝐈 𝐂𝐨𝐝𝐢𝐧𝐠 𝐀𝐠𝐞𝐧𝐭? Simple: it's a tool that interacts with AI and generates code. That's it. But like any tool in a QA engineer's kit, not all of them are equal. Some are great for specific tasks, some are poor at most things, and some are solid generalists you can use anywhere and get good results. I spent over $3,000 testing them so you don't have to. In this series of posts I'll share exactly what I found. Today, we start with the fundamentals. ──────────────────────────────────────── 🧠 𝐖𝐡𝐚𝐭 𝐈𝐬 𝐚𝐧 𝐋𝐋𝐌? LLM stands for Large Language Model, the brain powering every AI coding agent. But here's the key thing to understand: you never talk to the LLM directly. There's always a tool sitting in between: ► YOU ► Tool (Cursor / Copilot / Claude Code) ► LLM (GPT-5 / Claude / Gemini) The same pattern applies when you use AI chat apps, except the interface is built for conversation, not code. ──────────────────────────────────────── ⚡ 𝐖𝐡𝐲 𝐓𝐡𝐢𝐬 𝐌𝐚𝐭𝐭𝐞𝐫𝐬 𝐟𝐨𝐫 𝐘𝐨𝐮 The tool (cursor, etc) you pick is responsible for roughly 50% of your results. Here's why: the tool reads your code, decides what information to send to the LLM, and determines how much the AI actually understands about your project and how it can write the actual code. Different tools. Different developers. Different quality. Same LLM. Wildly different output. This is exactly why the same engineer, using the same LLM but a different tool, can get completely different results. For example, using the exact same ChatGPT LLM in Cursor versus Copilot for the same task will produce very different quality output. ──────────────────────────────────────── 📌 𝐊𝐞𝐲 𝐓𝐚𝐤𝐞𝐚𝐰𝐚𝐲𝐬 - LLM = the brain. You can't access it directly. - Tools (Cursor, Copilot, Claude Code) sit between you and the LLM. - The tool accounts for ~50% of the quality you get. - Different tools, different quality, different output even with the same LLM underneath.

New comment 16d ago

AI Coding Agents for QA: Part 1 — What They Are and Why It Matters

John Johns

2 likes • 16d

Interesting, I was thinking that a Model is the same as a Coding Agent.

1-1 of 1

Level 1

3points to level up

John Johns

@john-john-8186

Lead SDET, Fintech

Active 16d ago

Joined Apr 16, 2026

East

Contributions

Followers

Following