Most people treat AI like a vending machine. You type something in, something comes out, and you move on. That works. Until it doesn't. When you understand what's actually happening under the hood, everything changes. Your prompts get sharper. Your results get better. Your strategy gets smarter. And you stop being surprised when AI confidently tells you something completely wrong. So let's go there. An LLM Is Not a Brain. It's a Pattern Engine. LLM stands for Large Language Model. Large because it's trained on a staggering volume of text. Language because its entire existence lives in words, symbols, and code. Model because it's a mathematical system designed to find patterns. That's it. No consciousness. No memory between sessions. No tiny genius parsing your intent. What it does have: a trained ability to predict what word, phrase, or idea comes next based on everything it's seen. That single capability, applied billions of times across billions of parameters, produces outputs that look a lot like thinking. It is not thinking. It's predicting. At a scale that makes the distinction feel irrelevant, until it matters. The Core Mechanic: Next Word, Next Word, Next Word Every response you've ever gotten from an AI was built one token at a time. You type a prompt. The model evaluates every possible next word, assigns a probability to each one, picks a likely candidate, and repeats the process until the response is complete. It never sees the full answer before it writes it. It's building forward, token by token, the entire time. A token isn't exactly a word. It's a chunk: sometimes a full word, sometimes a syllable, sometimes a symbol. "Unbelievable" might become three or four tokens. "Cat" is one. This matters because AI tools are priced by tokens, context windows are measured in tokens, and some of the odd behaviors you've noticed (like AI miscounting letters in a word) trace directly back to how tokenization works. When a model says it supports 128,000 tokens of context, that's roughly a 300-page book it can hold in working memory at once.