Generative AI

Generative AI is software that creates new content — text, images, code, audio, video — based on patterns learned from enormous amounts of existing content. The "generative" part means it produces new output rather than classifying or analysing existing data. The "AI" part means it learned to do this from data rather than being explicitly programmed with rules.

That's the one-sentence version. The rest of this article builds the mental models you need to use generative AI effectively without falling into common traps.

How It Works (The Useful Abstraction)

You don't need to understand transformer architectures to use an LLM effectively, any more than you need to understand internal combustion to drive a car. But you do need a useful mental model.

The Autocomplete Model

Large Language Models (LLMs) like GPT-4, Claude, and Gemini are, at their core, extraordinarily sophisticated autocomplete engines. Given a sequence of text, they predict the most likely next token (roughly, the next word or word-piece). They do this prediction thousands of times in sequence to generate a response.

This single fact explains most LLM behaviour:

Why they're fluent: They've seen billions of examples of fluent text, so their predictions produce fluent text.
Why they hallucinate: They predict plausible-sounding next words, not true next words. "The capital of Australia is Sydney" sounds plausible — it's the kind of sentence that would appear in many texts — even though it's wrong (it's Canberra).
Why they're good at coding: Code is highly structured and predictable. The next token after for i in range( is very constrained.
Why context matters: The more context you provide, the more constrained the predictions become. Vague prompts produce generic outputs because many continuations are equally plausible.
Why they seem creative: They combine patterns in novel ways. A model that has read both Shakespeare and Python documentation can generate code comments in iambic pentameter — not because it's creative, but because it can blend learned patterns.

What "Training" Means

Pre-training: The model reads the internet (roughly). Billions of web pages, books, code repositories, academic papers. It learns statistical patterns: what follows what, in what contexts, in what styles. This takes months and millions of dollars. You don't do this.

Fine-tuning: The pre-trained model is further trained on specific types of interactions — like being helpful, following instructions, refusing harmful requests. This is what makes ChatGPT feel like a conversation partner rather than a text predictor. This takes days to weeks.

Prompting: You shape the model's behaviour at inference time by providing context in your message. No training required — just the right words. This is what you do dozens of times a day. See Practical Prompt Engineering.

For hands-on understanding of all three layers, see Running Local LLMs — running a model locally makes these distinctions tangible in a way reading about them never can.

What Generative AI Is Good At

Task	Why It Works	Example
First drafts	Producing "pretty good" text is exactly what next-token prediction does well	Draft a blog post, email, project proposal
Code generation	Code is structured and well-represented in training data	Generate a function, write tests, convert between languages
Summarisation	Compressing long text into key points follows learnable patterns	Summarise a 30-page report into 5 bullet points
Translation and reformatting	Converting between known formats is highly constrained	JSON to CSV, formal to casual, English to Spanish
Brainstorming	Generating many plausible ideas from a prompt space is the model's natural mode	"Give me 20 names for a productivity app"
Explaining concepts	The model has seen concepts explained at every level in its training data	"Explain Kubernetes like I'm a Rails developer"
Research assistance	Finding patterns and connections across large text inputs	Analyse interview transcripts for themes

What Generative AI Is Bad At

Task	Why It Fails	Workaround
Factual accuracy	The model predicts plausible text, not true text. It will confidently state wrong facts.	Always verify facts. Use AI for structure and drafting, not as a source of truth.
Math and logic	Next-token prediction is not computation. Models frequently make arithmetic errors.	Use AI to write code that does the math, not to do the math directly.
Knowing what it doesn't know	Models cannot assess their own uncertainty. They don't say "I'm not sure about this."	Treat all outputs as drafts requiring verification.
Consistency across long outputs	Context windows are finite. In very long outputs, the model may contradict earlier statements.	Break long tasks into sections. Provide the prior output as context for the next section.
Private or recent information	The model only knows what was in its training data (with a knowledge cutoff date).	Provide relevant context in your prompt, or use RAG (retrieval-augmented generation).
Following complex multi-step instructions	More steps = more opportunities for the prediction to drift off course.	Break complex tasks into simpler sequential prompts.

Mental Models That Prevent Mistakes

1. AI as Intern, Not Expert

Treat AI output like work from a smart, eager intern who has read a lot but has no professional experience. The output is often 70-80% right, structured well, and produced fast. But it needs review by someone who actually knows the domain. Never ship AI output without review.

2. AI as Thought Partner, Not Oracle

The best use of AI is not asking it for answers — it's thinking out loud with it. Describe your problem, ask it to poke holes in your reasoning, have it generate alternatives. The value is in the dialogue, not the first response.

3. Garbage In, Garbage Out (Still Applies)

Vague prompts get vague outputs. Specific prompts with examples, constraints, and context get dramatically better results. The difference between a 30-second prompt and a 3-minute prompt is often a 10x improvement in output quality. See Practical Prompt Engineering.

4. Different Models Have Different Personalities. Choose Wisely.

Claude tends toward careful, nuanced responses. GPT-4 tends toward confident, comprehensive ones. Gemini excels at multimodal tasks. Open-source models via local deployment vary wildly. Try the same prompt across models — the differences are instructive.

5. The Technology Is Moving Fast. Your Skills Compound.

Specific tools will change. GPT-5, Claude 5, Gemini 3 — by the time you read this, newer models exist. But the skills of effective prompting, workflow integration, and knowing what AI can and cannot do are durable. Invest in skills, not tool-specific knowledge. See Accelerating AI Learning.

The Honest Assessment

Generative AI in 2026 is genuinely transformative for certain tasks and genuinely overhyped for others. It will not replace you at your job. It will make someone who uses it effectively faster than someone who doesn't. The gap between "I've never tried it" and "I use it daily for the right tasks" is large and growing.

The rest of this cluster shows you how to cross that gap efficiently.