Generative AI

Generative AI is software that creates new content — text, images, code, audio, video — based on patterns learned from enormous amounts of existing content. The "generative" part means it produces new output rather than classifying or analysing existing data. The "AI" part means it learned to do this from data rather than being explicitly programmed with rules.

That's the one-sentence version. The rest of this article builds the mental models you need to use generative AI effectively without falling into common traps.

How It Works (The Useful Abstraction)

You don't need to understand transformer architectures to use an LLM effectively, any more than you need to understand internal combustion to drive a car. But you do need a useful mental model.

The Autocomplete Model

Large Language Models (LLMs) like GPT-4, Claude, and Gemini are, at their core, extraordinarily sophisticated autocomplete engines. Given a sequence of text, they predict the most likely next token (roughly, the next word or word-piece). They do this prediction thousands of times in sequence to generate a response.

**This single fact explains most LLM behaviour:**

- **Why they're fluent**: They've seen billions of examples of fluent text, so their predictions produce fluent text.

- **Why they hallucinate**: They predict plausible-sounding next words, not *true* next words. "The capital of Australia is Sydney" sounds plausible — it's the kind of sentence that would appear in many texts — even though it's wrong (it's Canberra).

- **Why they're good at coding**: Code is highly structured and predictable. The next token after `for i in range(` is very constrained.

- **Why context matters**: The more context you provide, the more constrained the predictions become. Vague prompts produce generic outputs because many continuations are equally plausible.

- **Why they seem creative**: They combine patterns in novel ways. A model that has read both Shakespeare and Python documentation can generate code comments in iambic pentameter — not because it's creative, but because it can blend learned patterns.

What "Training" Means

**Pre-training**: The model reads the internet (roughly). Billions of web pages, books, code repositories, academic papers. It learns statistical patterns: what follows what, in what contexts, in what styles. This takes months and millions of dollars. You don't do this.

**Fine-tuning**: The pre-trained model is further trained on specific types of interactions — like being helpful, following instructions, refusing harmful requests. This is what makes ChatGPT feel like a conversation partner rather than a text predictor. This takes days to weeks.

**Prompting**: You shape the model's behaviour at inference time by providing context in your message. No training required — just the right words. This is what you do dozens of times a day. See [Practical Prompt Engineering](PracticalPromptEngineering).

For hands-on understanding of all three layers, see [Running Local LLMs](RunningLocalLlms) — running a model locally makes these distinctions tangible in a way reading about them never can.

What Generative AI Is Good At

| Task | Why It Works | Example |

|------|-------------|--------|

| **First drafts** | Producing "pretty good" text is exactly what next-token prediction does well | Draft a blog post, email, project proposal |

| **Code generation** | Code is structured and well-represented in training data | Generate a function, write tests, convert between languages |

| **Summarisation** | Compressing long text into key points follows learnable patterns | Summarise a 30-page report into 5 bullet points |

| **Translation and reformatting** | Converting between known formats is highly constrained | JSON to CSV, formal to casual, English to Spanish |

| **Brainstorming** | Generating many plausible ideas from a prompt space is the model's natural mode | "Give me 20 names for a productivity app" |

| **Explaining concepts** | The model has seen concepts explained at every level in its training data | "Explain Kubernetes like I'm a Rails developer" |

| **Research assistance** | Finding patterns and connections across large text inputs | Analyse interview transcripts for themes |

What Generative AI Is Bad At

| Task | Why It Fails | Workaround |

|------|-------------|------------|

| **Factual accuracy** | The model predicts plausible text, not true text. It will confidently state wrong facts. | Always verify facts. Use AI for structure and drafting, not as a source of truth. |

| **Math and logic** | Next-token prediction is not computation. Models frequently make arithmetic errors. | Use AI to write code that does the math, not to do the math directly. |

| **Knowing what it doesn't know** | Models cannot assess their own uncertainty. They don't say "I'm not sure about this." | Treat all outputs as drafts requiring verification. |

| **Consistency across long outputs** | Context windows are finite. In very long outputs, the model may contradict earlier statements. | Break long tasks into sections. Provide the prior output as context for the next section. |

| **Private or recent information** | The model only knows what was in its training data (with a knowledge cutoff date). | Provide relevant context in your prompt, or use RAG (retrieval-augmented generation). |

| **Following complex multi-step instructions** | More steps = more opportunities for the prediction to drift off course. | Break complex tasks into simpler sequential prompts. |

Mental Models That Prevent Mistakes

1. AI as Intern, Not Expert

Treat AI output like work from a smart, eager intern who has read a lot but has no professional experience. The output is often 70-80% right, structured well, and produced fast. But it needs review by someone who actually knows the domain. Never ship AI output without review.

2. AI as Thought Partner, Not Oracle

The best use of AI is not asking it for answers — it's thinking out loud with it. Describe your problem, ask it to poke holes in your reasoning, have it generate alternatives. The value is in the dialogue, not the first response.

3. Garbage In, Garbage Out (Still Applies)

Vague prompts get vague outputs. Specific prompts with examples, constraints, and context get dramatically better results. The difference between a 30-second prompt and a 3-minute prompt is often a 10x improvement in output quality. See [Practical Prompt Engineering](PracticalPromptEngineering).

4. Different Models Have Different Personalities. Choose Wisely.

Claude tends toward careful, nuanced responses. GPT-4 tends toward confident, comprehensive ones. Gemini excels at multimodal tasks. Open-source models via [local deployment](RunningLocalLlms) vary wildly. Try the same prompt across models — the differences are instructive.

5. The Technology Is Moving Fast. Your Skills Compound.

Specific tools will change. GPT-5, Claude 5, Gemini 3 — by the time you read this, newer models exist. But the skills of effective prompting, workflow integration, and knowing what AI can and cannot do are durable. Invest in skills, not tool-specific knowledge. See [Accelerating AI Learning](AcceleratingAiLearning).

The Honest Assessment

Generative AI in 2026 is genuinely transformative for certain tasks and genuinely overhyped for others. It will not replace you at your job. It will make someone who uses it effectively faster than someone who doesn't. The gap between "I've never tried it" and "I use it daily for the right tasks" is large and growing.

The rest of this cluster shows you how to cross that gap efficiently.

Further Reading

- [Generative AI Tools for Individuals](GenerativeAiToolsForIndividuals) — What to use

- [Practical Prompt Engineering](PracticalPromptEngineering) — How to use it well

- [Running Local LLMs](RunningLocalLlms) — How to understand it deeply

- [AI-Augmented Workflows](AiAugmentedWorkflows) — How to integrate it into your work

- [Accelerating AI Learning](AcceleratingAiLearning) — How to keep getting better

- [Generative AI Adoption Guide](GenerativeAiAdoptionGuide) — Hub page