The Handbook · Chapter 1

Episode 04

Every Needs a Watcher

Which AI tool to open for which job, what the words everyone keeps saying actually mean, and how to use any of them without becoming the cautionary slide.

Illuminated intro emblem for the SLAiYER Handbook

There is a very specific kind of workplace panic that happens when someone says, "You should just use AI for that," as if "AI" is a complete instruction and not an entire mall directory of tools all wearing the same name tag. As if! Which AI? ChatGPT? Claude? Copilot? Gemini? Perplexity? The thing in Teams? The thing your company announced in a town hall that nobody can actually find?

Here's the thing: "use AI" isn't really an instruction. AI is a term to describe an entire field. It's like being told to "just use internet." Umm, what now? Use internet for what? The difference is that with the internet, you already know which part does what, so you translate it without thinking. With AI, nobody's handed you that map yet — so the instruction isn't just vague, it's vague and you're decoding it blind. And the person who told you to "just use AI" probably couldn't tell you either.

But this is where the broken telephone stops.

Episode 4 opens the first page of something new: the Grimoire — our growing reference shelf, where the tools, terms, receipts, and "wait, what does that mean?" answers live. Inside it is the Handbook: the practical survival guide for using AI without becoming the person who says "agentic workflow" in a meeting and then explains nothing.

Every Slayer needs a Watcher — the one who tells her what she's walking into before she walks in. That's the whole job here. Not "which tool is best." The far more useful: what are these things, which one do I open for which job, and how do I work with any of them without getting vamped. Because "just use AI" is walking into the Hellmouth with a tote bag, a vague sense of optimism, and no idea what's down there.

Section

Illuminated emblem for Part I: VOCABULAiRY

Before we open a single tool: the words. Because half of feeling lost in any AI meeting is nodding along to vocabulary nobody defined — including, sometimes, the person saying it. You're allowed not to know any of these. Half the room is bluffing. Five foundational words for the thing itself, then the four types everybody argues about — sometimes in the same sentence.

Illuminated emblem: a quill writing on a scroll, gold star above

1Prompt

What it is

A prompt is what you type into AI. Not a search query. Not a command. A brief.

Anthropic's own prompt engineering documentation puts it this way: "Think of Claude as a brilliant but new employee who lacks context on your norms and workflows." The model has read more than you have. It has not met your VP. It does not know what "the deck" is, or that the Tuesday all-hands ran long. Everything that makes this work yours — the audience, the stakes, the deadline, the politics — lives in your head. The prompt is the only way it crosses over.

There is one rule that holds across every tool, every model, every version of every model:

The Golden Rule. Show your prompt to a colleague with no context on the task. If they would be confused, AI will be too.

That's it. That's the whole skill. Two people typing into the same tool, on the same Tuesday afternoon, get wildly different answers — because they wrote wildly different briefs. The variable isn't AI. It's the brief.

This isn't a talent. It's the most leveraged skill in this Handbook. Get it right and every other tool you ever open gets sharper. Get it wrong and the priciest model on earth hands you beige.

What makes a good brief

A useful brief gives AI five pieces of context. Each one closes a specific gap the model cannot close on its own.

Deliverable — what you actually want

AI cannot tell whether you want an email, a memo, a list, a slide critique, or a paragraph of analysis. Each is a different shape of output. Without a named deliverable, AI guesses the most generic option — usually a paragraph. Naming the deliverable is the first move because it constrains every choice that follows.
Audience — who's reading it

A note to your CEO is not a note to your team. AI knows the difference between formal and casual register, but it does not know who is reading. Tell it: "for my VP," "for the team channel," "for a client who's frustrated." The vocabulary, emphasis, and assumed knowledge all shift.
Format — length, structure, shape

"Brief" can mean three sentences or three paragraphs. "Bulleted" can mean five items or fifty. "A summary" can be a haiku or a memo. Specify length, structure, and shape — especially length — so AI doesn't have to guess.
Tone — formal, warm, direct, casual

Tone is a knob, not a setting. "Professional but warm" gets you something different than "warm and professional." "Direct, no hedging" gets you something different than "concise." The specificity here is taste, and your taste is one of the few things AI can't supply.
Constraints — what to include, what to leave out

"No buzzwords." "Don't mention pricing." "Skip the intro." Constraints feel like restrictions, but for AI they're gifts — each one rules out a generic average you didn't want. Phrase them positively where you can: "use flowing prose" beats "don't use bullets." (More on why in Where it goes wrong, below.)

If your own thinking is still messy, make the tool ask you questions before it writes. Something as simple as "Before you write this, ask me three questions that would change the output." The questions it asks will surface what you actually needed to brief.

Vague vs. briefed

Here's the same task asked two different ways. Same model, same minute.

Vague

"help with the deck."

A beige paragraph about decks that could be about anything. AI is filling the blanks with averages — every deck it's ever seen on the internet, none of them yours.

Briefed

"I'm presenting Q3 results to my VP tomorrow. Find the one slide that's still weak — too dense, too vague, or burying what matters most. Suggest a fix."

A specific, usable answer about your slide, with the stakes of your meeting, in the time you have.

Same tool. Same thirty seconds. The difference is the brief.

Where it goes wrong

The most common mistake is treating the prompt as a wish. "Help me with this." "Make this better." "Can you take a look?" These aren't briefs — they're gestures. AI doesn't read minds; it pattern-matches against averages from everything it's ever seen. Short brief in, generic answer out.

The second most common mistake is telling AI what not to do. "Don't be cheesy." "Don't use buzzwords." "Don't sound like a LinkedIn post." Anthropic's docs explicitly recommend the opposite — positive instructions outperform negative ones — because the moment you name the thing to avoid, the model now has the cheesy / buzzword / LinkedIn pattern in its head and has to actively suppress it. Say what you want instead. "Warm, direct, no jargon." "Plain prose, no bullets." "Sound like a human who's slightly amused."

In Episode 2's terms: AI hearing "write an email about the project" is David Rose in Moira's kitchen hearing "fold in the cheese." It tries. It has no idea what that means.

→ Episode 2: Tell Me What You Want — the prompting deep dive

Practice

On your next real task, prompt it twice. First the way you'd ask standing in the doorway — quick, vague, the way you'd ask a coworker. Then again with deliverable + audience + format + tone + constraints. Read both. The gap between answer one and answer two is exactly what this section just taught you. Once you've felt it, you can't un-feel it.

Sources

Anthropic. Prompt engineering best practices. The "brilliant but new employee" framing and Golden Rule are theirs.
OpenAI. Prompt engineering guide. Hierarchical instructions, few-shot examples, structured prompts.

Illuminated emblem for the model / LLM concept

2Model / LLM

What it is

The "model" is the engine inside the app. The app is what you open — claude.ai, chatgpt.com, gemini.google.com, the Copilot inside Teams. The model is what's actually generating the answers. One app usually offers several engines, and you usually have a choice between them.

"LLM" — large language model — is the technical name for the engine behind most of the AI you're using right now. You don't need to know which one. You just need to know there's more than one, and they aren't all the same.

Wharton professor Ethan Mollick makes this concrete with a three-tier framework that's worth borrowing for the rest of this Handbook:

Models — the underlying AI brains. Claude Opus 4.8. GPT-5.2. Gemini 3 Pro. The capability comes from here.
Apps — the interfaces you open. claude.ai, chatgpt.com, the Copilot embedded in Teams. The experience comes from here.
Harnesses — the systems that let AI do things, not just say things. Claude Code, ChatGPT Agent mode, Cursor. Same model, different harness — completely different capability.

Same model + different harness = different capability. Same app + different model = different answer. Knowing which lever to pull is half the skill.

How models differ

Inside almost every modern AI app you'll find at least two kinds of model. They behave quite differently.

Base / chat models — the fluent ones

Fast, conversational, optimized for back-and-forth. Excellent for quick rewrites, brainstorms, surface explanations, drafting. They generate one answer fast. You get a paragraph in under three seconds.
Reasoning models — the slow, deliberate ones

These work the problem through internally before answering. They're better at math, logic, multi-step plans, anything that needs its work checked. They were a genuinely new development — OpenAI shipped the first one (o1) in September 2024. Anthropic and Google followed within the year (Claude's "extended thinking," Gemini "Thinking"). The cost: 20–60 seconds per response instead of two. The benefit: dramatically fewer confidently-wrong answers on the hard stuff.

Mollick's bluntest line: most people leave the picker on "auto" and get the base model by default. For anything that matters, manually switch to the reasoning version. The minute you wait is the hour you don't spend cleaning up the answer.

Which one when

A rough rule:

Fast model

Quick. Recoverable. Conversational.

The rewrite. The short list. The "what's the word for...?" If the answer's wrong, you'll know in five seconds.

Reasoning model

Logic. Math. Multi-step. Stakes.

The five-step plan. The financial scenario. The contract you're reading. The decision you can't undo.

Mollick is blunt about the free-tier trap: free AI is "optimized for chat, rather than accuracy." For work that matters, paying for the frontier reasoning model is the difference between confident-and-wrong and slow-and-right.

Where it goes wrong

Two common mistakes.

Using the fast model for hard problems. It happens because the fast model is the default. You ask a strategic question and get back a fluent paragraph that sounds confident — but didn't actually work the problem through. The reasoning model would have caught the gap.

Using the reasoning model for everything. It's overkill for a quick rewrite, and you pay for thinking you didn't need (in time, sometimes in dollars). Save it for when the stakes justify the wait.

The other trap: assuming all the AI tools are equivalent. They aren't. As of mid-2026, Claude Opus, GPT-5.2 Thinking, and Gemini 3 Pro are peer-level for general capability, but each has strengths. Claude is widely regarded as the strongest writer. GPT excels at structured reasoning. Gemini integrates deeply with Google's data and image tools. The right question isn't "which is best?" — it's "which one is good at this kind of task?"

Practice

Open your tool. Find the model picker — usually a small dropdown near the top of the chat. Note which models are available and which is set as default. Then take a real decision you're avoiding and ask the same question two ways: once with the default model, once with the reasoning version ("extended thinking," "Thinking," whatever your app calls it). Read both side by side. The difference is the muscle you'll want the next time the stakes are real.

Sources

Anthropic. Models overview. Current Claude models, pricing, knowledge cutoffs.
Ethan Mollick. A Guide to Which AI to Use in the Agentic Era. One Useful Thing, February 2026. The Models / Apps / Harnesses framework.

Illuminated emblem for the training-data concept

3Training data

What it is

Training data is everything the AI was fed while it was learning. The composition depends entirely on what the model is built to do.

Text-generating models (Claude, ChatGPT, Gemini) — trained on books, websites, articles, forum posts, transcripts, code. Trillions of words.
Image generators (DALL·E, Midjourney, Stable Diffusion, Google's Nano Banana) — trained on billions of images paired with their captions.
Video generators (Sora, Veo, Runway) — trained on enormous quantities of video, often with text descriptions of what's in them.
Voice / audio models (ElevenLabs, Suno) — trained on hours of human speech or music.
Math & code models — trained on mathematical equations, solved problems, GitHub repositories, technical papers.
Multimodal frontier models (GPT-5, Claude Opus 4.8, Gemini 3 Pro) — trained on text and images and audio and code, all together, so a single model can read a screenshot, summarize a PDF, and write an email back.

A model can only generate things it was shown how to generate. The provider compiles the training set, runs the training, and then the feeding stops. From that point on, the model has a fixed view of the world. Two things matter about that fixed view, and both shape how you use AI at work — and both apply across every modality above.

The cutoff. At some point the training stops, and the model's view freezes. Claude Opus 4.8's training cutoff is January 2026. Claude Haiku 4.5's is July 2025. OpenAI publishes the same for GPT. Anything that happened after is invisible to the model unless you hand it the new information directly.
What you type today might train tomorrow's model. The default is not what most people expect — and it varies sharply by account type. More on that below.

The cutoff, explained

Imagine a friend who watched every episode of Sex and the City through the 2004 finale and stopped there. She remembers all of it — Carrie's column, the Aidan years, the Berger Post-It. Ask her about anything pre-2004 and she's flawless. Ask her about And Just Like That — about Big having a heart attack on the Peloton in the first episode of the spin-off — and she has nothing. That's not on her. That's where the watching stopped.

The model is the same. Brilliant up to its cutoff, blind after. Three workarounds:

Hand it the new information

Paste the current policy. Attach the recent doc. Quote the email. If the relevant facts live in your conversation, the cutoff doesn't matter — the model now has them.
Use a tool with live web search

Perplexity, ChatGPT with browsing, Claude with web search, Gemini grounded with Google. The tool fetches the now and brings it into the conversation. The output should cite the sources it actually pulled.
Wait for the next model release

New training run = new cutoff. Anthropic, OpenAI, and Google all release new models every few months. Your knowledge horizon moves with theirs.

What you can't do: trust the model on time-sensitive information without checking. Confidently-wrong is the model's default failure mode when the answer lives after the cutoff.

Who uses your inputs

The most consequential paragraph on this page. The default behavior of every major AI tool depends on what kind of account you have. The defaults are not the same.

Personal account

ChatGPT Free / Plus / Pro · Claude Free / Pro · Gemini Free

Training: ON by default. Your conversations may be used to train future models unless you opt out in settings. ChatGPT also offers "Temporary Chat" mode that skips training.

Business account

ChatGPT Team / Enterprise / Edu · Claude for Work · API · Google Workspace Gemini

Training: OFF by default. Your inputs and outputs are protected by the business agreement and not used to train future models.

Anthropic's published policy puts it directly: "We may use your Inputs and Outputs to train and improve Anthropic AI models, unless you opt out through your account settings." Business and enterprise accounts are governed by separate commercial terms — the consumer policy doesn't apply.

Translation for the working professional: anything you paste into a PERSONAL account today might become part of what a stranger's AI model knows next year. Anything you paste into a COMPANY-APPROVED business account stays inside the business agreement. The line between safe-to-paste and not-safe-to-paste isn't a moral judgment — it's a contract. This is the entire foundation of the Account Rule in Part V.

Where it goes wrong

The cutoff trap. You ask the tool about something current — last month's policy, this quarter's launch, today's news. The tool answers confidently, in the present tense, as if the information were live. It isn't. The model is filling in from old patterns. The most dangerous version: the tool invents a recent thing that doesn't exist (a product, a person, a paper) because the pattern of "a recent thing about X" is in its training data even when the specific X isn't.

How to spot it: anything time-sensitive. Anything with a version number or a date. Anything where "as of when?" is a fair question.

The default-account trap. You paste a confidential client doc into your personal ChatGPT, asking for a summary. You've just signed your client's information into OpenAI's training set. This isn't a thought experiment — it's the policy. There have been real incidents: the Samsung engineer who pasted proprietary code into ChatGPT in 2023, and many quieter examples since. The fix is structural, not behavioral: company-approved business account, or nothing.

→ Chapter 5: The Account Rule — the safety box

Practice

Two checks, under sixty seconds total.

First, the cutoff. Open your tool. Ask point-blank: "What's your training cutoff date?" If the answer is more than six months ago, anything time-sensitive in your work needs to come from somewhere else — a live web search, a pasted source document, a verified human.

Second, the account type. Look at the account name in the top corner. Personal email = personal account = training default ON. Work email plus your company's enterprise tenant URL = business account = training default OFF. Most people get this wrong because "I'm signed in with my work Google" doesn't always mean business. Confirm before you paste.

Sources

Anthropic. Privacy Policy. Consumer-account default is training ON with opt-out; business accounts governed by separate terms.
OpenAI. Data Controls FAQ. Free / Plus / Pro: training default ON with opt-out via Settings → Data Controls. Team / Enterprise / API: training default OFF.
Anthropic. Models overview. Published training-data cutoff dates per model.

Illuminated emblem for the token / context-window concept

4Token & context window

What it is

Two related ideas, often used interchangeably even though they're not the same.

A token is a chunk of text — usually a word or part of one. AI doesn't process letters; it processes tokens. The rough rule for English: about 0.75 tokens per word, or four characters per token. When you hear "this model has a 200k token context window," that's roughly 150,000 words of working memory.

The context window is how much the tool can hold in mind at the same time — everything you've typed in this conversation, every file you've attached, every instruction the tool was given at the start. Anthropic calls it the model's "working memory" — separate from the training data, which is the model's long-term knowledge.

Current sizes, as of mid-2026:

Claude Opus 4.8, Sonnet 4.6, Fable 5 — 1 million tokens (~750,000 words)
Claude Haiku 4.5 — 200,000 tokens (~150,000 words)
ChatGPT GPT-5 — 1 million tokens on paid tiers
Gemini 3 Pro — 2 million tokens

Sounds enormous. It is. But there's a crucial second fact: more isn't always better.

The context window degrades under load

Anthropic calls it "context rot": as the conversation grows, the model's accuracy and recall degrade — even within the official limit. This isn't a bug. It's how attention works at scale.

The defining research is a 2024 Stanford paper, "Lost in the Middle: How Language Models Use Long Contexts" (Liu et al., TACL). The team tested six AI models on long documents and found a consistent U-shaped curve: accuracy is highest when relevant information sits at the beginning or end of the input, and degrades more than 30% when it's stuck in the middle.

The finding replicated across GPT-3.5, GPT-4, Claude, LongChat, MPT-30B, and Cohere. It isn't a quirk of one tool. It's a fundamental property of how large language models read long inputs. Practically:

The first thing you say and the last thing you say are the things the model remembers best.
Stuff buried in the middle of a long document or long conversation is the most likely to be missed or paraphrased loosely.
"Just paste in everything" is rarely as good as "paste in the relevant section, plus the one fact that matters."

Memory is not the context window

Quick clarification because the terms blur, and the difference matters.

Context window

Working memory of THIS conversation.

Resets when you start a new chat. Everything earlier is gone. The desk.

Memory feature

Facts saved ACROSS conversations.

"My name is Sarah." "I prefer numbered lists." Stored as a profile the tool reads first in every new chat. The file cabinet behind the desk.

Memory comes onto the desk. The desk doesn't keep what was on it yesterday.

How to use the window well

Three moves that compound.

Keep one conversation focused on one thing

When you switch topics, start a fresh chat. Don't make the tool drag a 60-turn architecture discussion behind your follow-up question about formatting an email. The fresher the context, the sharper the answer.
For long docs, put what matters at the top

Anthropic explicitly recommends placing long inputs near the top of the prompt, above your query. This is the inverse of how most people compose it ("here's my question, here's the doc"). Flip it: doc first, then question. The "Lost in the Middle" effect makes the top and bottom of context the strongest positions.
Watch for drift, and reset early

The signal: tone shifts. Your original constraints get dropped. The answer paraphrases your source instead of quoting it. That's the model losing the thread. Close the chat, open a new one, and re-paste only the parts that still matter.

Where it goes wrong

The wall-of-text trap. You paste a 60-page document and ask three questions. By question three, the answer sounds right but is paraphrasing the gist rather than quoting the source. You won't catch the slip unless you check against the document.

The eternal-thread trap. You've been in the same chat for two weeks. The model knows everything that's ever happened in this conversation — which sounds great, except attention degrades as the window fills, and the early stuff (your original constraints, your style preferences) starts dropping out. Long marathon chats are precisely where details slip.

The memory confusion. You turn on Memory expecting the tool to remember your project. Memory only stores what you (or the tool) explicitly told it to remember — usually facts about you, your role, your preferences. It does not store the actual content of last week's conversation. If you need that, you'll need to paste it back in.

Practice

Two short experiments.

First, the "Lost in the Middle" effect. Paste a long document into a fresh chat (fifteen-plus pages). Ask one question about the FIRST page, one about the MIDDLE, one about the LAST page. Compare. The middle answer will be the weakest. You will have just felt the U-shaped curve in real time.

Second, the drift signal. In your next long planning conversation, after eight or ten exchanges, ask: "What were the constraints I gave you at the start of this conversation?" If the answer is vague or partial, the window is filling. Start a fresh chat with just the constraints, and continue.

Sources

Anthropic. Context windows. Definition, current sizes, "context rot" framing.
Liu, Lin, Hewitt, Paranjape, Bevilacqua, Petroni, & Liang. Lost in the Middle: How Language Models Use Long Contexts. Stanford, Transactions of ACL, 2024.
Anthropic. Effective context engineering for AI agents.

Illuminated emblem for the hallucination concept

5Hallucination

What it is

A hallucination is when AI says something false in the exact same confident voice it uses for true things. The system isn't lying — it doesn't know what lying is. It's predicting the next plausible word, and sometimes the next plausible word is just wrong. It doesn't sound wrong. That's the whole problem.

A hallucination can be a fabricated source (a real-looking journal article that doesn't exist), an invented quote, a wrong date, a person who never worked at the company AI named them at, a legal precedent that was never filed, a price that doesn't match the live one. The category isn't "obvious errors." It's "errors that look exactly like correct answers."

It's also widespread. Stanford's 2026 AI Index Report tested 26 top AI models on one new accuracy benchmark and found hallucination rates ranging from 22% to 94%. Even on the best of them, roughly one answer in five contained fabricated information. The newest models are better than the old ones, but "better" still leaves enormous room for error in your work.

Why hallucinations happen

The most important insight on this page, and the one almost no one knows.

In September 2025, OpenAI researchers published a paper titled "Why Language Models Hallucinate" (Kalai, Nachum, Vempala & Zhang) with a finding that reframes the entire problem. Hallucinations don't happen because AI is broken. They happen because the training process actively rewards confident guessing.

The mechanic: AI models are evaluated against accuracy benchmarks. A confidently-wrong answer scores zero. A confident "I don't know" also scores zero — but a confident guess might be right, so guessing is the statistically optimal strategy. Across millions of training examples, the model learns: when uncertain, guess confidently anyway.

The paper's blunt conclusion: hallucinations are a "natural statistical error" baked into how AI is trained. They aren't a bug to be patched. They're a property of the incentive structure. A 2026 Nature paper reached the same conclusion: even retrieval, tool use, and self-verification can't fully eliminate hallucinations in current models. The rate is reducible. It is not yet removable.

This is the crucial reframe: hallucinations aren't going away soon. The skill is detection and prevention — not waiting for AI to "fix itself."

How to reduce hallucinations (Anthropic's playbook)

Anthropic's own guidance for reducing hallucinations comes down to four moves. They lower the rate significantly. They do not eliminate it.

Allow the model to say "I don't know"

The single most effective technique. Add this line to your prompt: "If you don't know or aren't sure, say so. Don't guess." It works because — as the OpenAI paper proved — the model is trained to fill silences confidently. You have to give it permission to abstain.
Ground answers in provided material

For tasks involving documents, ask the model to extract direct quotes first, then build the answer from those quotes only: "Find the exact sentences in this document that support your answer. Quote them. If you can't find one, say so." Grounding in the source closes the gap where hallucinations live.
Require citations for every claim

Ask the model to attach a source to every factual statement and to remove any claim it can't source. Anthropic's example prompt: "For each claim in your draft, find a direct quote in the source that supports it. If you can't find one, delete the claim and mark where it was with empty brackets."
Show the reasoning

Ask the model to walk through its logic step-by-step before answering. Faulty assumptions become visible. (This is what reasoning models do automatically; you can force it on basic models with: "Before you answer, walk through your reasoning step by step.")

How to detect hallucinations (Episode 3's playbook)

The deeper skill — and the full topic of Episode 3 — is detection after the AI has answered. The pattern that catches hallucinations is what Episode 3 called the Elle Woods move: look for the detail that doesn't survive contact with reality.

Chutney Windham gave a calm, confident, repeated alibi — she was in the shower right after a fresh perm. Sounded true. Sounded true twice. Fell apart on one specific check: you don't wash a fresh perm without destroying it. The story collapsed on a single piece of domain knowledge.

AI hallucinations work the same way. The answer is fluent, internally consistent, and confident. It collapses on the one specific check:

Does this link actually open?
Does the quote actually appear on that page?
Was this person at the title the model gave them?
Does this number match the source it cited?
Was this legal case actually filed? In which court? When?

The KPMG / GPTZero incident in 2026 is the canonical case study: KPMG (a Big Four firm) published an agentic AI report; organizations cited in it said the claims about their AI use were untrue. GPTZero confirmed the inaccuracies were caused by hallucinations. KPMG pulled the report. The lesson isn't that one firm got it wrong — it's that "we used AI" without verification is a real career risk, even at the most regulated companies.

→ Episode 3: The Burn Book Problem — the full detection playbook

Where it goes wrong

Trusting fluency. The output sounds right. It uses the right vocabulary. It's structured the way a real answer would be structured. Plausibility isn't truth. The most expensive mistakes in the corporate AI era so far have all had this shape: someone read a fluent AI answer, didn't verify, sent it to a client or a court or a board.

Asking the model "are you sure?" Asking AI to verify itself produces another fluent answer with no actual verification underneath. The model is generating "the kind of thing someone confident would say" — which is sometimes the same kind of thing it said the first time. (Episode 3's framing: that's Chutney repeating her alibi, not Elle finding the contradiction.)

Confusing reasoning models with reliability. Reasoning models (Claude with extended thinking, GPT with Thinking, Gemini Thinking) make hallucinations less common, but not absent. The Stanford 2026 finding (22%-94%) includes the latest reasoning models. The reasoning model is more careful. It is not infallible.

Practice

The single most useful exercise in this Handbook.

Ask your tool, on any work task: "Give me three recent stats with clickable sources to back this up." Then actually click them. The first link to nowhere — or to a real-looking site that doesn't say what the answer says it says — is the only proof you need to start checking everything that touches a fact, name, or number.

Bonus exercise: take a recent AI answer you actually used at work and try to verify ONE claim in it against a primary source. If you can verify it in under thirty seconds, the claim was safe. If you can't, the claim was a draft pretending to be a final.

Sources

Kalai, Nachum, Vempala & Zhang. Why Language Models Hallucinate. OpenAI / Georgia Tech, September 2025. The training-rewards-guessing finding.
Anthropic. Reduce hallucinations. The four-move playbook.
Stanford HAI. 2026 AI Index Report, Responsible AI chapter. Hallucination rates of 22%-94% across 26 top models.
Issue 03 of LAiDIES: The Burn Book Problem. The Chutney / Elle Woods detection playbook in full.

What it is: Confident false output that sounds exactly like a correct answer.
Why it happens: Training rewards guessing over saying "I don't know" (OpenAI 2025 paper).
How widespread: 22%–94% across top models (Stanford AI Index 2026).
Reduce it: Allow "I don't know." Ground in quotes. Require citations. Show reasoning.
Detect it: One specific check. Click the link. Verify the quote. The Chutney move.
Picture this: Joey Tribbiani on "moo point." Fluent. Confident. Wrong.
Try this: Ask for three recent stats with sources. Click the links.
Go deeper: → Episode 3: The Burn Book Problem
Remember: If she doesn't do the assignment, you can't do yours.

The four types everyone argues about

Best understood together — the whole point is how they differ. Picture the kind of show this brand was raised on: a group of friends fighting supernatural forces, each with a different role in the fight.

Bestiary plate: the witch who conjures (generative AI)

1Generative AI

Makes · the witch who conjures

You give her words; she makes a thing appear — a paragraph, an image, a slide, some code. This is what most people mean when they say "AI" right now.

Bestiary plate: the Watcher in the library (reasoning models)

2Reasoning models

Thinks first · the Watcher in the library

Works the whole problem through, then says what to do. Slower, and better at math, logic, multi-step plans — anything that needs its work checked. (This is your title: a Slayer needs a Watcher.)

Bestiary plate: the Slayer on patrol (agentic AI)

3Agentic AI

Acts · the Slayer on patrol

You send her out and she goes and does it. The leap from talking to doing — and exactly where the stakes change, because something that can act is only as safe as the permissions you hand it.

Bestiary plate: the prophesied one (AGI)

4AGI

Hypothetical · the prophesied one

The foretold figure everyone waits for — a hypothetical system that could handle basically any task a person can. Hold onto this: no public product you can open today is generally considered AGI.

One crucial thing: these are moods, not personalities. A single tool can be generative, switch on reasoning for a hard problem, and act agentically — shifting between them by task. They're not separate species you choose between; they're modes the same tool moves in and out of. (AGI is the exception — not a mood any tool today can switch into.)

End of Chapter 1

What's next

2

Next chapter The Briefing Rules

The 5 rules for talking to any AI tool — so you actually get back what you wanted, instead of beige.

Live

← Back to the Handbook

VOCABULAiRY

1Prompt

What it is

What makes a good brief

Deliverable — what you actually want

Audience — who's reading it

Format — length, structure, shape

Tone — formal, warm, direct, casual

Constraints — what to include, what to leave out

Vague vs. briefed

Where it goes wrong

Practice

Sources

2Model / LLM

What it is

How models differ

Base / chat models — the fluent ones

Reasoning models — the slow, deliberate ones

Which one when

Where it goes wrong

Practice

Sources

3Training data

What it is

The cutoff, explained

Hand it the new information

Use a tool with live web search

Wait for the next model release

Who uses your inputs

Where it goes wrong

Practice

Sources

4Token & context window

What it is

The context window degrades under load

Memory is not the context window

How to use the window well

Keep one conversation focused on one thing

For long docs, put what matters at the top

Watch for drift, and reset early

Where it goes wrong

Practice

Sources

5Hallucination

What it is

Why hallucinations happen

How to reduce hallucinations (Anthropic's playbook)

Allow the model to say "I don't know"

Ground answers in provided material

Require citations for every claim

Show the reasoning

How to detect hallucinations (Episode 3's playbook)

Where it goes wrong

Practice

Sources

The four types everyone argues about

1Generative AI

2Reasoning models

3Agentic AI

4AGI

What's next