Lesson 7: Fact-checking AI's confident answers — AI Skills

The confidence problem

Language models are trained to produce fluent, confident text. That training doesn't distinguish between things the model knows well and things it's making up. A hallucinated statistic and a real one read identically — same sentence structure, same declarative tone, same apparent certainty.

This is not a bug that will be fixed in the next update. It's inherent to how the models work. The only defence is a human review habit that targets the right things.

What to verify (and what to skip)

You don't need to verify everything — you need to verify the things that would matter if wrong. Apply different levels of scrutiny based on consequence:

Verify every time: specific numbers and statistics, named people and their roles, citations and quotes, legal or regulatory claims, medical facts, pricing and availability, dates and timelines.
Spot-check: historical claims, scientific consensus, named processes or frameworks (do they exist as described?).
Trust with minimal checking: general explanations of well-established concepts, structural suggestions (outlines, frameworks), tone and style choices, brainstormed ideas.

The three-step verify routine

For any claim in the "verify every time" category:

Step 1 — Ask the model itself. "Are you confident this is accurate? Is there anything here you're uncertain about?" This sometimes surfaces doubt the model didn't volunteer. It also sometimes doesn't — but it takes five seconds and occasionally saves you.
Step 2 — Search independently. Put the specific claim into a search engine (or a more recent AI tool with web access). You're looking for corroboration from a source you trust — a news outlet, government website, academic source, or the organisation directly. One good source is enough for most purposes.
Step 3 — Check the date. Even a correct fact can be outdated. If the claim involves anything that changes — rates, laws, prices, statistics — confirm when the source was published and whether it reflects the current situation.

A useful trick: ask for uncertainty

Build uncertainty disclosure into your prompts by adding: "Flag any claims you're not fully confident about, and note where I should verify independently." Some models honour this consistently; others don't. But when it works, it pre-sorts which parts of the output deserve the most scrutiny.

Another useful prompt: after getting an answer, ask "What's a counterargument or alternative view to what you just said?" This forces the model to surface evidence against its own output — useful for anything where you want a balanced view.

The two-minute standard. A thorough fact-check of a typical AI output — scanning for red flags, searching two or three key claims — takes about two minutes. That's the right investment before sending anything that represents you professionally. The one time you skip it is usually the time that matters.

Exercise

Fact-check a real AI output

Generate a response on a topic in your field, then apply the verify routine to it.

Ask AI about something in your industry or field.

Choose a topic where you'd actually use the answer — a trend, a statistic, a best practice, a named framework. Ask for a 200-word summary.

Highlight the red-flag items.

Go through the output and mark every specific number, name, date, citation, and "currently" claim. How many did you find?

Verify two or three of them.

Pick the two or three claims that would matter most if wrong. Run a quick search. Were they accurate? Outdated? Fabricated?

Note what you found.

Write one sentence: "In this output, I found [X] red-flag items. [Y] were accurate, [Z] were wrong or needed updating." This builds your calibration over time.

Key takeaways

What to remember

Confidence is not accuracy. AI produces wrong answers in the same tone as right ones.
Focus verification on: numbers, names, citations, legal/medical/financial claims, dates, and anything "current."
The three steps: ask the model for its uncertainty, search independently, check the date.
Add "flag anything you're uncertain about" to prompts where accuracy matters.