Issue 1 · Themed Reading List

How to Think About Decisions

Books about "decisions" flood the shelf, but few make the mechanism clear. These four each catch one stretch: where bias comes from, when simple beats complex, when expert intuition can actually be trusted, and how amateurs out-forecast experts.

2026 · Book Recommendations · Issue 1

Why These Four

"Decision-making" has been worked to death, but the four books here are not after the same thing. Kahneman catches the structure of error — why we get things wrong in predictable ways. Gigerenzer catches the boundary of simplicity — under genuine uncertainty, complex models collapse and one or two good rules suffice. Klein catches the fact that expert intuition really exists — but only inside certain environments, and for a specific reason. Tetlock catches that forecasting is trainable — not via IQ, but via a foxy cognitive style plus relentless scorekeeping. The goal isn't to memorize four labels; it's to see each mechanism clearly enough to restate it and apply it to the work in your own hands.

The Four at a Glance

BookAuthorYearThe One Thing This Book Nails
Thinking, Fast and SlowDaniel Kahneman2011What looks like "intuition" hides systematic, predictable biases — your mistakes aren't random, they're structural
Risk SavvyGerd Gigerenzer2014Separate "risk" (known probabilities) from "uncertainty" (unknown structure) — complex models win the former, simple heuristics the latter, and confusing the two costs dearly
Sources of PowerGary Klein1998Firefighters, ICU nurses, special-ops commanders don't "compare options" — they recognize a pattern, mentally simulate it, and decide within seconds
SuperforecastingPhilip E. Tetlock & Dan Gardner2015A handful of amateurs consistently out-forecast CIA analysts — not because they're smarter, but via a learnable foxy style and continuous probability calibration

The Four in Detail

Thinking, Fast and Slow
Daniel Kahneman · 2011
Farrar, Straus and Giroux · ~499 pages
Your mind runs in two modes — fast intuition and slow reasoning — and each one's strength is also its trap. Worse, the slow system is lazy.
Core Insight

Kahneman uses one metaphor to pack decades of experiments into a frame: two "fictional" characters live in your head. System 1 is automatic, fast, effortless — running in the background every second. It reads faces, judges distances, completes "2 + 2 = ?". System 2 is slow, effortful, attention-hungry — it does long division, holds a phone number in memory, restrains impulses. Most everyday judgment is handled by System 1; System 2 only wakes up when summoned, and it is lazy by default.

The real insight is not "we make mistakes" — everybody knows that — but that the mistakes have structure. System 1 uses a trick called attribute substitution: asked the hard question "Am I satisfied with my life?" it silently substitutes the easy one "How do I feel right now?" and delivers a fluent answer; the asker mistakes one for the other. Anchoring, availability, representativeness, loss aversion, framing — none of these are random glitches; they are the predictable systematic shifts System 1 produces when it uses substitution, pattern matching, and fluency as shortcuts.

The most counterintuitive part is Kahneman and Tversky's puncturing of the "expert." From clinical psychological diagnosis to wine ratings to judicial sentencing, simple linear formulas routinely beat human experts — because experts are unwittingly swayed by irrelevant variables (hunger, fatigue, the previous case's outcome) while formulas are not. The point isn't that experts are useless; it's that their judgment carries an unstable noise term and needs external rules to absorb it.

A less-quoted insight is the focusing illusion: whatever you are thinking about, in that moment, gets its importance inflated automatically. "Would I be happier if I moved to California?" — what you're actually answering is "California vs. current weather," because you cannot, while thinking, give income, commute, and relationships their proper weight. From this Kahneman draws an uncomfortable conclusion: human predictions of "how happy I will be after a decision" are structurally unreliable.

Key Quotes
"We can be blind to the obvious, and we are also blind to our blindness."
— Thinking, Fast and Slow, Introduction
"Nothing in life is as important as you think it is, while you are thinking about it."
— Thinking, Fast and Slow, Chapter 38 (the famous line on the focusing illusion)
Limits

During the 2010s replication crisis in social psychology, most experiments in Chapter 4 (priming effects) failed to replicate; Kahneman himself publicly acknowledged in 2017 that he would rewrite that section. System 1 / System 2 is a useful metaphor, not a literal neural structure — overextending it flattens phenomena it can't actually explain.

BigCat's Application

Kahneman's sharpest point in the AI era is that LLM output is extraordinarily fluent, which hits System 1's "fluent = correct" illusion squarely. You believe Claude's plan as you read it precisely because there are no pauses, no stumbles, every sentence is even. One thing to try next week: for every important judgment you make with AI (not daily emails), forcibly engage System 2 — ask the same model to generate "why the previous version is wrong," and then to produce an independent third version. Read all three side by side, and the spell of fluency breaks. The real AI power user is not the person who prompts fastest, but the one who resists the seduction of fluency.

Risk Savvy
How to Make Good Decisions · Gerd Gigerenzer · 2014
Viking / Penguin · ~322 pages
Where Kahneman sees biases as bugs, Gigerenzer sees many of them as features evolved for a noisy world — provided you tell "risk" apart from "uncertainty."
Core Insight

Gigerenzer has been in a decades-long debate with the Kahneman camp, and the disagreement isn't about any single experiment — it's about a premise. He insists on a distinction most readers skip yet that is decisive: risk is a world of known structure and computable probabilities — casinos, insurance actuarial tables, test false-positive rates. Uncertainty is a world of unknown structure, scarce samples, and shifting rules — startups, mate selection, long-horizon investing. The two worlds require two toolkits. Treating uncertainty as if it were risk is the true source of most "rationality failures."

His signature finding is that simple often beats complex. In the real, uncertain world, judgments made with one or two good rules routinely outperform multi-variable regressions. The statistical reason is the bias–variance tradeoff: a complex model fits the sample tightly, but most of what it grabs is noise; change the sample and it collapses. A simple rule ignores noise, and what's left is real signal. Equal-weighted 1/N diversification beats Markowitz mean-variance optimization over 50-year backtests — not because Markowitz is wrong, but because in an uncertain world the estimated covariance matrix is mostly noise.

His "take-the-best" heuristic: to judge which of two cities has the larger population, use only the first cue that distinguishes them (capital? major university? international airport?) — and ignore the rest. On multiple real datasets it beats logistic regression. The person who decides on one cue isn't lazy — they are choosing to ignore noise.

Another large section is on the real-world cost of statistical illiteracy. The classic case is breast-cancer screening for a 40-year-old woman: take the numbers from "sensitivity 90% / false-positive rate 9% / prevalence 1%" and recast them as "in 1,000 women, 10 actually have the disease, 9 of them test positive; among the 990 who don't, 89 test false-positive" — same numbers, and doctors' accuracy on the question jumps from roughly 10% to 87%. Representation determines whether the brain can compute it. From this comes "defensive decision-making" — doctors order tests they know are unnecessary, because the personal cost of a miss vastly outweighs the patient's cost of over-testing. Local rationality, systemic failure — institutional, not individual.

Key Quotes
"Risk literacy, like reading and writing literacy, is a basic survival skill of modern society."
— Risk Savvy, Introduction (the book's core claim)
"A complex problem does not always require a complex solution."
— Risk Savvy, Chapter 1 (the less-is-more principle)
Limits

In his polemics with the Kahneman camp, his critique of failed priming studies sometimes slides into impatience with behavioral economics as a whole. The boundary of "when does a heuristic actually beat the complex model" remains qualitative. The "natural frequencies" pedagogy has spread poorly in medical education — people's reasoning improves, but institutional flows don't follow.

BigCat's Application

Gigerenzer lands directly on investing judgment. Multi-factor models, AI stock-picking, and complex quant may work in the risk domain (HFT, market-making); but over a 5–10 year hold, estimated covariances and returns are mostly noise. One thing to try next week: for a long-term holding you currently judge by seven or eight weighted factors, perform a take-the-best inversion — allow yourself only one strongest cue (say, "is management putting irreversible capital into the second growth curve?"), drop the other six or seven, and ask: does the conclusion change? If yes, the complex model is helping; if no, the model is just rationalizing a decision you'd already made. A second move: anytime a doctor presents an "X% risk" diagnosis or screening recommendation, translate it into natural frequencies first — the same numbers will change your decision.

Sources of Power
How People Make Decisions · Gary Klein · 1998
MIT Press · ~330 pages
Real experts do not "compare options" — they recognize a pattern and mentally simulate it once. The catch: they live in a world that gives fast, clear feedback on regular events.
Core Insight

Klein went into fire departments expecting to confirm classical decision theory — that the incident commander, mid-fire, weighs options A, B, C. The commanders insisted: "We're not comparing — we just know what to do." He changed methods, interrogated hundreds of cases, and produced the RPD model (Recognition-Primed Decision): an expert sees a novel scene, identifies the closest match from thousands of stored patterns, simulates the corresponding action in their head once, and if the simulation runs clean, executes it; if a problem surfaces, they swap in the next pattern. They simulate one option at a time and never compare in parallel.

This rescued intuition from mysticism: expert intuition = pattern recognition + mental simulation, with no third ingredient. It also defines the boundary conditions. In 2009 Klein and Kahneman co-wrote Conditions for Intuitive Expertise: A Failure to Disagree, where the two seemingly opposed camps reached agreement: intuition is trustworthy if and only if two conditions hold — (1) the environment is regular enough (the same category of situation recurs), and (2) feedback is fast and clear enough (you find out whether your judgment was right). Chess, anesthesia, firefighting, ICU triage: both conditions hold. Stock-picking, long-horizon political forecasting, first-time executive hiring: neither holds. "Years of experience" in such fields is structurally the bias machinery Kahneman describes, dressed up as expertise.

When to Trust Intuition · Klein–Kahneman 2009
Feedback Speed → slow & vague ········ fast & clear Environment Regularity → Trust intuition Don't trust intuition Chess Anesthesia Fire command ICU Stock-picking Long political forecast First-time exec hire

Klein's other lasting contribution is the premortem. Conventional brainstorming asks "what could go wrong?" and is mediocre. Premortem shifts time forward: "Assume the decision has been made and one year from now it has spectacularly failed — each of you, independently, write down why." The "already happened" frame pries open the organizational silence (no one wants to be the wet blanket), and studies show it surfaces roughly 30% more real risks than forward-looking brainstorming.

Key Quotes
"Decision makers usually generate just one workable option and don't bother to think of others — they evaluate it by mentally simulating it, not by comparing it with others."
— Sources of Power, Introduction (the heart of the RPD model)
"The conditions for effective intuition: an environment of sufficient regularity, and an opportunity to learn its regularities through prolonged practice."
— Kahneman & Klein 2009, cited in later editions of Sources of Power
Limits

The method leans heavily on retrospective interviews — "recall a difficult decision and tell me about it" — and memory reconstruction makes experts narrate a judgment as more certain than it was at the time. The samples are almost entirely from high-validity industries (firefighting, military, medicine), so the conclusion "expert intuition is reliable" is partially baked into the research design.

BigCat's Application

Klein is most usable in hiring and team management. Technical interviews are usually built to "compare candidate A and B on details" — that's a Kahneman-friendly process. Klein flips it: put first the question of which people you have led before this candidate resembles, and what their 18-month trajectory turned out to be — pattern recognition is itself signal. Two things to try next week: (1) Five minutes before a key interview, independently write a premortem: "Assume this person has resigned 18 months in — what was the most likely reason?" Two or three concrete reasons surfacing means the red flags you only half-noticed are already there. (2) Use the 2D map above to audit your own decision domains: where on the map sits the type of decision you make (hiring, technical bets, your child's education path)? If lower-left, calling it "experienced judgment" is self-deception.

Superforecasting
The Art and Science of Prediction · Philip E. Tetlock & Dan Gardner · 2015
Crown · ~340 pages
A handful of part-timers beat their CIA counterparts. Not because they were smarter — because they used a learnable foxy style plus relentless calibration. So can you.
Core Insight

Tetlock's earlier book, Expert Political Judgment (2005), tracked twenty years of geopolitical forecasts and reached a result the field found awkward: the experts on TV, speaking with confidence, scored statistically no better than a dart-throwing chimpanzee. Superforecasting is the good news that followed — the IARPA-funded Good Judgment Project ran open forecasting tournaments and identified the top 2% of amateur forecasters who consistently outperformed intelligence-agency analysts (with classified data) by roughly 30%.

The dividing line wasn't IQ or expertise. It was Berlin's distinction (borrowed from Archilochus): the hedgehog has one big idea (the free market, geopolitical rivalry, technological determinism) and applies it to everything — great on TV, because confident and narratively clean. The fox knows many small things and flexibly assembles frameworks. Tetlock's data show that superforecasters are almost all foxes.

But the worldview switch is far from sufficient — the real engineering hides in two practices. The first is probabilization: translate "Will Trump win?" into "On date X, the probability he wins is __%," and write it down. At year-end, the Brier score scores you (rewarding both accuracy and confidence, punishing extremes that turn out wrong). The second is incremental updating: when new information arrives, move the estimate from 35% to 42% — and write down even a 7-point move. People hate the middle state of "neither anchored nor overreacting"; superforecasters drill it into a muscle. The book also stresses "outside view first": start from base rates (the historical frequency of similar events), then let the inside-view details adjust — a step nearly everyone skips.

Tetlock also concedes a boundary: superforecasters' edge concentrates on medium-short geopolitical questions in the 6–18 month range; beyond three years, they perform no better than average — Taleb's true black swans are structurally outside what this method can do. It's an honest concession: the method is not claimed to be universal.

Key Quotes
"Beliefs are hypotheses to be tested, not treasures to be guarded."
— Superforecasting, Chapter 7
"The hedgehog knows one big thing; the fox knows many things — and superforecasters are almost all foxes."
— Superforecasting, Chapter 4 (via Berlin via Archilochus)
Limits

Most tournament questions are short-to-mid-term geopolitical items (6–18 months) — scoreable, with clear deadlines, with bounded scope. The "black swans" that actually reshape history fall almost entirely outside this distribution, and the method does nothing for them. Whether the training effect persists post-IARPA funding has also been questioned.

BigCat's Application

Tetlock's most direct use is building a personal "probability ledger" for your big judgments. Take five core beliefs you currently hold and rewrite each as "On [specific date], the probability of X is __%," and park them in a Notion / spreadsheet: e.g., "By end of 2027, LLM inference cost has dropped another order of magnitude," "By end of 2026, holding A in the A-share market outperforms MSCI World," "By year-end, the number of days the child reads in English voluntarily is ≥ 80%." When the date arrives, return and score. Two counterintuitive side effects: (1) judgments you can't translate into a probability turn out not to be judgments at all but attitudes; (2) six months in, the ledger reveals the class of question on which you are systematically wrong — which is the only progress, in Tetlock's sense, that counts.

Questions to Ask Yourself

  1. The last time you made an important judgment "on intuition" — what kind of environment were you in? Lots of repetition + fast clear feedback (intuition trustworthy), or one-off + multi-year delayed feedback (intuition is most likely bias wearing the costume of expertise)?
    A LENS TO TRY

    Use the Klein–Kahneman 2D map: place the judgment onto it, which quadrant does it land in? Upper-right — your intuition is worth listening to. Lower-left — you have to force slow thinking, an external checklist, a premortem. What most people get wrong isn't the judgment itself; they don't notice that they are in the lower-left while believing they are in the upper-right. Years of "experience" in an irregular environment just accumulate more bias, not more wisdom.

  2. The decision you currently make with a complex model (five or more weighted factors): if you were allowed to keep only one, would the conclusion change?
    A LENS TO TRY

    This is Gigerenzer's take-the-best inversion. If the conclusion changes, the complex model is doing real work; if it doesn't, the model is post-hoc rationalization for a decision you'd already made — more common than you'd think. In high-uncertainty, low-sample domains, "simple" isn't cognitive laziness, it's hard work resisting overfit. The test: can you actually write down "that one most important factor" in a sentence? If yes, you already know. If no, you do need the complex model — and you're also admitting you're partly guessing.

  3. Take the single most important judgment in your head right now and rewrite it as: "By [specific date], the probability of X is __%" — can you write a specific number, or only "probably yes"?
    A LENS TO TRY

    This is Tetlock's core training. If "60–70%" and "40–50%" feel interchangeable to you, you're not forecasting — you're hedging your wording. A qualified probabilistic judgment satisfies three conditions: (1) a clear deadline; (2) a verifiable event; (3) you'd be willing to bet on it at the stated odds (if you'd never accept any bet, you don't actually have a number in your head). All three = a real judgment; learning is possible when you turn out wrong. Any one missing = it's an attitude or a wish, and time won't turn it into knowledge.