Cognitive Bias Evidence Map: Robust vs. Failed-to-Replicate Biases

Q: "Which cognitive biases are best supported by evidence?"

"Anchoring, framing effects, the availability heuristic, confirmation bias, and overconfidence/miscalibration are among the most robust and well-replicated. Loss aversion is robust as a phenomenon though its universality is debated. Planning fallacy and sunk cost are real but with caveats."

Q: "Which famous biases failed to replicate?"

"Ego depletion (the idea that willpower is a finite fuel) showed little to no effect in a large preregistered multi-lab replication. Several dramatic social/behavioral priming results (e.g. 'elderly words slow walking') have also failed to replicate. These should be cited only with the replication failure noted."

Q: "What is the replication crisis and why does it matter for bias claims?"

"In the 2010s, large efforts found many published psychology findings did not reproduce at their original strength - the Open Science Collaboration (2015) replicated well under half of 100 studies at full effect. Decision and social psychology were heavily affected, so any responsible cognitive-bias reference must flag which effects survived scrutiny."

Q: "Does knowing about a bias remove it?"

"Not reliably. The existence of a bias does not mean any particular 'debiasing' intervention works - debiasing is its own, often weaker, literature. Structural countermeasures (e.g. reference-class forecasting for the planning fallacy) tend to beat simply being aware of the bias."

Cognitive biases are popular - they fill listicles, business books, and slide decks. But not all of them are equally well-supported, and some famous effects have weakened or failed when researchers tried to replicate them.

This is an evidence map: it rates the best-known biases and decision effects by how strong and replicable the evidence is, with each entry linked to a primary source.

The goal is to let you tell the difference between a robust, well-replicated finding you can build on and a famous-but-shaky effect you should cite with caution. Where the research is strong, we say so.

Where it is contested or has failed to replicate, we say that plainly - because a bias map that ignores the replication crisis is itself misleading.

Why replication status matters here

Psychology went through a "replication crisis" in the 2010s: large coordinated efforts found that a substantial share of published findings did not reproduce at their original effect sizes.

The Open Science Collaboration's 2015 attempt to replicate 100 psychology studies successfully reproduced well under half at full strength. Decision and social psychology were among the hardest hit.

So a responsible cognitive-bias reference can't just list effects - it has to flag which ones survived scrutiny. That is what this map does.

"Robust" means the effect replicates widely and is broadly accepted; "supported with caveats" means real but bounded or sensitive to conditions; "contested / weak replication" means the original claim has been seriously challenged.

The evidence map

Bias / effect	What it claims	Evidence status	Honest caveat	Key source
Loss aversion	Losses loom larger than equivalent gains in decision-making.	Robust (but debated in scope)	Well-replicated as a phenomenon; researchers debate whether it is as universal/large as once claimed, and it can attenuate in some contexts.	Kahneman & Tversky (1979)
Framing effects	The same choice, framed as a gain vs. a loss, changes preferences.	Robust	Widely replicated across domains; magnitude varies with how the framing is constructed.	Tversky & Kahneman (1981)
Anchoring	An initial number disproportionately influences subsequent numerical estimates.	Robust	One of the more reliably replicated effects; size depends on relevance/plausibility of the anchor.	Tversky & Kahneman (1974)
Availability heuristic	We judge probability by how easily examples come to mind.	Robust	Well-established; "ease of recall" mechanisms are nuanced and context-dependent.	Tversky & Kahneman (1973)
Confirmation bias	We seek, interpret, and recall information that confirms prior beliefs.	Robust	Broad, well-documented family of effects rather than a single tidy experiment.	Nickerson (1998)
Overconfidence / miscalibration	People are more confident in their judgments than accuracy warrants.	Robust	Reliable in calibration studies; "overconfidence" is several distinct phenomena (overestimation, overplacement, overprecision).	Moore & Healy (2008)
Planning fallacy	We underestimate the time/cost of our own projects despite past evidence.	Supported	Well-documented; reference-class forecasting is the evidence-based countermeasure.	Kahneman & Tversky (1979, intro of concept); Buehler et al. (1994)
Sunk cost fallacy	Past, unrecoverable investment irrationally drives future decisions.	Supported with caveats	Real, but effect size and moderators vary; some lab paradigms are weaker than the folk version implies.	Arkes & Blumer (1985)
Ego depletion	Self-control is a finite resource that "depletes" with use.	Contested / weak replication	A large multi-lab replication found little to no effect; treat the strong "willpower as fuel" claim as unsupported.	Hagger et al. (2016), Registered Replication Report
Social/behavioral priming (e.g. "elderly words slow walking")	Subtle cues unconsciously and strongly shape behavior.	Contested / failed replication	Several flagship priming results have failed to replicate; treat dramatic priming claims with strong skepticism.	Open Science Collaboration (2015)

How to use this responsibly

Robust effects (anchoring, framing, availability, confirmation, overconfidence) are safe to teach and design around, while remembering effect sizes are conditional.
Supported / supported-with-caveats effects (planning fallacy, sunk cost, loss aversion) are real but should be described with their boundaries, not as iron laws.
Contested effects (ego depletion, dramatic social priming) should be cited only with the replication failure noted - or not used as load-bearing evidence at all.
The meta-lesson: the existence of a bias does not mean any specific intervention reliably "debiases" it. Debiasing evidence is its own, often weaker, literature.

Methodology and scope notes

Evidence basis: foundational papers for each effect plus, where relevant, large replication efforts (e.g. Registered Replication Reports, Open Science Collaboration 2015). Status ratings weight replication evidence heavily.
"Robust" is not "unlimited." Even well-replicated biases have moderators, cultural variation, and context limits. Treat ratings as evidence strength, not universal magnitude.
Not individual advice. This summarises general findings about typical decision-makers; it is not personalised, clinical, financial, or legal guidance.
Maintenance: updated when major new replication evidence changes a rating, not on a fixed schedule.

Sources

Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: Heuristics and biases. Science, 185(4157), 1124-1131. DOI: 10.1126/science.185.4157.1124
Kahneman, D., & Tversky, A. (1979). Prospect theory: An analysis of decision under risk. Econometrica, 47(2), 263-291. DOI: 10.2307/1914185
Tversky, A., & Kahneman, D. (1981). The framing of decisions and the psychology of choice. Science, 211(4481), 453-458. DOI: 10.1126/science.7455683
Tversky, A., & Kahneman, D. (1973). Availability: A heuristic for judging frequency and probability. Cognitive Psychology, 5(2), 207-232. DOI: 10.1016/0010-0285(73)90033-9
Nickerson, R. S. (1998). Confirmation bias: A ubiquitous phenomenon in many guises. Review of General Psychology, 2(2), 175-220. DOI: 10.1037/1089-2680.2.2.175
Moore, D. A., & Healy, P. J. (2008). The trouble with overconfidence. Psychological Review, 115(2), 502-517. DOI: 10.1037/0033-295X.115.2.502
Buehler, R., Griffin, D., & Ross, M. (1994). Exploring the "planning fallacy." Journal of Personality and Social Psychology, 67(3), 366-381. DOI: 10.1037/0022-3514.67.3.366
Arkes, H. R., & Blumer, C. (1985). The psychology of sunk cost. Organizational Behavior and Human Decision Processes, 35(1), 124-140. DOI: 10.1016/0749-5978(85)90049-4
Hagger, M. S., et al. (2016). A multilab preregistered replication of the ego-depletion effect. Perspectives on Psychological Science, 11(4), 546-573. DOI: 10.1177/1745691616652873
Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. DOI: 10.1126/science.aac4716

Frequently Asked Questions

Which cognitive biases are best supported by evidence?

Anchoring, framing effects, the availability heuristic, confirmation bias, and overconfidence/miscalibration are among the most robust and well-replicated. Loss aversion is robust as a phenomenon though its universality is debated. Planning fallacy and sunk cost are real but with caveats.

Which famous biases failed to replicate?

Ego depletion (the idea that willpower is a finite fuel) showed little to no effect in a large preregistered multi-lab replication. Several dramatic social/behavioral priming results (e.g. ‘elderly words slow walking’) have also failed to replicate. These should be cited only with the replication failure noted.

What is the replication crisis and why does it matter for bias claims?

In the 2010s, large efforts found many published psychology findings did not reproduce at their original strength - the Open Science Collaboration (2015) replicated well under half of 100 studies at full effect. Decision and social psychology were heavily affected, so any responsible cognitive-bias reference must flag which effects survived scrutiny.

Does knowing about a bias remove it?

Not reliably. The existence of a bias does not mean any particular ‘debiasing’ intervention works - debiasing is its own, often weaker, literature. Structural countermeasures (e.g. reference-class forecasting for the planning fallacy) tend to beat simply being aware of the bias.

Cognitive Bias Evidence Map: Robust vs. Failed-to-Replicate Biases

Why replication status matters here

The evidence map

How to use this responsibly

Methodology and scope notes

Sources

Related reading on When Notes Fly

Frequently Asked Questions

Share this article

Continue Reading

Effective Money-Saving Strategies Backed by Research

Inflation: Understanding Its Measurement and Effects

An Introduction to Second-Order Thinking and Its Examples

What Is Political Philosophy? Justice and Power Analyzed

Winner's Curse: Why Winning Auctions Can Cost You

Understanding Availability Bias in Investing

How to Build Credit: A Complete Guide

Understanding Financial Decision-Making Through Psychology

Why replication status matters here

The evidence map

How to use this responsibly

Methodology and scope notes

Sources

Related reading on When Notes Fly

Frequently Asked Questions

Share this article

Continue Reading

Effective Money-Saving Strategies Backed by Research

Inflation: Understanding Its Measurement and Effects

An Introduction to Second-Order Thinking and Its Examples

What Is Political Philosophy? Justice and Power Analyzed

Winner's Curse: Why Winning Auctions Can Cost You

Understanding Availability Bias in Investing

How to Build Credit: A Complete Guide

Understanding Financial Decision-Making Through Psychology

We Value Your Privacy

Cookie Preferences

Essential Cookies

Analytics & Performance Cookies

Advertising & Marketing Cookies