This is an evidence map for the core findings of learning science - a structured, sourced reference to what the research actually shows about how people learn durably. It is built for a specific use: when you want to know not just that a technique works, but how strong the evidence is, where it comes from, and what it does not claim.
Every entry links the claim to a named, peer-reviewed source you can verify yourself. Where the evidence is strong, we say so. Where it is mixed, narrow, or contested, we say that too. This page is maintained as a reference, not a one-time roundup.
How to read this map
Each technique below is rated on two axes that are easy to conflate but should be kept separate:
- Evidence strength - how well-replicated and methodologically sound the supporting research is. "Strong" means multiple independent replications, often synthesized in meta-analyses; "moderate" means real support with meaningful caveats or narrower conditions; "contested/limited" means the picture is genuinely unsettled.
- Effect type - what the technique actually improves. Many learning techniques help long-term retention and transfer while feeling worse in the moment than easier strategies. This gap between performance-during-study and durable-learning is itself one of the most robust findings in the field, and it is why learner intuition is an unreliable guide.
A note on effect sizes: where we cite them, they are drawn from the named meta-analysis and should be read as that synthesis reported them, not as universal constants. Effect sizes vary with population, material, retention interval, and study design. Treat them as directional evidence of strength, not precise predictions.
The evidence map
| Technique | What it is | Evidence strength | What the evidence supports | Key source |
|---|---|---|---|---|
| Retrieval practice (testing effect) | Attempting to recall information from memory, rather than re-reading it. | Strong | Substantially better long-term retention than restudying the same material; one of the highest-utility techniques reviewed across the literature. | Roediger & Karpicke (2006); Dunlosky et al. (2013) |
| Spaced practice (distributed practice) | Distributing study across time rather than massing it into one session. | Strong | Reliably better long-term retention than massed practice across content types and ages; the optimal gap grows with the target retention interval. | Cepeda et al. (2006) |
| Interleaving | Mixing different problem types or topics within a session rather than blocking them. | Moderate-Strong | Better discrimination learning and transfer, especially in maths and category learning; feels harder during practice (a desirable difficulty). | Rohrer & Taylor (2007); Kornell & Bjork (2008) |
| Elaborative interrogation | Asking and answering "why is this true?" for facts being learned. | Moderate | Improves retention of factual material; benefits depend on prior knowledge and are clearest for discrete facts. | Dunlosky et al. (2013) |
| Self-explanation | Explaining to yourself how new information relates to what you know, or the steps of a worked example. | Moderate | Improves learning and transfer across tasks; effortful and not always done well without prompting. | Dunlosky et al. (2013) |
| Worked examples (early learning) | Studying fully worked solutions before attempting problems, when a domain is new. | Moderate-Strong | Reduces extraneous cognitive load and accelerates early skill acquisition; the advantage fades and can reverse as expertise grows (the expertise-reversal effect). | Sweller (1988) |
| Highlighting / re-reading | Marking text or reading it again - the most common student strategies. | Strong evidence of LOW utility | Popular and low-effort, but consistently among the least effective techniques for durable learning; produces fluency that is easily mistaken for mastery. | Dunlosky et al. (2013) |
| Matching instruction to "learning styles" | Teaching to a learner's preferred modality (visual/auditory/kinaesthetic). | Contested - not supported | The meshing hypothesis (matching modality improves outcomes) has repeatedly failed to find support in controlled tests, despite widespread belief. | Pashler et al. (2008) |
Two findings that frame everything above
1. Desirable difficulties
Several of the most effective techniques - retrieval, spacing, interleaving - share a counterintuitive property: they make studying feel harder and slower in the moment, yet produce stronger long-term retention. Conditions that speed up apparent learning during practice often weaken it long term, and vice versa.
This is Robert Bjork's concept of "desirable difficulties," and it explains why learners systematically prefer less effective strategies: the easy ones feel like they are working.
2. The performance-learning distinction
Performance is what you can do during or immediately after study, under supportive conditions. Learning is what remains after a delay and generalises to new contexts. They frequently dissociate. Because learners and instructors usually observe performance, not learning, intuitive judgments about "what's working" are unreliable - which is the practical reason this evidence map exists.
Methodology and scope notes
- What counts as evidence here: peer-reviewed experimental research and meta-analyses/syntheses. Where a row cites a synthesis (e.g. Dunlosky et al., 2013), the rating reflects that synthesis's assessment of utility, not a single study.
- Effect sizes are conditional. They depend on population, material type, retention interval, and design. This map reports evidence strength and direction, not guaranteed magnitudes for any individual.
- Lab vs. classroom. Much of the strongest evidence comes from controlled studies; classroom replications generally support the same directions but with smaller and more variable effects. Treat real-world gains as smaller than lab maxima.
- This is not advice for a specific person or condition. It summarises general findings about typical learners; it is not individualised educational, clinical, or accessibility guidance.
- Maintenance: this page is updated when the underlying evidence base changes materially, not on a fixed schedule. A visible "last reviewed" date reflects a real review.
Sources
All sources are peer-reviewed and linked to their DOI or canonical record. These are the primary references behind the ratings above.
- Dunlosky, J., Rawson, K. A., Marsh, E. J., Nathan, M. J., & Willingham, D. T. (2013). Improving students' learning with effective techniques: Promising directions from cognitive and educational psychology. Psychological Science in the Public Interest, 14(1), 4-58. https://doi.org/10.1177/1529100612453266
- Roediger, H. L., & Karpicke, J. D. (2006). Test-enhanced learning: Taking memory tests improves long-term retention. Psychological Science, 17(3), 249-255. https://doi.org/10.1111/j.1467-9280.2006.01693.x
- Cepeda, N. J., Pashler, H., Vul, E., Wixted, J. T., & Rohrer, D. (2006). Distributed practice in verbal recall tasks: A review and quantitative synthesis. Psychological Bulletin, 132(3), 354-380. https://doi.org/10.1037/0033-2909.132.3.354
- Kornell, N., & Bjork, R. A. (2008). Learning concepts and categories: Is spacing the "enemy of induction"? Psychological Science, 19(6), 585-592. https://doi.org/10.1111/j.1467-9280.2008.02127.x
- Rohrer, D., & Taylor, K. (2007). The shuffling of mathematics problems improves learning. Instructional Science, 35, 481-498. https://doi.org/10.1007/s11251-007-9015-8
- Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2), 257-285. https://doi.org/10.1207/s15516709cog1202_4
- Pashler, H., McDaniel, M., Rohrer, D., & Bjork, R. (2008). Learning styles: Concepts and evidence. Psychological Science in the Public Interest, 9(3), 105-119. https://doi.org/10.1111/j.1539-6053.2009.01038.x
- Bjork, R. A., & Bjork, E. L. (1992). A new theory of disuse and an old theory of stimulus fluctuation. In A. Healy, S. Kosslyn, & R. Shiffrin (Eds.), From Learning Processes to Cognitive Processes (Vol. 2, pp. 35-67). Erlbaum.
Related reading on When Notes Fly
- How Learning Works: A Research Guide - the full pillar this map supports
- Spaced Repetition Explained
- The Testing Effect
- Deliberate Practice Explained
- What Is Cognitive Load
Last reviewed: June 2026. Compiled and reviewed by the WhenNotesFly editorial team against the cited primary sources. Corrections: editorial@whennotesfly.com � Editorial standards. Free to cite with attribution to When Notes Fly.
Frequently Asked Questions
What is a learning science evidence map?
It is a structured reference that rates each major learning technique by how strong the supporting research is and what it actually improves, with every claim linked to a named peer-reviewed source. It separates evidence strength from effect type, and is honest about where findings are contested.
Which learning techniques have the strongest evidence?
Retrieval practice (the testing effect) and spaced practice have the strongest, most replicated evidence for durable long-term retention. Interleaving and worked examples (early in learning) have moderate-to-strong support. Highlighting and re-reading are popular but have strong evidence of LOW effectiveness.
Do learning styles work?
The ‘meshing hypothesis’ - that matching instruction to a learner’s preferred modality (visual/auditory/kinaesthetic) improves outcomes - has repeatedly failed to find support in controlled experiments, despite being widely believed. The evidence does not support tailoring teaching to learning styles.
Why do effective study techniques feel harder?
Several of the most effective techniques (retrieval, spacing, interleaving) create ‘desirable difficulties’ - they feel harder and slower during study but produce stronger long-term retention. Easy strategies create fluency that learners mistake for mastery, which is why intuition is an unreliable guide to what works.
Can I cite this evidence map?
Yes. It is free to cite with attribution to When Notes Fly. Every rating links to its primary peer-reviewed source (with DOI) so you can verify the underlying research directly.
