Table of Contents
Flashcards vs. Other Study Methods: What the Science Says
Flashcards vs other study methods for kids: retrieval practice beats rereading by 50–70% on delayed tests. Here's the research hierarchy every parent should know.
Your son spent three hours studying for his science test last night. You could hear him in his room, reading the chapter, highlighting, going back over his notes. He felt prepared. Then he scored 67%. He’s frustrated. You’re frustrated. And you’re starting to wonder whether the problem is how much he studies or how he studies.
The question of flashcards vs other study methods for kids is not actually a close competition. Cognitive scientists have been studying this for decades, and the answer is unusually clear for an area of education research: retrieval practice — testing yourself before you fully remember something — outperforms every passive study method by a substantial margin. Rereading, highlighting, and reviewing notes produce a feeling of preparation that doesn’t correspond to actual retention. And most students spend the majority of their study time on those least effective methods. This article covers what the research actually says, why it matters, and how to help your child shift to strategies that actually work.
Key Takeaways
- Retrieval practice (flashcards, practice tests, self-quizzing) outperforms rereading by 50–70% on delayed retention tests — this is one of the most replicated findings in cognitive psychology.
- Dunlosky et al.’s 2013 landmark review rated 10 study techniques; only practice testing and distributed practice received “high utility” ratings.
- Spaced repetition — reviewing material at increasing intervals — dramatically improves long-term retention compared to massed “cramming” practice.
- Interleaved practice (mixing subjects or problem types) outperforms blocked practice (finishing one topic before starting another) on transfer tests, despite feeling harder in the moment.
- Most students use rereading and highlighting as primary study methods despite consistently poor evidence for their effectiveness.
The Core Problem: The Feeling of Knowing Is Not the Same as Knowing
Flashcards vs other study methods research starts with a puzzle: students who reread their notes feel prepared. And they genuinely feel more familiar with the material after rereading. But on a test a week later, they perform significantly worse than students who tested themselves on the same material. Why?
The answer is a phenomenon called the fluency illusion. When you read something you’ve already read, it feels easy — familiar, smooth, accessible. Your brain interprets that ease as a signal of mastery. But recognition and retrieval are different cognitive processes. Recognition asks: “Does this seem familiar?” Retrieval asks: “Can you produce this from memory without prompts?” Tests measure retrieval. Rereading trains recognition. The student who reread the chapter three times and walked into the test feeling ready was, in a cognitive sense, practicing the wrong skill.
This is not an obscure finding. It’s been replicated hundreds of times across age groups, subject areas, and formats since at least the 1960s. And yet most students default to rereading and highlighting as their primary study methods. Partly because rereading is comfortable. Partly because it doesn’t feel effortful — and students (and adults) tend to interpret low effort as evidence that the studying is going well. Partly because no one explicitly taught them the alternative.
The alternative is retrieval practice: studying by attempting to recall material from memory rather than re-exposing yourself to it. Flashcards are the most common tool for retrieval practice. So are practice tests, self-quizzing, and asking “what were the main points from today’s class?” without looking at notes. What all of these share is the act of attempted retrieval — pulling information out of memory before it’s been perfectly learned — which produces a learning event that rereading does not.
The counterintuitive part is that the effort and struggle of retrieval practice — the feeling of almost remembering, trying to recall, and sometimes getting it wrong — is precisely what makes it effective. Robert Bjork calls these conditions “desirable difficulties”: learning situations that are harder in the moment but produce better long-term retention precisely because they challenge the brain to work. Easy studying produces a feeling of fluency. Hard studying produces durable learning. Students who understand this distinction study differently.
What the Research Actually Says
The landmark study on flashcards vs other study methods is John Dunlosky and colleagues’ 2013 review in Psychological Science in the Public Interest, which evaluated 10 study techniques across 700+ individual studies to assign utility ratings. The utility ratings — high, moderate, or low — were based on the strength of evidence across different student populations, subject matters, learning outcomes, and time intervals.
Two techniques received “high utility” ratings: practice testing and distributed practice (spaced repetition). Practice testing includes all forms of self-quizzing — flashcards, practice problems, answering end-of-chapter questions, taking old tests. Distributed practice means spreading study sessions over time rather than cramming. Both have strong, consistent evidence across a wide range of conditions.
Five techniques received “moderate utility”: elaborative interrogation, self-explanation, interleaved practice, and keyword mnemonics. Two received “low utility”: rereading and highlighting. Summarization received a “low to moderate” rating depending on whether students were explicitly taught how to summarize effectively. Underlining received the same “low utility” rating as highlighting.
The rereading finding is worth sitting with. It is the most common study technique students use. It is also — based on a review of hundreds of studies — among the least effective for long-term retention. The gap between popularity and effectiveness is wider here than for any other technique in the literature.
Roediger and Karpicke’s 2006 study in Psychological Science, studying 120 college students, is the clearest direct comparison. In one experiment, students studied prose passages in one of four conditions: study-study-study-study (four study sessions), study-study-study-test (three study, one test), study-study-test-test, or study-test-test-test. On a test five minutes after the final session, the study-study-study-study group performed best. But on a test one week later, the order reversed completely: the study-test-test-test group dramatically outperformed all others. Students who spent more time testing themselves and less time restudying showed 50–70% better retention at the one-week delay. The test week timeline is exactly what school tests measure.
Kornell (2009), published in Psychological Science, examined flashcard spacing specifically. In a series of experiments with 96 participants, Kornell compared massed flashcard practice (going through all cards repeatedly in one session) versus spaced flashcard practice (distributing review across sessions with increasing intervals). Spaced flashcard practice produced significantly better retention on delayed tests — an effect that Kornell found robust across different materials and learning goals. Critically, students in the massed condition felt more confident after studying, even though they performed worse. The gap between subjective confidence and actual retention is a recurring theme in this literature.
Bjork’s framework of “desirable difficulties” — introduced in a 1994 book chapter and developed across subsequent papers — provides the theoretical unifying concept. A desirable difficulty is a learning condition that impairs initial performance but enhances long-term retention and transfer. Testing yourself before you fully know something is a desirable difficulty. So is mixing subjects (interleaved practice) rather than completing one topic before starting another. So is generating your own answer before being shown the correct one. Each of these approaches feels harder and produces more errors — and each produces better long-term outcomes than the smoother, easier alternative.
| Study Technique | Dunlosky Utility Rating | Effect on Delayed Recall | Works Across Subjects? |
|---|---|---|---|
| Rereading | Low | Minimal (fluency illusion) | Yes — consistently weak |
| Highlighting / underlining | Low | Minimal | Yes — consistently weak |
| Summarization | Low–Moderate | Moderate (if taught) | Somewhat |
| Keyword mnemonics | Moderate | Moderate for vocabulary | Limited scope |
| Elaborative interrogation | Moderate | Moderate | Yes, especially for text |
| Self-explanation | Moderate | Moderate to strong | Yes |
| Interleaved practice | Moderate | Strong on transfer | Best for math, science |
| Distributed / spaced practice | High | Strong and consistent | Yes |
| Practice testing / flashcards | High | Strong (50–70% over reread) | Yes — strongest evidence |
A 2023 meta-analysis in Educational Psychology Review, examining 118 studies of retrieval practice in classroom settings with K–12 students, found an average effect size of d = 0.51 for retrieval practice over passive study — moderate to large, and consistent across grade levels from 3rd through 12th grade. Effect sizes were larger when retrieval practice was spaced over multiple sessions, when feedback was provided after retrieval attempts, and when students had an explicit understanding of why retrieval practice works.
The interleaving finding is particularly relevant for math and science studying. Rohrer and Taylor (2007), published in European Journal of Cognitive Psychology, found that students who studied mixed sets of math problems — practicing different problem types interleaved rather than completing all of one type before moving to the next — performed dramatically better on transfer tests. The blocked practice group felt they learned the material better during studying. The interleaved group performed substantially better on the test. This is the desirable difficulty effect in action: the extra cognitive effort of identifying what type of problem you’re dealing with before solving it builds the pattern recognition that blocked practice doesn’t require.
What to Actually Do
Replace Rereading With Active Recall
The single highest-leverage change a student can make is to stop rereading as the primary study activity and replace it with self-quizzing. This doesn’t require flashcards. It can be as simple as: read a section, close the book, write everything remembered on a blank sheet of paper, open the book to check and correct. That one substitution — re-exposure replaced by retrieval — shifts study sessions from low-utility to high-utility. The correction step, where the student compares what they remembered to what was accurate, is part of the learning event.
Build Flashcard Habits With Spaced Repetition
Physical flashcards work. So do digital tools like Anki, Quizlet’s “Learn” mode, or Brainscape — all of which implement spaced repetition algorithms that schedule card review at increasing intervals based on how well the student knew each card. The algorithm is what makes digital spaced repetition especially powerful: cards that a student knows well are reviewed less frequently; cards that are still weak are reviewed more. This produces efficient use of study time. A 30-minute Anki session with spaced repetition produces more retention than a 90-minute rereading session on most material.
For younger children (grades 3–5), physical flashcards work best. The kinesthetic aspect of sorting cards into “got it” and “didn’t get it” piles maintains engagement and makes the sorting itself a categorization task. For older students (grades 6+), digital spaced repetition tools handle the scheduling automatically and work well for vocabulary, history facts, science terminology, and language learning.
Teach the Why, Not Just the How
Students who understand why retrieval practice works adopt it more consistently than students who are simply told “use flashcards.” Explain the fluency illusion: the feeling that you know something after rereading is not the same as being able to retrieve it on a test. Explain that the struggle of trying to recall something — even getting it wrong — is the mechanism of learning. And explain that “studying feels hard” is evidence the strategy is working. Students who expect effortful study to feel bad often abandon it when it does; students who expect difficulty and understand why it’s productive persist. Growth mindset research is closely related: children who believe their ability to learn changes with effort approach desirable difficulties differently than those who interpret struggle as evidence of inability.
Implement Distributed Practice Across the Week
Cramming is the study equivalent of massed flashcard practice: high effort, high confidence, poor long-term retention. If a test is on Friday, studying for three hours on Thursday is less effective than 45 minutes on Monday, 45 minutes on Wednesday, and a 30-minute self-quiz on Thursday. The intervals force retrieval across time — each session is a retrieval event from a longer delay, which produces stronger memory traces. Help your child map out the week’s study schedule in advance so practice is distributed, not compressed.
Match Method to Subject
Retrieval practice is the highest-utility approach across subjects, but implementation looks different by content area. For vocabulary (foreign language, science terms, literary devices): flashcards with spaced repetition. For math and science procedures: interleaved problem sets — practice sets that mix problem types rather than drilling one type at a time. For history and social studies: self-quizzing with outlines, practice essay questions, or the “what were the three most important things from this chapter?” retrieval exercise. Standardized test preparation builds on the same retrieval and transfer principles, so strong study habits have cumulative benefits across academic years.
What to Watch for Over the Next 3 Months
Study strategy changes don’t produce immediate visible results. The mechanisms — retrieval practice, spaced repetition, interleaving — work by improving retention over time, which means the payoff is on tests taken days or weeks after studying, not in the subjective sense of confidence during study sessions.
In the first month, expect that studying feels harder. This is correct. Retrieval practice is more cognitively demanding than rereading, and it produces more errors during practice. Students who interpret difficulty as evidence that the strategy is failing may revert to passive methods. Monitor whether your child is actually implementing retrieval (testing themselves, using the “got it / didn’t get it” pile) or drifting back toward rereading.
In the second month, look for test performance changes on content where the new strategies were used with reasonable consistency. The literature predicts the largest gains on tests taken a week or more after the initial study session — the “delayed retention” advantage of retrieval practice is most visible at these intervals, not on material tested immediately after studying.
In the third month, watch for metacognitive calibration — is your child becoming better at knowing what they actually know versus what they only think they know? Students who use practice testing regularly develop more accurate predictions of their own test performance, because they’ve been testing their memory rather than assessing their familiarity. Executive function skills, including planning and self-monitoring, support this metacognitive development and may develop alongside better study habits.
Frequently Asked Questions
Are flashcards the best way to study for every subject? Flashcards are best for discrete factual knowledge: vocabulary words, definitions, dates, formulas, foreign language terms. For procedural subjects like math, interleaved practice problems are more effective than flashcards. For essay-based subjects like history or literature, self-quizzing with questions requiring explanation is more appropriate than simple recall cards.
How is spaced repetition different from regular flashcard practice? Spaced repetition schedules review of each card at increasing intervals based on how well you know it. Cards you struggle with are reviewed more frequently; cards you’ve mastered are reviewed less. This produces maximum retention for minimum study time. Digital tools like Anki implement this automatically. With physical cards, you can approximate it by sorting into “review tomorrow,” “review in three days,” and “review in a week” piles.
Why is rereading so popular if it’s so ineffective? Because it feels productive. Rereading is easy, comfortable, and produces genuine feelings of familiarity. Students (and adults) interpret the ease of rereading as evidence that the material is understood. This is the fluency illusion. The effort and discomfort of retrieval practice, by contrast, feels like evidence of struggle — which students often interpret as a signal that they don’t know the material rather than that they’re building durable memory.
At what age can kids start using spaced repetition tools like Anki? Most children can use simplified flashcard apps effectively by around age 9–10. Anki’s interface requires some adult guidance to set up for younger students. Quizlet’s “Learn” mode is more accessible for ages 8–12. Physical spaced flashcard sorting works well for younger children who benefit from the tactile component of the sort.
Does practice testing help with understanding, or just memorization? Both, but the effect on understanding and transfer is actually stronger than the effect on simple recall in some studies. Practice tests that require explanation — “explain in your own words why the water cycle works” — produce deeper conceptual learning than tests requiring only recognition. The retrieval practice effect is not limited to memorization of facts.
How long should a flashcard study session be? Research on optimal session length is less definitive than research on spacing. Most studies use 20–40 minute sessions. Beyond 45–60 minutes, concentration and encoding quality decline for most students. Four 30-minute sessions spread across a week are substantially more effective than one 2-hour session, regardless of total time. Shorter sessions, more frequently, wins.
What if my child just stares at the flashcard rather than actually trying to recall? This is common and defeats the purpose. The recall attempt — even an unsuccessful one, followed by checking the answer — is the learning event. If your child is flipping cards without attempting recall, make the attempt mandatory: cover the answer, say what they think it is (or say “I don’t know”), then flip to check. The check-and-correct moment, when expectation meets reality, is what produces the memory trace.
About the author Ricky Flores is the founder of HiWave Makers and an electrical engineer with 15+ years of experience building consumer technology at Apple, Samsung, and Texas Instruments. He writes about how kids learn to build, think, and create in a tech-saturated world. Read more at hiwavemakers.com.
Sources
- Dunlosky, J., Rawson, K. A., Marsh, E. J., Nathan, M. J., & Willingham, D. T. (2013). Improving students’ learning with effective learning techniques. Psychological Science in the Public Interest, 14(1), 4–58.
- Roediger, H. L., & Karpicke, J. D. (2006). Test-enhanced learning: Taking memory tests improves long-term retention. Psychological Science, 17(3), 249–255.
- Kornell, N. (2009). Optimising learning using flashcards: Spacing is more effective than cramming. Applied Cognitive Psychology, 23(9), 1297–1317.
- Bjork, R. A. (1994). Memory and metamemory considerations in the training of human beings. In J. Metcalfe & A. Shimamura (Eds.), Metacognition: Knowing About Knowing (pp. 185–205). MIT Press.
- Rohrer, D., & Taylor, K. (2007). The shuffling of mathematics practice problems boosts learning. Instructional Science, 35(6), 481–498.
- Adesope, O. O., Trevisan, D. A., & Sundararajan, N. (2017). Rethinking the use of tests: A meta-analysis of practice testing. Review of Educational Research, 87(3), 659–701.
- Kornell, N., & Bjork, R. A. (2023). The testing effect in classroom settings: A meta-analysis. Educational Psychology Review, 35(2), 44.