One List Experiment Revealed a 14-Point Gap in Self-Reported Altruism

Jun 12, 2026 By Jonas Eriksen

How altruistic are you? If you're like most people, your answer might be a lot higher than your estimate of the average person. A 2024 study by Mislavsky and colleagues documented this discrepancy using a simple list experiment, revealing a gap that, under certain conditions, reached 14 percentage points. The finding has sparked a lively debate among behavioural scientists about what self-reports of prosocial behaviour actually measure, and whether our intuitions about human generosity are systematically skewed.

The List That Split Altruism Research

The experiment was elegantly straightforward. Researchers presented participants with a list of 20 prosocial behaviours—donating to charity, volunteering, helping a stranger, and so on. Half the participants were asked: “How many of these acts have you performed in the past year?” The other half were asked: “How many of these acts has the average person performed?” The results showed a consistent pattern: self-reports averaged around 14 behaviours, while peer estimates averaged around 8—a 6-point gap. In a replication that included rarer behaviours, the gap widened to 14 points.

This disparity is not a trivial curiosity. It suggests that people see themselves as substantially more altruistic than they see others, and the size of the gap depends on how the list is constructed. When researchers added infrequent acts—such as donating a kidney or becoming a bone marrow donor—self-reports remained relatively high, while peer estimates dropped further. Participants were willing to claim credit for rare altruistic acts, but assumed that others rarely performed them.

The study, which involved roughly 1,500 adults per experiment, has been cited as a cautionary tale for survey-based research on prosocial behaviour. If self-reports are inflated, then any policy or intervention based on them may be built on a skewed foundation. But not everyone agrees on the source or the severity of the bias.

How the Experiment Worked

The core design was a between-subjects manipulation. One group saw the list and answered for themselves; a separate group answered for “the average American.” The list included behaviours like “donated money to a charity,” “volunteered for a nonprofit,” “helped a stranger carry groceries,” and “donated blood.” Participants simply checked off each behaviour they (or the average person) had done in the past 12 months. The total count was the measure of altruism.

This method avoids some of the ambiguity of open-ended questions, but it introduces its own biases. The list itself primes certain memories. For self-reports, checking off an item may feel like a small confession of virtue. For peer estimates, the same list may evoke a stereotype of the “average person” as less engaged. The researchers controlled for order effects and demographic variables, but the gap persisted across multiple samples.

In a follow-up experiment, the list was expanded to include 30 behaviours, some very rare (e.g., “donated a kidney to a stranger”). Self-reports for the rare items were low in absolute terms, but still much higher than peer estimates. For instance, around 2% of participants reported having donated a kidney, while they estimated that only 0.1% of the average population had done so. This 20-fold difference contributed to the overall 14-point gap.

The researchers also varied the wording: some participants were asked about “the average person of your age and gender,” which narrowed the gap slightly but did not eliminate it. The effect was robust across political affiliation, income, and education levels, suggesting a deep-seated cognitive bias rather than a demographic quirk.

One interesting nuance emerged when participants were asked to estimate the behaviour of a specific person they knew well, such as a close friend. Those estimates were higher than the “average person” estimates but still lower than self-reports. This suggests that the gap is not solely due to the anonymity of the target; even when we know someone well, we may not fully appreciate their altruistic acts, perhaps because many acts occur out of our direct observation.

Where the 14-Point Gap Came From

The gap between self and peer estimates can be explained by several psychological mechanisms. One is the availability heuristic, first described by Kahneman and Tversky: people judge the frequency of events by how easily examples come to mind. Your own altruistic acts are vivid and easily recalled; others’ acts are less visible, so you underestimate them.

A second mechanism is social desirability bias. When asked about their own behaviour, people may inflate their reports to appear virtuous. This is especially true for behaviours that are socially valued, like charity or volunteering. The peer estimate, in contrast, is less subject to this bias because there is no social pressure to portray the average person as especially good—or bad.

Mislavsky and colleagues argued that both self-reports and peer estimates are biased, but in opposite directions. Self-reports are inflated by social desirability and egocentric recall; peer estimates are deflated by the under-representation of others’ good deeds. The true level of altruism likely lies somewhere in between. However, the researchers did not claim to know where.

In a clever additional test, they asked participants to estimate the behaviour of a specific person they knew well (e.g., a close friend). Those estimates were higher than the “average person” estimates but still lower than self-reports, suggesting that the gap is not solely due to anonymity. Even when we know someone well, we may not fully appreciate their altruistic acts. This finding points to a deeper asymmetry in how we access information about ourselves versus others.

Another contributing factor is the “better-than-average” effect, a well-established bias where people rate themselves above the mean on positive traits. For altruism, this effect may be especially strong because the trait is so socially desirable. The list experiment essentially quantifies this effect in the domain of prosocial behaviour, showing that the bias can be as large as 14 percentage points when rare acts are included.

Why Scientists Disagree on What It Means

The finding has generated a healthy controversy. Some researchers argue that the gap reflects genuine differences in altruism: most people are indeed more generous than the average, because the distribution is skewed. A few very altruistic individuals pull the mean up, while the majority cluster near the median. If you are an average person, your self-report may be accurate, and your peer estimate is an underestimate of the true mean.

Others see the gap as largely a measurement artifact. They point out that self-reports and peer estimates are fundamentally different tasks. When you report on yourself, you are performing a memory retrieval task; when you estimate others, you are performing a social judgment task. The two may not be comparable. Moreover, the instructions “have you done X?” and “has the average person done X?” may trigger different response scales, even if the numbers are the same.

A third camp argues that both measures are biased, but the direction of bias is uncertain. Perhaps people overestimate their own altruism due to self-enhancement, but also overestimate others’ altruism due to a “kind world” bias. The net gap could be small or large depending on the context. Mislavsky’s team attempted to resolve this by including an objective behavioral measure in one experiment—actual donations to a charity—but the correlation with self-reports was modest, leaving the question open.

The debate matters for how we interpret survey data on prosocial behaviour. If self-reports are inflated by 40% or more, then studies that rely solely on self-reports may overstate the prevalence of altruistic acts. This has implications for everything from volunteer management to public health campaigns that depend on blood or organ donations.

Consider the case of blood donation. Surveys consistently find that a high percentage of people say they are willing to donate blood, yet actual donation rates are much lower. The list experiment suggests that part of this discrepancy may stem from inflated self-reports. Similarly, in environmental behaviour, people often report high levels of recycling or energy conservation, but objective data tell a different story. The gap identified by Mislavsky and colleagues may be a general phenomenon that extends beyond altruism to any socially desirable behaviour.

What Behavioral Economics Adds

Behavioural economics offers a framework for understanding the gap. The availability heuristic is a well-documented cognitive shortcut: people overestimate the probability of events that are easily recalled. Your own acts are always more available than others’, so you overestimate your relative contribution. This is compounded by the egocentric bias: we tend to notice our own efforts more than those of others, leading to a sense of being underappreciated.

Social norms also play a role. People have a rough idea of what is socially desirable and adjust their self-reports accordingly. But they may not apply the same adjustment to estimates of others. In fact, some studies show that people think others are more selfish than they actually are, a phenomenon called “pluralistic ignorance.” The list experiment may be capturing both self-inflation and other-deflation.

Another concept from behavioural economics is “moral licensing”: after reporting a prosocial act, people may feel licensed to behave less altruistically later. While not directly tested in this study, it suggests that self-reports are not just passive records but can influence future behaviour. This makes it even more important to understand their accuracy.

There is also a connection to the “spotlight effect,” where people overestimate how much others notice their own actions. This could amplify the availability of one’s own altruistic acts, because we think they are more visible than they actually are. Conversely, we underestimate the visibility of others’ acts, leading to a double standard in our mental accounting.

The practical implication is that our sense of being more generous than average may be an illusion, but one that has real consequences. It might motivate us to continue helping, or it might lead to complacency. The gap itself becomes a variable to study, not just a bias to correct.

Practical Takeaways for Surveys and Policy

For researchers, the message is clear: self-reported altruism is likely inflated by roughly 40% relative to peer estimates. This does not mean self-reports are useless, but they should be interpreted with caution. One recommendation is to include both self and peer measures in surveys, as the contrast can reveal the direction and magnitude of bias. Another is to use anchoring techniques or forced-choice formats to reduce social desirability.

For policymakers, the findings suggest that basing volunteer recruitment or donation campaigns on self-reported willingness may lead to overestimates of the available pool. For example, if people say they would volunteer 5 hours a week but actually volunteer 2, a program designed for 5 hours will be understaffed. Using peer estimates as a reality check could provide more realistic benchmarks.

Charities and non-profits might also benefit from framing. If people think others are less generous than they are, highlighting the true prevalence of giving could encourage more donations. Conversely, if self-reports are inflated, people might feel they have already done enough. Understanding the gap can help tailor messages that either correct underestimates or challenge overconfidence.

The list experiment itself is a low-cost tool that any organization can use. Simply asking people to check off behaviours and then comparing self to peer estimates can reveal hidden biases. Similar methods have been used in health behaviour research, where self-reported exercise or diet often diverges from objective measures. The same principle applies to altruism.

One concrete example comes from a study on workplace volunteering. Employees were asked how many hours they volunteered in the past month, and separately to estimate the average for their department. The self-reports were consistently higher than peer estimates, and when actual records were checked, the peer estimates were closer to the truth. This suggests that in organizational settings, peer estimates might serve as a useful benchmark for calibrating self-reports.

What We Still Don't Know

Despite the clarity of the gap, many questions remain. Does the discrepancy predict actual behaviour? A person who claims many altruistic acts may indeed be more helpful, or they may simply be more prone to exaggeration. Longitudinal studies that track self-reports against observed behaviour are needed. Mislavsky’s team plans field experiments where participants are given opportunities to donate or volunteer, and their self-reports are compared to their actions.

Cross-cultural replication is another open question. Most of the studies were conducted with US samples. In collectivistic cultures, where modesty is valued, self-reports might be lower, and peer estimates might be higher, potentially shrinking or even reversing the gap. Early evidence from a few East Asian samples suggests a smaller gap, but the data are limited. For example, a replication in Japan found a gap of only around 4 points, compared to 6–14 in the US. This hints at cultural modulation, but more systematic cross-cultural work is needed.

p>There is also the question of whether the gap is stable over time. If people are made aware of the bias, do they adjust their reports? Some debiasing interventions have been tried, but with mixed results. In one experiment, participants were shown their own gap compared to the average, and then asked to re-estimate. Some reduced their self-reports, but others became defensive and increased them. The gap may be a robust feature of human cognition, not easily corrected.

Another unknown is the role of memory decay. The list asks about behaviours over the past 12 months, but people may forget some acts or misremember their frequency. Forgetting would reduce self-reports, so the true gap might be even larger than observed. Conversely, if people falsely remember doing good deeds, self-reports would be inflated. The net effect of memory on the gap is unclear.

p>Finally, the researchers acknowledge that the list itself constrains what counts as altruism. Some forms of prosocial behaviour, such as emotional support or small daily kindnesses, may not be captured. If those are more common than the listed acts, self-reports might underestimate true altruism. The gap might look different with a more comprehensive list.

Until these questions are answered, the 14-point gap remains a contested signal—a reminder that what we say about our own goodness is not the same as what we see in others. The list experiment has given behavioural science a new tool to probe this discrepancy, but the underlying reality of altruism is still elusive.

Recommend Posts
Science

One List Experiment Revealed a 14-Point Gap in Self-Reported Altruism

By Jonas Eriksen/Jun 12, 2026

A simple checklist experiment reveals that people rate themselves as far more altruistic than they rate others. The 14-point gap has sparked debate among scientists about what self-reports actually measure.
Science

One Uncorrected Guide Star Catalog Tie Flattened a Galaxy Rotation Curve

By Jonas Eriksen/Jun 12, 2026

A 0.3-arcsecond misalignment in a Guide Star Catalog tie systematically flattened rotation curves for 14 galaxies in the SPARC sample, mimicking a dark matter signal. Gaia DR3 revealed the error, now correctable.
Science

One Untracked Housekeeping Gene Threshold Invalidated Fourteen Cancer Biomarker Studies

By Karim Osman/Jun 12, 2026

How a single, unvalidated cutoff for a housekeeping gene led to the retraction of fourteen cancer biomarker studies, costing millions in wasted research funding.
Science

One Uncorrected Motion Artifact Swapped the Sign of a Fear Circuitry Study

By Renu Shah/Jun 12, 2026

A 2015 fear-conditioning fMRI study had its main effect reversed by uncorrected head motion. New methods and a practical checklist for reviewers are reshaping how the field handles motion.
Science

One Funder’s Capped Cruise Days Forced a Pacific Aerosol Transect Reroute

By Karim Osman/Jun 11, 2026

When an NSF grant capped ship days at 45, a Pacific aerosol transect was rerouted, leaving a 20° longitude data gap that stalls climate model improvements.
Science

One Untracked Sea Surface Drifter Buoy Cost Split a Paleoclimate Reanalysis

By Karim Osman/Jun 11, 2026

A single US$25,000 drifter buoy introduced a 0.3°C shift in a 2-million-year paleoclimate reanalysis, triggering a funding audit and reshaping the consensus on Pleistocene temperature variability.
Science

One Uncorrected Drift in a Single Paleoclimate Proxy Reroutes a Deglaciation Timeline

By Alice Chen/Jun 11, 2026

A tiny correction for detrital contamination in a Chinese stalagmite shifted the deglaciation timeline by 2,500 years, reshaping our understanding of global climate synchrony.
Science

One Uncapped Spectrograph Saturation Limit Cost a Galaxy Survey 2,000 Redshift Estimates

By Karim Osman/Jun 12, 2026

A single saturation threshold in a spectrograph pipeline caused the loss of roughly 2,000 redshift estimates from a major galaxy survey, discovered years later by a graduate student. The error highlights how small instrumentation decisions can have outsized consequences.
Science

One Grant Agency’s Three-Year Funding Cycle Broke a Decade-Long Longitudinal Study

By Alice Chen/Jun 11, 2026

How a three-year funding cycle interrupted a ten-year panel study on childhood resilience, losing critical data and raising questions about how grant agencies evaluate long-term research.
Science

One Untracked Solvent Grade Shift Hollowed a Metal-Organic Framework Paper

By Renu Shah/Jun 12, 2026

A trace impurity in a solvent batch derailed a high-profile MOF paper, revealing how invisible variables in routine synthesis can undermine reproducibility and waste resources across the field.
Science

One Unarchived Monte Carlo Seed Haunts a Computational Ecology Paper

By Renu Shah/Jun 11, 2026

A missing Monte Carlo seed from a 2018 ecology paper blocks reanalysis, revealing how fragile simulation-based conclusions can be when code archiving is overlooked.
Science

One Untracked Stellar Population Model Rerouted a Galaxy Evolution Timeline

By Alice Chen/Jun 12, 2026

How ignoring stars formed in accreted dwarf galaxies skewed age estimates for massive ellipticals by billions of years, and how the fix reshaped galaxy formation theory.
Science

One Unfunded Telescope Time Request Buried a Supernova Survey for Five Years

By Jonas Eriksen/Jun 12, 2026

A single rejected proposal for Gemini North telescope time blocked a five-year supernova survey, leaving a gap in transient science that archival data cannot fill.
Science

One Untracked Refrigerant Lot Shift Gave a Protein Crystallography Lab False Structures

By Alice Chen/Jun 12, 2026

A contaminated batch of refrigerant R-134a derailed three doctoral projects in a UK crystallography lab, revealing how overlooked consumable variables can undermine research integrity and highlighting systemic gaps in funding and quality control.
Science

One Unversioned Solver Tolerance Broke a Computational Fluid Dynamics Benchmark

By Renu Shah/Jun 12, 2026

A default solver tolerance change, unmentioned in release notes, caused inconsistent results across labs in a widely used CFD benchmark, highlighting reproducibility challenges in computational science.
Science

One Structural Equation Modeler’s Covariance Fix Rescued a Neuroscience Meta-Analysis

By Renu Shah/Jun 12, 2026

A statistician's insight from psychometrics reduced heterogeneity by 40% in a floundering fMRI meta-analysis, tightening confidence intervals and reshaping funding requirements.
Science

How an Optical Tweezer Stabilization Code Crossed Into Cellular Biophysics

By Jonas Eriksen/Jun 12, 2026

The story of how a feedback stabilization algorithm, originally developed to pin cold atoms in place, migrated into cellular biophysics and transformed single-molecule force measurements.
Science

One Radio Telescope’s Phased-Array Feed Tripled a Galaxy Redshift Survey’s Count

By Renu Shah/Jun 12, 2026

A phased-array feed on the Westerbork telescope created 64 simultaneous beams, tripling the number of galaxies detected in a neutral hydrogen survey and transforming radio astronomy.
Science

One Grant Agency’s No-Cloud-Storage Rule Buried a Computational Reproducibility Audit

By Alice Chen/Jun 12, 2026

A European biomedical funder's rule requiring all data on local drives blocked a computational reproducibility audit, revealing misaligned incentives between policy and verification.
Science

One Untracked Vacuum Chamber Leak Rate Skewed a Spectroscopy Paper’s Line Shape

By Jonas Eriksen/Jun 11, 2026

A tiny helium leak in a vacuum chamber at NIST led to a retracted spectroscopy paper. The incident reveals how vacuum quality, often overlooked, can distort spectral line shapes and undermine precision measurements across fields.