Ask a woman and a man in the same country how they rate their life, top to bottom on a ladder, and in most places the woman’s answer comes out a touch higher. Ask the same two people whether they felt worried, sad, or stressed for much of yesterday, and the woman is more likely to say yes. The first question is about a life judged as a whole. The second is about a day as it was lived. They are different measurements of well-being, and on the gender question they disagree — not by accident, but systematically, across the world.
This piece treats that disagreement as the whole point. Survey researchers separate two constructs: evaluative well-being, the reflective rating you give your life (Gallup’s 0–10 Cantril ladder), and experiential well-being, the feelings you actually carry through a day (here, an index of yesterday’s worry, sadness, stress, pain, and anger). Conflating them is the classic mistake in happiness research. The gender gap is where the two pull apart most cleanly.
The method is deliberately simple and run identically for each construct. Inside every country we fit a weighted regression of the outcome on a female indicator, holding constant age, education, marital status, household income rank, and household size. That yields one number per country: the woman-minus-man gap, net of those controls. Then we step back and look at the distribution of that number across 155 countries. Does it point the same way nearly everywhere? And where it varies, what explains the spread?
A note on signs, because they matter throughout. The female coefficient is positive when women score higher. For the ladder, higher means a better life rating, so a positive evaluative gap means women rate their lives above men. For the negative-affect index and its components, higher means more of the bad feeling, so a positive experiential gap means women report more daily distress. Same sign, opposite human meaning — which is exactly why the two must never be averaged together.
Two clouds that don’t line up
Start with the distributions side by side. Each tick below is one country, placed by the size of its gender gap; the upper band is the evaluative gap, the lower band the experiential gap. A vertical line marks zero — perfect parity between women and men.
The evaluative cloud hugs zero; the affect cloud sits clearly to the right
Each mark is one country’s woman−man gap, net of controls. Right of the line = women higher.
Show the numbers as a table
The shapes tell the story before any number does. The evaluative cloud is centred barely above the parity line: women come out ahead in of countries, but the typical lead is small — a median of about ladder points, well inside the range where cross-cultural response styles alone could move things. The experiential cloud sits visibly to the right: in of countries women report more negative affect, and the gap is large enough, and consistent enough, to read as real.
Break the affect index into its parts and the pattern sharpens. The gap is most universal for sadness (women higher in of countries) and clear for stress () and worry (). Toggle the lower band above to watch each component slide. None of them mirror the evaluative cloud. A woman can sit higher on the ladder and still carry a heavier day.
Women rate their lives higher than men in roughly nine of ten countries — and report more daily distress in a clear majority of them. The two facts are both true at once.
The gradients run opposite ways
If these gaps were just noise, they would scatter randomly against national conditions. They don’t. Plot each country’s gap against the Gender Inequality Index — a UNDP measure where higher means a society is more unequal in reproductive health, empowerment, and labour-market participation — and a line appears. Two lines, in fact, sloping in opposite directions.
The inequality gradient flips sign between the two gaps
One dot per country. X = Gender Inequality Index (right = more unequal). Inverse-variance fit line shown.
Here is the counterintuitive part, and the data are unambiguous about it. The evaluative gap is steeper — more favourable to women — in countries that are more unequal, not less (meta-regression slope ladder points per unit of the index, 95% CI ). The places where women report rating their lives furthest above men are not Scandinavia; they include . This is the well-documented “gender-equality paradox” surfacing in life evaluation: the evaluative advantage women report is largest exactly where formal equality is lowest.
The experiential gap runs the other way. Its slope against the same index is negative (, 95% CI ): the daily-distress gap is, if anything, a little smaller in more unequal societies. The affect gap does not vanish in equal countries; it persists almost everywhere, largely indifferent to where a nation sits on the inequality scale. Both relationships are modest — each explains only a few percent of the variance across countries — but the two slopes carrying opposite signs is itself the finding, and it survives switching the country weighting from inverse-variance to population to unweighted.
Swap the moderator for the broader Human Development Index and the evaluative gradient washes out to roughly nothing while the affect gradient turns mildly positive — richer, more-developed countries showing a slightly wider daily-distress gap. Development and gender inequality are not the same axis, and the gaps respond to them differently. We report both and lean on neither.
Where each gap lives
The map puts the same country coefficients on the globe. Green means women score higher than men on the chosen measure; purple means men do; the scale is centred on parity. Switch between the two constructs to watch the world recolour.
Two different worlds, depending on which gap you map
Diverging scale centred at parity. Grey = suppressed, unavailable, or absent from the base map.
The evaluative map is patchy and pale, much of it close to parity, with the strongest pro-women readings concentrated across parts of the Middle East, North Africa, and South Asia. The affect map is greener and more uniform: across most of the inhabited world, women report more daily negative emotion, with only scattered exceptions where the sign flips — among them.
What this does — and doesn’t — show
These are associations, not causes. A woman-minus-man coefficient tells you that, among people alike in age, schooling, marital status, income rank, and household size, the two sexes answer differently. It does not tell you why. The likeliest stories — that women face different daily burdens, that men and women use emotion words differently, that the ladder and the affect questions tap different things — are exactly the kind this design cannot adjudicate.
The response-style caveat bites hardest on the evaluative gap, which is small. A median lead of about ladder points is inside the range where men and women, or one culture and another, might simply use the 0–10 scale differently. We treat the evaluative gap as “women at least as high as men,” not as a confident female advantage. The affect gap is larger and more consistent, but it too rides on binary yes/no items that cultures endorse at different rates.
We deliberately do not control for employment in the headline numbers, because employment is partly a consequence of gender, not a fair thing to hold constant when measuring a gender gap. Running the analysis the other way — adding an employment control on the 2009-and-later data where it exists — barely moves the picture: women still come out higher on the ladder in of countries and higher on negative affect in . The two gaps are robust to that choice; we show both so you can judge.
This is a global, cross-sectional cousin of a more familiar American story: the long decline in U.S. women’s happiness relative to men documented by Stevenson and Wolfers, which a sibling article tracks over time in the General Social Survey. That piece follows one country across decades; this one freezes a moment and spreads it across continents, asking not “when did the gap move?” but “does it point the same way everywhere, and why does it vary?” Different data, different method, same refusal to flatten well-being into a single number.
What survives all the hedging is the shape of the thing: two well-being gaps that share a sign convention but not a direction of meaning, each with its own geography and its own quarrel with national inequality. Whether women are “happier” than men has no answer. It has two answers, and they disagree.
Notes & data
Source. Gallup World Poll, interviews 2005–2020, one row per respondent (about a thousand per country–wave, repeated cross-section). Within-country gaps are weighted by Gallup’s within-country sampling weight (wgt), which is valid for single-country estimates.
Method. For each country we fit a weighted least-squares regression of the outcome on a female indicator (1 = woman, 0 = man) plus controls: age, age², education (3 levels), marital status, within-country income quintile, and household size. The coefficient on female is that country’s gap. Evaluative outcome: the 0–10 Cantril ladder. Experiential outcome: the 0–1 negative-affect index (mean of yesterday’s worry, sadness, stress, pain, anger), plus its binary components.
Sign convention. Positive coefficient = women higher. For the ladder, higher = better life rating (positive = women rate life above men). For negative affect and its components, higher = more of the feeling (positive = women report more daily distress).
Suppression. A country enters the distribution only with at least complete cases and at least survey waves; thinner countries are dropped and greyed on the map. Retained: countries.
Three “world” numbers, kept distinct. (1) Share of countries with a gap above zero — one vote per country, a generalisation statistic. (2) Pooled gap — the population-weighted mean of country gaps, weighting by ctry_pop_millions (total population, an on-hand proxy for the 15+ population; not a plain average). (3) Meta-regression slope — inverse-variance-weighted (1/SE²) WLS of the country gap on a national moderator, with population-weighted and unweighted fits checked as robustness.
Caveats. The panel is unbalanced — not every country is surveyed every year — which the suppression rule and the country count handle. Cross-cultural response styles mean small gaps (especially the evaluative one, median ≈ ladder points) are not definitive. National moderators (ctry_gender_ineq, ctry_hdi) are static country attributes of roughly 2015–2018 vintage, joined to all survey years on ISO3. Seven small states (Bahrain, Comoros, Hong Kong, Maldives, Malta, Mauritius, Singapore) are absent from the base map atlas and are greyed regardless of data. Results are associations, not causes.
Construct integrity. Evaluative (ladder) and experiential (affect) outcomes are reported separately throughout and never averaged into one “well-being” gap — their divergence is the article’s subject.
Prior work. Stevenson & Wolfers, “The Paradox of Declining Female Happiness” (AEJ: Policy, 2009); Batz-Barbarich et al., a meta-analysis of gender and subjective well-being (Psychological Science, 2018); Zuckerman, Li & Diener, on women, men, and well-being (PSPB, 2017); Tesch-Römer, Motel-Klingebiel & Tomasik, on gender differences in subjective well-being and societal gender equality (Social Indicators Research, 2008).