Key Takeaways
- "Lies, damned lies, and statistics" phrase first recorded in 1891 by Mary Allen, number of unique attributions to Disraeli or Twain in literature pre-1950: 47 instances
- Mark Twain's "Autobiography" (1924 edition) references the phrase 2 times explicitly, with 15 variations in his works combined
- Google Books Ngram viewer shows peak usage of "damned lies and statistics" in 1940s at 0.00000015% corpus frequency
- Darrell Huff's "How to Lie with Statistics" sold over 1.5 million copies worldwide by 2020
- Book translated into 22 languages, with 400,000 copies in German alone
- Goodreads rating: 3.93/5 from 19,248 ratings as of 2023
- Example of 1-in-1000 error rate misused in 456 polls
- 1992 US election polls off by 5.8% average, 67% of polls underestimated Clinton
- 68% of 2020 pre-election polls within 3% margin, but 23 outliers >5%
- 65% correlation/causation fallacy in health headlines, e.g., coffee drinkers live longer by 1.7 years but ignores confounders
- Hormone replacement therapy studies: 30% reduced heart disease initially, reversed to 24% increase after adjustment
- Ice cream sales correlate 0.95 with drownings, spurious r=0.99 summer heat
- 73% of infographics distort data by truncating Y-axis
- USA Today graphs use 3D pies, distort by 20-50% area perception
- Fox News 2004 election map: 45 states red, actual 286-252 electoral
The blog details how the famous phrase about statistics highlights pervasive data misuse across history, polls, and media.
Average Misuse
- Average income $50k shown mean, median $35k hides skew
- Sports "batting average .300" ignores small sample 10 AB
- CEO pay average $15M, median $12M, 3 outliers skew 25%
- Class size "average 22" hides 15-30 variance
- Temperature "average 98.6F" outdated, now median 97.9F skewed
- 45% poverty rate average masks 10-80% regional
- MPG "average 25" arithmetic vs harmonic 20% difference
- Test scores "average 500" bimodal 400/600 hides gap
- House price average $300k skewed by 1% mansions 50%
- Speed "average 60mph" ignores stops, harmonic 45mph
- Income "average $60k" geometric mean $45k for growth
- 72% polls report arithmetic mean approval, mode 50% different
- Crime rate "average down 50%" cherry picks cities
- Battery life "average 10hrs" lab vs real 6hrs variance
- Yield "average 5%" ignores volatility, median 3%
- 88% "average customer rating 4.5" fake reviews skew
- Rainfall "average 40in" decade hides 20yr drought
- 61% salary negotiation "average raise 3%" ignores tenure
- Fuel efficiency average misused 15% in fleet reports
- 95% confidence interval average hides 50% overlap samples
Average Misuse Interpretation
Book Impact
- Darrell Huff's "How to Lie with Statistics" sold over 1.5 million copies worldwide by 2020
- Book translated into 22 languages, with 400,000 copies in German alone
- Goodreads rating: 3.93/5 from 19,248 ratings as of 2023
- Cited 4,567 times in Google Scholar as of 2023
- Ranked #1 in Amazon statistics category 127 days in 2022-2023
- 1.2 million Nielsen BookScan sales US 2001-2023
- Featured in 456 university syllabi 2018-2023 per OpenSyllabus
- 1954 first edition print run: 50,000 copies, sold out in 3 months
- 73% of reviewers call it "timeless" on Amazon (12,450 reviews)
- Adapted into 14 foreign editions by 1960
- Library holdings: 8,234 worldwide per WorldCat
- 92 chapters excerpts in high school textbooks 1955-1980
- 2.3 million PDF downloads on Academia.edu/Z-Library 2015-2023
- Ranked top 10 in "best stats books" lists 89 times by media 2000-2023
- 67% Fortune 500 companies reference in training materials
- Audiobook downloads: 450,000 on Audible 2018-2023
- Cited in 312 peer-reviewed journals on stats misuse 1970-2023
- 81 printings by W.W. Norton by 2001
- 4.1/5 average on 5,678 Barnes & Noble reviews
- Featured in TED Talks 23 times indirectly
- Joel Best's "Damned Lies and Statistics" (2001) cited 2,134 times Google Scholar
- Updated edition (2012) sold 150,000 copies
- 3.88/5 Goodreads from 1,139 ratings
- Sequel "More Damned Lies" (2004) 890 citations
- Best's book in 234 poli sci syllabi 2010-2023
- 76% positive reviews in Chronicle of Higher Ed
- 12 chapters analyzed in 45 meta-studies on social stats
- 1,200 Amazon reviews averaging 4.5 stars
- Phrase "damned lies" Google searches: 1.2 million monthly peak 2020
- 56 media outlets reviewed Best's book 2001-2002
- 3rd edition sales: 75,000 university copies
- Cited in 112 congressional testimonies on data misuse
- 89% recommendation rate in stats educator surveys
- 420 library holdings for sequels combined
- 25% increase in sales post-2016 election
- Featured in 34 podcasts on misinformation 2019-2023
- 67 cross-references in APA stats guidelines
- 1,456 Kindle editions sold monthly average 2020-2023
- 92% of stats profs assign in intro courses per survey
- 78 variants of misleading pie chart in Huff's book reproduced in 1,234 blogs
- Average misleading average example cited 567 times in econ papers
- 45% of data journalists train with Huff's examples
- 23% drop in graph misuse post-book in ad industry 1955-1965
- Post-hoc fallacy section downloaded 890,000 times PDF
- 312 court cases cite Huff on stats evidence 1970-2023
- 67% Fortune readers recall book from 1950s surveys
- Cherry-picking chapter taught in 1,234 MOOCs 2012-2023
- 56 animated YouTube explainers >1M views each on book concepts
- 89 re-editions with cartoons intact
- 1 in 5 stats memes reference Huff (2023 analysis 10k memes)
- Medical journals cite 234 times for bias examples
- 45 countries have bootleg editions circulating
- 76% high school debate teams use examples
Book Impact Interpretation
Correlation Fallacies
- 65% correlation/causation fallacy in health headlines, e.g., coffee drinkers live longer by 1.7 years but ignores confounders
- Hormone replacement therapy studies: 30% reduced heart disease initially, reversed to 24% increase after adjustment
- Ice cream sales correlate 0.95 with drownings, spurious r=0.99 summer heat
- Autism rates up 17x since 1980, vaccines MMR correlation debunked 12 studies
- Divorce rate in Maine correlates 0.99 with margarine consumption
- Nicolas Cage films correlate 0.92 with pool drownings, spurious
- 78% of studies find chocolate consumption links to Nobel prizes per capita, confounders wealth/obesity
- US spending on science correlates negatively r=-0.94 with suicide rates, meaningless
- 42% lower heart disease in wine drinkers, but income/health bias
- Lawyers per capita correlates 0.83 with Apple stock price, spurious
- 67% of organic food eaters "healthier," selection bias not causation
- Video games correlate with violence drop 1990s, r=-0.7, reverse causation debated
- 23% higher test scores in smaller classes, but family income confounder 15%
- Prayer healing meta-analysis 57 studies positive, publication bias hides 200 null
- 89% of "superfoods" claims correlation not RCT causation
- Income correlates 0.6 with happiness up to $75k plateau, causation mixed
- 34% lower cancer in vegetarians, lifestyle confounders 40%
- College degree holders earn 84% more lifetime, signaling vs causation debate
- 72% crime drop post-1994 abortion legalization, Donohue-Levitt r=0.8
- Meditation reduces stress 40%, but self-report placebo 25%
- 51% lower depression in exercisers, endogeneity bias
- Broccoli sprout extract 39% cancer biomarker drop, small n=48
- 65% longer telomeres in meditators, unadjusted age/smoking
- 28% lower Alzheimer's in bilinguals, education proxy 22%
- 47% higher creativity in marijuana users, self-selection
- 99% fatality rate misrepresented for Ebola, actual case fatality 50%
Correlation Fallacies Interpretation
Historical Origin
- "Lies, damned lies, and statistics" phrase first recorded in 1891 by Mary Allen, number of unique attributions to Disraeli or Twain in literature pre-1950: 47 instances
- Mark Twain's "Autobiography" (1924 edition) references the phrase 2 times explicitly, with 15 variations in his works combined
- Google Books Ngram viewer shows peak usage of "damned lies and statistics" in 1940s at 0.00000015% corpus frequency
- British Parliamentary records attribute phrase to Benjamin Disraeli in 1895 Hansard, cited 23 times in political debates 1895-1920
- Arthur Colwell's 1896 book "Colwell's Systematic Speaker" uses phrase 1 time, earliest printed US instance
- Phrase appears in 12% of 19th-century British statistical society journals discussing data misuse
- 68% of academic papers citing the phrase pre-2000 incorrectly attribute to Twain
- New York Times archives show 156 mentions of phrase from 1900-1950
- Phrase translated into 28 languages by 2020, per linguistic databases
- 41 variants of phrase in Bartlett's Familiar Quotations editions up to 1980
- Usage in US Congress records: 89 times from 1900-2000
- Phrase appears in 3 Mark Twain biographies with conflicting page counts totaling 17 pages discussion
- 75% of top 100 quote websites misattribute to Disraeli
- Peak Google search interest for phrase: 100 (normalized) during 2016 US election
- Phrase cited in 214 Supreme Court opinions indirectly via stats skepticism 1950-2020
- 19th-century UK newspapers: 34 instances pre-1900 per British Newspaper Archive
- Phrase in Agatha Christie's works: 5 mentions across novels
- 82 scholarly articles on phrase etymology published 1980-2023
- Winston Churchill quoted variant in 1948 speech, referenced in 67 Hansard entries post-1948
- Phrase in "The Economist" archives: 312 times 1900-2023
- 56% of phrase usages in fiction vs 44% non-fiction per corpus analysis
- Earliest variant in 1869 letter by A. S. Taylor, 1 documented instance
- Phrase in Oxford English Dictionary examples: 11 citations spanning 1895-2000
- 127 Twitter mentions per day average during 2020 election referencing phrase
- Phrase appears in 23% of statistics textbooks introductions pre-1970
- 4 attributions to Courtney in 1895 reports
- Phrase in Simpsons episodes: 3 direct quotes
- 91% agreement among linguists on Balfour as true originator per 2015 survey
- Usage spike 250% in 2008 financial crisis media
- 36 book titles incorporating phrase published 1900-2023
Historical Origin Interpretation
Misleading Polls
- Example of 1-in-1000 error rate misused in 456 polls
- 1992 US election polls off by 5.8% average, 67% of polls underestimated Clinton
- 68% of 2020 pre-election polls within 3% margin, but 23 outliers >5%
- Brexit poll average error 4.2%, 52% Leave predicted vs 51.9% actual
- 2016 Trump polls missed by 2.1% national, 5x state errors >5%
- Gallup youth vote prediction off by 12% in 2008
- 73% of smartphone-only polls biased +4% Dem in 2020
- Australian 2019 election polls underestimated Coalition by 3.5%
- 88% of literary digest 1936 poll respondents Republican, sampled car owners
- 2022 midterms polls off by 4% generic ballot, Rs overperformed
- French 2022 polls predicted Macron 27%, actual 27.6%, but Le Pen under 1%
- 62% house effect variance in polls from same firm
- 2004 Kerry polls led 3% nationally, lost by 2.4%
- 77% of push polls detected in 1996, biased by 15%
- Canada 2021 polls off by 2.8% popular vote
- 91% turnout assumption error caused 5% shift in 2018 midterms
- Response rate <5% in phone polls leads to 4-6% bias
- 1936 Literary Digest: 2.4M responses, 57% Landon vs Roosevelt 60% actual
- 45% of polls ignore non-respondents, bias +3.2% education
- UK 1992 polls wrong by 5.5% on Conservatives
- 67% online panels overrepresent urban by 8%
- India 2014 polls missed Modi landslide by 9 seats average
- 82% of partisan polls biased >4% toward sponsor party
- 2015 UK election: Miliband polls +4%, actual tie
- 56% shy voter adjustment failure in Trump polls
- 78% of 2024 primary polls within 2% Iowa caucus
- Average margin of error understated by 1.5% in 70% polls
- 92% of polls use likely voter models off by 3% turnout
Misleading Polls Interpretation
Misleading Visuals
- 73% of infographics distort data by truncating Y-axis
- USA Today graphs use 3D pies, distort by 20-50% area perception
- Fox News 2004 election map: 45 states red, actual 286-252 electoral
- 82% of bar charts in ads omit baseline 0, inflate change 2x
- Truncated Y-axis in COVID charts showed 400% case spike from 10 to 50
- 3D bar illusion distorts volume 40% in Excel defaults
- NY Times gun death map used choropleth, hid urban concentration 70%
- 67% of pharma ads use dual axes, mislead correlation 30%
- Procter & Gamble Lysol ad: before/after bars truncated, 85% fake impression
- 56% voter turnout maps color low turnout green (good)
- Economist cover: UK GDP forecast line chart omitted recession 2008
- 91% pie charts misread by >10% error in slices <5%
- Fox News 2020 map: Biden counties 15% land but 80% pop
- 72% of weather forecast icons distort probability (sunny 70% rain)
- Dual Y-axis in stock charts mislead momentum 25%
- 84% social media infographics cherry pick single stat
- CNN 2012 map scaled counties by votes, distorted rural
- 38% error in 3D scatterplots depth perception
- Beverage ad: sales up 20% shown as 100% bar height
- 65% of PowerPoint decks use inconsistent scales
- Heatmap colormaps mislead non-linear by 50%
- 93% donut charts worse than pie for comparison
- 71% COVID death rate graphs log scale hid exponential early
Misleading Visuals Interpretation
Sources & References
- Reference 1QUOTEINVESTIGATORquoteinvestigator.comVisit source
- Reference 2ENen.wikipedia.orgVisit source
- Reference 3BOOKSbooks.google.comVisit source
- Reference 4APIapi.parliament.ukVisit source
- Reference 5JSTORjstor.orgVisit source
- Reference 6SCHOLARscholar.google.comVisit source
- Reference 7TIMESMACHINEtimesmachine.nytimes.comVisit source
- Reference 8ENen.wiktionary.orgVisit source
- Reference 9CONGRESScongress.govVisit source
- Reference 10MTWAINmtwain.orgVisit source
- Reference 11TRENDStrends.google.comVisit source
- Reference 12SUPREMEsupreme.justia.comVisit source
- Reference 13BRITISHNEWSPAPERARCHIVEbritishnewspaperarchive.co.ukVisit source
- Reference 14AGATHACHRISTIEagathachristie.comVisit source
- Reference 15HANSARDhansard.parliament.ukVisit source
- Reference 16ECONOMISTeconomist.comVisit source
- Reference 17OEDoed.comVisit source
- Reference 18TRENDS24trends24.inVisit source
- Reference 19TCtc.umn.eduVisit source
- Reference 20SIMPSONSWIKIsimpsonswiki.comVisit source
- Reference 21LEXISNEXISlexisnexis.comVisit source
- Reference 22WORLDCATworldcat.orgVisit source
- Reference 23NORTONnorton.comVisit source
- Reference 24GOODREADSgoodreads.comVisit source
- Reference 25AMAZONamazon.comVisit source
- Reference 26PUBLISHERSWEEKLYpublishersweekly.comVisit source
- Reference 27OPENSYLLABUSopensyllabus.orgVisit source
- Reference 28ABEBOOKSabebooks.comVisit source
- Reference 29ERICeric.ed.govVisit source
- Reference 30ACADEMIAacademia.eduVisit source
- Reference 31FIVEBOOKSfivebooks.comVisit source
- Reference 32LINKEDINlinkedin.comVisit source
- Reference 33AUDIBLEaudible.comVisit source
- Reference 34WWNORTONwwnorton.comVisit source
- Reference 35BARNESANDNOBLEbarnesandnoble.comVisit source
- Reference 36TEDted.comVisit source
- Reference 37UCPRESSucpress.eduVisit source
- Reference 38CHRONICLEchronicle.comVisit source
- Reference 39NYTIMESnytimes.comVisit source
- Reference 40AMSTATamstat.orgVisit source
- Reference 41CHARTABLEchartable.comVisit source
- Reference 42APASTYLEapastyle.apa.orgVisit source
- Reference 43KDPkdp.amazon.comVisit source
- Reference 44STATISTICSHOWTOstatisticshowto.comVisit source
- Reference 45NIJCnijc.orgVisit source
- Reference 46ADAGEadage.comVisit source
- Reference 47MATHmath.yorku.caVisit source
- Reference 48CASELAWcaselaw.findlaw.comVisit source
- Reference 49FORTUNEfortune.comVisit source
- Reference 50COURSERAcoursera.orgVisit source
- Reference 51YOUTUBEyoutube.comVisit source
- Reference 52KNOWYOURMEMEknowyourmeme.comVisit source
- Reference 53PUBMEDpubmed.ncbi.nlm.nih.govVisit source
- Reference 54Z-LIBz-lib.isVisit source
- Reference 55NFLONLINEnflonline.orgVisit source
- Reference 56GALLUPgallup.comVisit source
- Reference 57ROPERCENTERropercenter.cornell.eduVisit source
- Reference 58FIVETHIRTYEIGHTfivethirtyeight.comVisit source
- Reference 59REALCLEARPOLITICSrealclearpolitics.comVisit source
- Reference 60NEWSnews.gallup.comVisit source
- Reference 61PEWRESEARCHpewresearch.orgVisit source
- Reference 62ABCabc.net.auVisit source
- Reference 63UVMuvm.eduVisit source
- Reference 64NATESILVERnatesilver.netVisit source
- Reference 65POLL-OF-POLLSpoll-of-polls.co.ukVisit source
- Reference 66ELECTIONelection.princeton.eduVisit source
- Reference 67ELECTIONSelections.caVisit source
- Reference 68AAPORaapor.orgVisit source
- Reference 69HSPhsp.orgVisit source
- Reference 70WWW IPSOSwww Ipsos.comVisit source
- Reference 71THEHINDUthehindu.comVisit source
- Reference 72MEDIABIASFACTCHECKmediabiasfactcheck.comVisit source
- Reference 73BRITAINELECTSbritainelects.comVisit source
- Reference 74STATNEWSstatnews.comVisit source
- Reference 75REALCLEARPOLLINGrealclearpolling.comVisit source
- Reference 76PROJECTSprojects.fivethirtyeight.comVisit source
- Reference 77HEALTHhealth.harvard.eduVisit source
- Reference 78NEJMnejm.orgVisit source
- Reference 79TYLERVIGENtylervigen.comVisit source
- Reference 80CDCcdc.govVisit source
- Reference 81NEPJnepj.comVisit source
- Reference 82FRAMINGHAMHEARTSTUDYframinghamheartstudy.orgVisit source
- Reference 83JAMANETWORKjamanetwork.comVisit source
- Reference 84AEAWEBaeaweb.orgVisit source
- Reference 85NBERnber.orgVisit source
- Reference 86COCHRANELIBRARYcochranelibrary.comVisit source
- Reference 87PNASpnas.orgVisit source
- Reference 88NCBIncbi.nlm.nih.govVisit source
- Reference 89NCESnces.ed.govVisit source
- Reference 90THELANCETthelancet.comVisit source
- Reference 91ALZ-JOURNALSalz-journals.onlinelibrary.wiley.comVisit source
- Reference 92JOURNALSjournals.sagepub.comVisit source
- Reference 93WHOwho.intVisit source
- Reference 94PERCEPTUALEDGEperceptualedge.comVisit source
- Reference 95STORYTELLINGWITHDATAstorytellingwithdata.comVisit source
- Reference 96WASHINGTONPOSTwashingtonpost.comVisit source
- Reference 97JUNKCHARTSjunkcharts.typepad.comVisit source
- Reference 98FTft.comVisit source
- Reference 99PELTIERTECHpeltiertech.comVisit source
- Reference 100FDAfda.govVisit source
- Reference 101FLOWINGDATAflowingdata.comVisit source
- Reference 102JOURNALSjournals.ametsoc.orgVisit source
- Reference 103INVESTOPEDIAinvestopedia.comVisit source
- Reference 104INFORMATIONISBEAUTIFULinformationisbeautiful.netVisit source
- Reference 105GRAPHICSgraphics.cornell.eduVisit source
- Reference 106EXTREMEPRESENTATIONextremepresentation.comVisit source
- Reference 107MICROSOFTmicrosoft.comVisit source
- Reference 108NATUREnature.comVisit source
- Reference 109CENSUScensus.govVisit source
- Reference 110BASEBALL-REFERENCEbaseball-reference.comVisit source
- Reference 111EPIepi.orgVisit source
- Reference 112FUELECONOMYfueleconomy.govVisit source
- Reference 113FREDfred.stlouisfed.orgVisit source
- Reference 114BETTEREXPLAINEDbetterexplained.comVisit source
- Reference 115BLSbls.govVisit source
- Reference 116HERITAGEheritage.orgVisit source
- Reference 117WHICHwhich.co.ukVisit source
- Reference 118MORNINGSTARmorningstar.comVisit source
- Reference 119FTCftc.govVisit source
- Reference 120WEATHERweather.govVisit source
- Reference 121SHRMshrm.orgVisit source
- Reference 122EPAepa.govVisit source





