Key Takeaways
- The boxplot, also known as a box-and-whisker plot, was introduced by John W. Tukey in his 1977 book "Exploratory Data Analysis" as a method for graphical data summarization
- John Tukey's original boxplot design emphasized five-number summaries including minimum, lower quartile, median, upper quartile, and maximum
- The first published boxplot appeared in Tukey's work to visualize distributions resistant to outliers
- A boxplot's box spans from the first quartile (Q1, 25th percentile) to the third quartile (Q3, 75th percentile)
- The median is marked as a line within the box, representing the 50th percentile of the dataset
- Whiskers extend to the smallest and largest values within 1.5 times the interquartile range (IQR) from Q1 and Q3
- Boxplots assume ordinal or continuous data, ignoring nominal categories inherently
- The 1.5*IQR rule for outliers is arbitrary but empirically covers ~99.3% of normal data
- Boxplots are robust to outliers, with median having 50% breakdown point vs. mean's 0%
- Boxplots outperform histograms for comparing multiple distributions' locations
- Violin plots combine boxplot with KDE, showing density unlike plain boxplots
- ECDF plots preserve all data points vs. boxplot summarization loss
- Boxplots used in ANOVA Tukey HSD for group comparisons visually
- In genomics, boxplots compare gene expression across conditions
- Finance employs boxplots for daily returns volatility across stocks
Boxplots visualize data summaries from John Tukey's original design and later extensions.
Applications and Usage
Applications and Usage Interpretation
Comparisons and Alternatives
Comparisons and Alternatives Interpretation
Construction and Components
Construction and Components Interpretation
History and Development
History and Development Interpretation
Statistical Properties
Statistical Properties Interpretation
Sources & References
- Reference 1ENen.wikipedia.orgVisit source
- Reference 2ITLitl.nist.govVisit source
- Reference 3PROJECTEUCLIDprojecteuclid.orgVisit source
- Reference 4TANDFONLINEtandfonline.comVisit source
- Reference 5JSTORjstor.orgVisit source
- Reference 6STATstat.cmu.eduVisit source
- Reference 7STATMETHODSstatmethods.netVisit source
- Reference 8RESEARCHGATEresearchgate.netVisit source
- Reference 9IEEEXPLOREieeexplore.ieee.orgVisit source
- Reference 10UIui.adsabs.harvard.eduVisit source
- Reference 11MATHSISFUNmathsisfun.comVisit source
- Reference 12STATISTICSstatistics.laerd.comVisit source
- Reference 13STATOLOGYstatology.orgVisit source
- Reference 14TOWARDSDATASCIENCEtowardsdatascience.comVisit source
- Reference 15KHANACADEMYkhanacademy.orgVisit source
- Reference 16SEEING-THEORYseeing-theory.brown.eduVisit source
- Reference 17BENFREDERICKSONbenfrederickson.github.ioVisit source
- Reference 18NCBIncbi.nlm.nih.govVisit source
- Reference 19ONLINEonline.stat.psu.eduVisit source
- Reference 20QUALITYDIGESTqualitydigest.comVisit source
- Reference 21ROBUSTSTATrobuststat.netVisit source
- Reference 22STATSstats.stackexchange.comVisit source
- Reference 23STATCANwww150.statcan.gc.caVisit source
- Reference 24RDOCUMENTATIONrdocumentation.orgVisit source
- Reference 25JSTATSOFTjstatsoft.orgVisit source
- Reference 26CScs.toronto.eduVisit source
- Reference 27OCWocw.mit.eduVisit source
- Reference 28PERCEPTUALEDGEperceptualedge.comVisit source
- Reference 29FABIOMASIELLOfabiomasiello.comVisit source
- Reference 30AUTODESKautodesk.comVisit source
- Reference 31CScs.cornell.eduVisit source
- Reference 32DARKHORSEANALYTICSdarkhorseanalytics.comVisit source
- Reference 33WELLBEINGDATAwellbeingdata.orgVisit source
- Reference 34ONLINECOURSESonlinecourses.science.psu.eduVisit source
- Reference 35BIOCONDUCTORbioconductor.orgVisit source
- Reference 36INVESTOPEDIAinvestopedia.comVisit source
- Reference 37EPAepa.govVisit source
- Reference 38FIVETHIRTYEIGHTfivethirtyeight.comVisit source
- Reference 39ASQasq.orgVisit source
- Reference 40STATISTICSHOWTOstatisticshowto.comVisit source
- Reference 41CLIMATEclimate.govVisit source
- Reference 42WILEYwiley.comVisit source
- Reference 43AMSTATamstat.orgVisit source
- Reference 44R-PROJECTr-project.orgVisit source
- Reference 45SUPPORTsupport.microsoft.comVisit source
- Reference 46ISOiso.orgVisit source
- Reference 47MINITABminitab.comVisit source
- Reference 48DOCUMENTATIONdocumentation.sas.comVisit source
- Reference 49CScs.uic.eduVisit source
- Reference 50TAYLORFRANCIStaylorfrancis.comVisit source
- Reference 51MATPLOTLIBmatplotlib.orgVisit source
- Reference 52SEABORNseaborn.pydata.orgVisit source
- Reference 53GGPLOT2ggplot2.tidyverse.orgVisit source
- Reference 54BOOTSTRAPPINGbootstrapping.orgVisit source
- Reference 55GGPLOT2-EXTSggplot2-exts.orgVisit source
- Reference 56EDWARDTUFTEedwardtufte.comVisit source
- Reference 57HIGHCHARTShighcharts.comVisit source
- Reference 58MATHmath.ucla.eduVisit source
- Reference 59ATSats.ucla.eduVisit source
- Reference 60ONLINELIBRARYonlinelibrary.wiley.comVisit source
- Reference 61SCIENCEDIRECTsciencedirect.comVisit source
- Reference 62ARXIVarxiv.orgVisit source
- Reference 63CScs.uni.eduVisit source
- Reference 64TABLEAUtableau.comVisit source
- Reference 65WILKELABwilkelab.orgVisit source
- Reference 66D3JSd3js.orgVisit source
- Reference 67OPTIMIZELYoptimizely.comVisit source
- Reference 68ZILLOWzillow.comVisit source
- Reference 69FHWAfhwa.dot.govVisit source
- Reference 70EIAeia.govVisit source
- Reference 71PSYCHOLOGIEpsychologie.uni-heidelberg.deVisit source
- Reference 72ARSars.usda.govVisit source






