Key Takeaways
- The human genome contains approximately 3.2 billion base pairs of DNA sequence
- The haploid human genome size is measured at 3,054,815,472 base pairs in the GRCh38.p14 assembly
- Eukaryotic genomes like humans have linear chromosomes, with 22 autosomes and 2 sex chromosomes totaling 24 unique chromosomes
- The human genome contains an estimated 20,000-25,000 protein-coding genes
- Non-coding RNAs number over 20,000 in the human genome including lncRNAs and miRNAs
- Pseudogenes in humans total around 14,000, mostly processed pseudogenes
- Human Genome Project officially completed in 2003 with 99% coverage at 1x depth
- The first human genome sequence cost $2.7 billion and took 13 years
- Illumina HiSeq platform enabled 100x coverage human genomes for under $1,000 by 2015
- The common single nucleotide polymorphisms (SNPs) number over 10 million in the human genome with minor allele frequency >1%
- Structural variants (SVs) affect 20-50 kb per individual, totaling 1-2% of genome difference
- Copy number variations (CNVs) cover 12% of the human genome across populations
- Genome-wide association studies link 7,000 SNPs to disease risk
- Pharmacogenomics identifies 300 actionable variants for 100+ drugs
- Prenatal whole-genome sequencing detects 13% more pathogenic variants than microarrays
The human genome contains billions of base pairs, thousands of genes, and vast repetitive regions.
Applications and Impacts
Applications and Impacts Interpretation
Gene Content
Gene Content Interpretation
Genetic Variation
Genetic Variation Interpretation
Genome Size and Structure
Genome Size and Structure Interpretation
Sequencing Projects
Sequencing Projects Interpretation
How We Rate Confidence
Every statistic is queried across four AI models (ChatGPT, Claude, Gemini, Perplexity). The confidence rating reflects how many models return a consistent figure for that data point.
Only one AI model returns this statistic from its training data. The figure comes from a single primary source and has not been corroborated by independent systems. Use with caution; cross-reference before citing.
AI consensus: 1 of 4 models agree
Multiple AI models cite this figure or figures in the same direction, but with minor variance. The trend and magnitude are reliable; the precise decimal may differ by source. Suitable for directional analysis.
AI consensus: 2–3 of 4 models broadly agree
All AI models independently return the same statistic, unprompted. This level of cross-model agreement indicates the figure is robustly established in published literature and suitable for citation.
AI consensus: 4 of 4 models fully agree
Cite This Report
This report is designed to be cited. We maintain stable URLs and versioned verification dates. Copy the format appropriate for your publication below.
Emilia Santos. (2026, February 13). Genome Statistics. Gitnux. https://gitnux.org/genome-statistics
Emilia Santos. "Genome Statistics." Gitnux, 13 Feb 2026, https://gitnux.org/genome-statistics.
Emilia Santos. 2026. "Genome Statistics." Gitnux. https://gitnux.org/genome-statistics.
Sources & References
- Reference 1GENOMEgenome.govVisit source
- Reference 2NCBIncbi.nlm.nih.govVisit source
- Reference 3ENen.wikipedia.orgVisit source
- Reference 4MEDLINEPLUSmedlineplus.govVisit source
- Reference 5GHRghr.nlm.nih.govVisit source
- Reference 6NATUREnature.comVisit source
- Reference 7GENOMEgenome.ucsc.eduVisit source
- Reference 8ENSEMBLensembl.orgVisit source
- Reference 9GATKgatk.broadinstitute.orgVisit source
- Reference 10FLYBASEflybase.orgVisit source
- Reference 11ARABIDOPSISarabidopsis.orgVisit source
- Reference 12YEASTGENOMEyeastgenome.orgVisit source
- Reference 13WHEATGENOMEwheatgenome.orgVisit source
- Reference 14MAIZEGDBmaizegdb.orgVisit source
- Reference 15WORMBASEwormbase.orgVisit source
- Reference 16PLASMODBplasmodb.orgVisit source
- Reference 17GENCODEGENESgencodegenes.orgVisit source
- Reference 18IMGTimgt.orgVisit source
- Reference 19GUIDETOPHARMACOLOGYguidetopharmacology.orgVisit source
- Reference 20KINASEkinase.comVisit source
- Reference 21DRNELSONdrnelson.uthsc.eduVisit source
- Reference 22ECOCYCecocyc.orgVisit source
- Reference 23RICErice.uga.eduVisit source
- Reference 24ILLUMINAillumina.comVisit source
- Reference 25INTERNATIONALGENOMEinternationalgenome.orgVisit source
- Reference 26UKBIOBANKukbiobank.ac.ukVisit source
- Reference 27CANCERcancer.govVisit source
- Reference 28EARTHBIOGENOMEearthbiogenome.orgVisit source
- Reference 29SCIENCEscience.orgVisit source
- Reference 30ENCODEPROJECTencodeproject.orgVisit source
- Reference 31NANOPORETECHnanoporetech.comVisit source
- Reference 32ALLOFUSallofus.nih.govVisit source
- Reference 33GTEXPORTALgtexportal.orgVisit source
- Reference 341000GENOMES1000genomes.orgVisit source
- Reference 35EBIebi.ac.ukVisit source
- Reference 36CBIOPORTALcbioportal.orgVisit source
- Reference 37GNOMADgnomad.broadinstitute.orgVisit source
- Reference 38PHARMGKBpharmgkb.orgVisit source
- Reference 39NEJMnejm.orgVisit source
- Reference 40ACOGacog.orgVisit source
- Reference 41ISAAAisaaa.orgVisit source
- Reference 42ANCESTRYancestry.comVisit source
- Reference 43ANNALSOFONCOLOGYannalsofoncology.orgVisit source
- Reference 44GENOMEWEBgenomeweb.comVisit source






