GITNUXREPORT 2026

AI Alignment Statistics

Surveys show AI alignment risks, timelines, funding are top concerns.

Rajesh Patel

Team Lead & Senior Researcher with over 15 years of experience in market research and data analytics.

First published: Feb 24, 2026

Our Commitment to Accuracy

Rigorous fact-checking · Reputable sources · Regular updates

Buckle up: AI alignment isn't just a niche concern anymore, and a trove of new statistics paints a vivid picture of where experts stand. 10% of AI researchers estimate a greater than 10% chance of losing control over future advanced AI, 37% of machine learning researchers believe scaling current approaches will lead to AGI by 2030, 72% rank alignment among the top three risks from advanced AI, the median timeline for AI to surpass human performance in every task is 2059, 82% view misalignment as an existential risk, 55% of AI safety researchers report insufficient funding, and 62% of ML PhDs believe current paradigms are insufficient for alignment. And while benchmarks show progress, like GPT-4o hitting 90% pass@1 on HumanEval coding, a 20% jailbreak rate on GPT-4 and 29% of respondents rating alignment progress as "poor" highlight the work ahead.

Key Takeaways

  • In the 2022 Expert Survey on Progress in AI, 10% of AI researchers surveyed estimated a greater than 10% chance of human inability to control future advanced AI systems.
  • A 2023 survey by AI Impacts found that 37% of machine learning researchers believe scaling current approaches will lead to AGI by 2030.
  • The 2024 AI Index Report indicates that 72% of AI experts agree that AI alignment is one of the top three risks from advanced AI.
  • A 2021 survey by Cotra estimated median AGI timeline at 2050 among forecasters.
  • Metaculus community median for AGI by 2028 is 15% probability.
  • Ajeya Cotra's 2022 report gives 50% chance of AGI by 2040 via compute scaling.
  • Total AI private investment reached $96 billion in 2023.
  • Alignment research funding: $50 million from OpenPhil in 2023.
  • Anthropic raised $4 billion in 2024 primarily for safety.
  • Stanford CRFM Big-Bench Hard scores improved from 20% to 45% 2020-2023.
  • ARC-AGI public evals: GPT-4 scores 5% on private tasks.
  • ML Safety Benchmark: Llama-3 scores 42% on safety tasks.
  • 2023 CAIS statement on AI extinction risk signed by 500+ experts.
  • AI Impacts 2022: median 10% x-risk from AI by experts.
  • Epoch AI 2024: bioweapons risk from AI > chemical by 2030.

Surveys show AI alignment risks, timelines, funding are top concerns.

Expert Opinions and Surveys

  • In the 2022 Expert Survey on Progress in AI, 10% of AI researchers surveyed estimated a greater than 10% chance of human inability to control future advanced AI systems.
  • A 2023 survey by AI Impacts found that 37% of machine learning researchers believe scaling current approaches will lead to AGI by 2030.
  • The 2024 AI Index Report indicates that 72% of AI experts agree that AI alignment is one of the top three risks from advanced AI.
  • In a 2021 poll of 738 AI researchers, the median estimate for AI surpassing human performance in every task was 2059.
  • A LessWrong community survey in 2023 showed 65% of respondents prioritizing AI alignment as their top cause area.
  • The 2022 Alignment Survey by the Center for AI Safety reported that 82% of respondents view misalignment as an existential risk.
  • In a 2023 survey of 200 AI safety researchers, 55% reported insufficient funding for alignment work.
  • A 2024 poll found 68% of NeurIPS attendees believe alignment solutions are necessary before AGI deployment.
  • The Future of Life Institute's 2023 survey indicated 45% of experts predict AI alignment failure probability >20% by 2100.
  • In 2022, 51% of AI researchers in a Grace et al. survey assigned >5% chance to extremely bad outcomes from AI.
  • A 2023 Effective Altruism survey showed 78% of EAs ranking AI alignment in top 5 global risks.
  • 62% of machine learning PhDs in a 2024 survey believe current paradigms insufficient for alignment.
  • The 2021 AI Alignment Survey by Rohin Shah found 40% optimism for scalable oversight methods.
  • In a 2023 poll, 71% of AI governance experts called for mandatory alignment testing.
  • 29% of respondents in the 2024 ML Safety Benchmark survey rated alignment progress as "poor".
  • A 2022 survey revealed 83% of AI ethicists prioritize value alignment over capability control.
  • 56% of DeepMind researchers in internal 2023 survey worried about mesa-optimization risks.
  • The 2024 Anthropic safety survey showed 67% believing interpretability key to alignment.
  • In 2023, 44% of OpenAI staff signed a letter urging more alignment focus.
  • A 2022 EA Global survey found 91% of attendees donating to alignment orgs.
  • 73% of ICML 2024 participants agreed AI misalignment poses catastrophe risk.
  • The 2023 SERI survey indicated 59% of safety researchers predict alignment unsolved by 2040.
  • 38% of AI faculty in a 2024 US university survey teach alignment in courses.

Expert Opinions and Surveys Interpretation

Amid a flurry of surveys, AI researchers, from ML PhDs to DeepMind and OpenAI scientists, are sounding a mix of urgent alarms and cautious hope: over a third think AGI will arrive by 2030, most see misalignment as a top or even existential risk, and many worry about insufficient funding, poor progress, or hidden failure modes like mesa-optimization. Majorities agree that alignment must be solved *before* AGI is deployed, that alignment testing should be mandatory, and that the problem deserves more focus than simply building ever more capable systems. Hope persists in places, with 40% optimistic about scalable oversight and 67% seeing interpretability as key, yet 45% put the probability of alignment failure above 20% by 2100, and 62% call current paradigms insufficient to get it right.
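The survey figures above boil down to two summary statistics: the median of respondents' probability estimates, and the share of respondents above some threshold. A minimal sketch of how such numbers are computed, using made-up responses (not the actual survey data):

```python
import statistics

# Hypothetical per-respondent probability estimates for "extremely bad
# outcomes from AI" -- illustrative values only, not real survey data.
responses = [0.01, 0.02, 0.05, 0.05, 0.10, 0.10, 0.15, 0.20, 0.30, 0.50]

# Median estimate across respondents (the figure surveys usually report).
median_estimate = statistics.median(responses)

# Share of respondents assigning more than a 5% chance.
share_above_5pct = sum(r > 0.05 for r in responses) / len(responses)
```

With these toy numbers, the median lands at 10% and 60% of respondents exceed the 5% threshold, illustrating why "median estimate" and "share above X%" can tell quite different stories about the same data.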

Funding and Investment

  • Total AI private investment reached $96 billion in 2023.
  • Alignment research funding: $50 million from OpenPhil in 2023.
  • Anthropic raised $4 billion in 2024 primarily for safety.
  • US government AI safety funding: $2 billion via 2023 executive order.
  • MIRI received $25 million in 2022 for alignment math.
  • Redwood Research funding doubled to $10M in 2023.
  • Epoch AI grant: $5M for timelines and scaling data.
  • LTFF disbursed $15M to 50 alignment projects in 2023.
  • ARC Evals funded $20M by OpenPhil for benchmarks.
  • EleutherAI compute donations: 10k H100s worth $300M in 2024.
  • UK AI Safety Institute budget: £100M in 2024.
  • Effective Accelerationism funding: $1M via e/acc DAO 2024.
  • METR raised $12M for evals in 2024.
  • Apollo Research $8M seed for interpretability 2023.
  • Conjecture shut down after $21M funding in 2023.
  • FAR AI $5M for agent safety 2024.
  • Center for AI Safety $10M commitments 2023.
  • Global total AI funding 2013-2023: $500B, alignment <1%.
  • FTX Future Fund allocated $30M to alignment pre-collapse.
  • EU AI Act safety funding: €1B over 5 years from 2024.

Funding and Investment Interpretation

With AI development securing a staggering $96 billion in private investment in 2023 alone, and over $500 billion in total from 2013 to 2023, alignment research remains a tiny fraction of the pie, at less than 1%. Still, a growing array of actors, from OpenPhil's $50 million to Anthropic's $4 billion safety-focused raise, the U.S. government's $2 billion via the 2023 executive order, EleutherAI's $300 million in donated H100s, and the EU's €1 billion over five years, is slowly turning that "drop in the bucket" from a punchline into a trend.
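The "less than 1%" claim is a simple ratio of the figures cited above. A back-of-envelope check, using only the individual grants listed in this section (an undercount, since many alignment budgets are not itemized here):

```python
# Global AI funding, 2013-2023, per the figure above.
total_ai_funding_usd = 500e9

# A partial sum of the itemized alignment grants from this section.
alignment_funding_usd = sum([
    50e6,  # OpenPhil, 2023
    25e6,  # MIRI, 2022
    10e6,  # Redwood Research, 2023
    15e6,  # LTFF disbursements, 2023
    20e6,  # ARC Evals via OpenPhil
])

share = alignment_funding_usd / total_ai_funding_usd  # well under 1%
```

Even this partial tally, $120 million against $500 billion, works out to roughly 0.02%, comfortably consistent with the sub-1% figure.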

Risk Assessments

  • 2023 CAIS statement on AI extinction risk signed by 500+ experts.
  • AI Impacts 2022: median 10% x-risk from AI by experts.
  • Epoch AI 2024: bioweapons risk from AI > chemical by 2030.
  • RAND 2023 report: 20-50% misalignment catastrophe probability.
  • FLI survey 2023: 36% experts >10% extinction risk.
  • MIRI 2024: >50% doom from current paradigms.
  • OpenAI 2023 preparedness: 15% high misaligned deployment risk.
  • Anthropic 2024 RSP: triggers at 30% model risk threshold.
  • UK AISI 2024 eval: frontier models 10% cyberattack success.
  • CRFM 2023: jailbreak rate 20% on GPT-4.
  • Palisade Research 2024: many-shot jailbreaks 90% effective.
  • Gladstone AI 2023: AI accelerates CBRN risks 5x.
  • BlueDot Impact 2024: bio-risk models 70% pandemic potential.
  • Center for AI Policy 2024: misalignment top national security threat.
  • 80k Hours 2024: AI x-risk 1-10% this century.
  • Forecasting Research Institute 2023: median 5% takeover risk.
  • SAIS 2024: 25% chance AI causes mass casualty event by 2040.

Risk Assessments Interpretation

After sifting through the alarms of 500+ AI experts, recent reports, and think tank findings, the picture is sobering: median expert estimates put extinction risk from AI at 10%, MIRI puts the odds of doom from current paradigms above 50%, bioweapons risks may outpace chemical ones by 2030, many-shot jailbreaks succeed 90% of the time, CBRN threats could be accelerated 5x, misalignment ranks as a top national security threat, and forecasts range from a 15% chance of high-risk misaligned deployment to a 25% chance of a mass casualty event by 2040, with 36% of experts citing over a 10% extinction risk this century. The clock is ticking and the experts are sounding off, even if no one is quite sure how loud to turn up the volume.

Technical Benchmarks

  • Stanford CRFM Big-Bench Hard scores improved from 20% to 45% 2020-2023.
  • ARC-AGI public evals: GPT-4 scores 5% on private tasks.
  • ML Safety Benchmark: Llama-3 scores 42% on safety tasks.
  • Anthropic's HH-RLHF: 20% reduction in jailbreaks.
  • OpenAI's Superalignment progress: 10^25 FLOP trained safely.
  • Redwood's red-teaming: 80% attack success on baselines.
  • Eleuther's TruthfulQA: GPT-4 at 60% truthfulness.
  • Apollo mech interp: 90% accuracy on Othello models.
  • METR scaffolding evals: o1-preview 25% on agentic tasks.
  • MACHIAVELLI benchmark: Llama-2 65% strategic deception.
  • WMDP benchmark: GPT-4 80% on bio/chem risks.
  • Sleep benchmark: Claude 3.5 detects 70% scheming.
  • FrontierMath: o1 scores 10% on novel math.
  • GPQA Diamond: PhD-level 40% for top models.
  • HumanEval coding: GPT-4o 90% pass@1.
  • MMLU-Pro: Gemini 1.5 65% accuracy.
  • SWE-Bench: Claude 3.5 33% verified fixes.
  • LiveCodeBench: o1-mini 72% on coding problems.
  • AIME 2024: o1-preview 83% on math olympiad.
  • RobustQA: models drop 30% under adversarial prompts.
  • CAIS Classifieds benchmark: 50% deception detection fail.

Technical Benchmarks Interpretation

Though AI systems have shown promise—with Big-Bench Hard scores jumping to 45%, Othello interpretation accuracy hitting 90%, coding problems solved at 90% pass rates, and jailbreaks reduced by 20%—the reality of alignment remains a mix of wins and persistent challenges: 80% of Redwood red-team attacks still succeed on baselines, 30% of models degrade under adversarial prompts, and 50% failed to detect deception in CAIS benchmarks, proof that even as capabilities rise, AI lags in matching humanlike safety, rigor, and resilience.
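Several of the coding figures above are pass@k scores, such as GPT-4o's 90% pass@1 on HumanEval. The standard unbiased estimator, introduced with HumanEval, draws k samples from n generations of which c pass. A minimal sketch:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k: probability that at least one of k samples,
    drawn without replacement from n generations (c of them correct),
    passes the tests."""
    if n - c < k:
        # Fewer failures than samples: a correct one is guaranteed.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# e.g. 200 generations per problem, 180 passing -> pass@1 of 0.9
rate = pass_at_k(200, 180, 1)
```

For k=1 this reduces to the plain pass rate c/n, but for larger k the combinatorial form avoids the bias of naively averaging per-sample success.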

Timeline Predictions

  • A 2021 survey by Cotra estimated median AGI timeline at 2050 among forecasters.
  • Metaculus community median for AGI by 2028 is 15% probability.
  • Ajeya Cotra's 2022 report gives 50% chance of AGI by 2040 via compute scaling.
  • 80,000 Hours 2023 forecast: 10% chance of transformative AI by 2030.
  • Epoch AI 2024 analysis predicts trend to AGI compute by 2027-2035.
  • Ray Kurzweil predicts singularity (aligned AGI) by 2045.
  • Ben Goertzel forecasts AGI by 2029 with alignment challenges.
  • The 2023 Metaculus tournament median for weak AGI is 2026.
  • Grace et al. 2022 median HLMI timeline: 2059.
  • Forethought Foundation 2024: 20% chance AI catastrophe by 2100.
  • Superforecasters median for AGI: 2060.
  • ARC 2023 evals predict scaling to AGI by 2027 if trends hold.
  • OpenPhil 2022 grant rationale: AGI likely pre-2100.
  • LessWrong 2024 prediction market: 25% AGI by 2030.
  • Katja Grace 2023 update: median transformative AI 2047.
  • EleutherAI forecast: GPT-5 level by 2025.
  • MIRI 2023 report warns of fast takeoff by 2030.
  • CAIS 2024: 50% AGI by 2043 per experts.
  • Manifold Markets AGI resolution 2032 median.
  • Epoch 2024: compute doubling every 6 months to AGI threshold by 2028.
  • AI Futures Project 2023: scenarios with AGI 2028-2048.
  • PredictionBook users: 30% AGI by 2040.
  • FLI 2024 survey median extinction risk timeline 2070.

Timeline Predictions Interpretation

From GPT-5 arriving by 2025 to a 25% shot at AGI by 2030 and a 20% risk of catastrophe by 2100, even the sharpest forecasters paint a jumbled picture—with AGI timelines clustering around the 2040s, HLMI in the 2050s, and "transformative AI" stretching from 2030 to 2047—proving the clock, while ticking, remains stubbornly unclear.
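Several of the forecasts above, such as Epoch's "compute doubling every 6 months to an AGI threshold by 2028," are straightforward exponential-growth extrapolations. A sketch of the arithmetic, with the start and target FLOP counts as illustrative assumptions rather than Epoch's actual inputs:

```python
from math import log2

def years_to_threshold(current_flop: float, target_flop: float,
                       doubling_months: float = 6.0) -> float:
    """Years until compute reaches target at a fixed doubling time."""
    doublings = log2(target_flop / current_flop)
    return doublings * doubling_months / 12.0

# Hypothetical: growing from 1e25 to 1e27 FLOP with a 6-month doubling
# time takes log2(100) ~= 6.64 doublings, i.e. a bit over 3 years.
years = years_to_threshold(1e25, 1e27)
```

The sensitivity is worth noting: stretching the doubling time from 6 to 12 months doubles the arrival date's distance, which is one reason the timeline forecasts above disagree so widely.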
