GITNUXREPORT 2026

AI Alignment Statistics

More people now call AI alignment a top-tier risk than ever, with 72% of AI experts ranking it among the three biggest dangers. At the same time, surveys and funding gaps put hard pressure on timelines and safety, from 68% of NeurIPS attendees saying solutions are needed before AGI deployment to 55% of AI safety researchers reporting insufficient alignment funding.

104 statistics5 sections9 min readUpdated 5 days ago

Statistic 1

In the 2022 Expert Survey on Progress in AI, 10% of AI researchers surveyed estimated a greater than 10% chance of human inability to control future advanced AI systems.

Statistic 2

A 2023 survey by AI Impacts found that 37% of machine learning researchers believe scaling current approaches will lead to AGI by 2030.

Statistic 3

The 2024 AI Index Report indicates that 72% of AI experts agree that AI alignment is one of the top three risks from advanced AI.

Statistic 4

In a 2021 poll of 738 AI researchers, the median estimate for AI surpassing human performance in every task was 2059.

Statistic 5

A LessWrong community survey in 2023 showed 65% of respondents prioritizing AI alignment as their top cause area.

Statistic 6

The 2022 Alignment Survey by the Center for AI Safety reported that 82% of respondents view misalignment as an existential risk.

Statistic 7

In a 2023 survey of 200 AI safety researchers, 55% reported insufficient funding for alignment work.

Statistic 8

A 2024 poll found 68% of NeurIPS attendees believe alignment solutions are necessary before AGI deployment.

Statistic 9

The Future of Life Institute's 2023 survey indicated 45% of experts predict AI alignment failure probability >20% by 2100.

Statistic 10

In 2022, 51% of AI researchers in a Grace et al. survey assigned >5% chance to extremely bad outcomes from AI.

Statistic 11

A 2023 Effective Altruism survey showed 78% of EAs ranking AI alignment in top 5 global risks.

Statistic 12

62% of machine learning PhDs in a 2024 survey believe current paradigms insufficient for alignment.

Statistic 13

The 2021 AI Alignment Survey by Rohin Shah found 40% optimism for scalable oversight methods.

Statistic 14

In a 2023 poll, 71% of AI governance experts called for mandatory alignment testing.

Statistic 15

29% of respondents in the 2024 ML Safety Benchmark survey rated alignment progress as "poor".

Statistic 16

A 2022 survey revealed 83% of AI ethicists prioritize value alignment over capability control.

Statistic 17

56% of DeepMind researchers in internal 2023 survey worried about mesa-optimization risks.

Statistic 18

The 2024 Anthropic safety survey showed 67% believing interpretability key to alignment.

Statistic 19

In 2023, 44% of OpenAI staff signed a letter urging more alignment focus.

Statistic 20

A 2022 EA Global survey found 91% of attendees donating to alignment orgs.

Statistic 21

73% of ICML 2024 participants agreed AI misalignment poses catastrophe risk.

Statistic 22

The 2023 SERI survey indicated 59% of safety researchers predict alignment unsolved by 2040.

Statistic 23

38% of AI faculty in a 2024 US university survey teach alignment in courses.

Statistic 24

Total AI private investment reached $96 billion in 2023.

Statistic 25

Alignment research funding: $50 million from OpenPhil in 2023.

Statistic 26

Anthropic raised $4 billion in 2024 primarily for safety.

Statistic 27

US government AI safety funding: $2 billion via 2023 executive order.

Statistic 28

MIRI received $25 million in 2022 for alignment math.

Statistic 29

Redwood Research funding doubled to $10M in 2023.

Statistic 30

Epoch AI grant: $5M for timelines and scaling data.

Statistic 31

LTFF disbursed $15M to 50 alignment projects in 2023.

Statistic 32

ARC Evals funded $20M by OpenPhil for benchmarks.

Statistic 33

EleutherAI compute donations: 10k H100s worth $300M in 2024.

Statistic 34

UK AI Safety Institute budget: £100M in 2024.

Statistic 35

Effective Accelerationism funding: $1M via e/acc DAO 2024.

Statistic 36

METR raised $12M for evals in 2024.

Statistic 37

Apollo Research $8M seed for interpretability 2023.

Statistic 38

Conjecture shut down after $21M funding in 2023.

Statistic 39

FAR AI $5M for agent safety 2024.

Statistic 40

Center for AI Safety $10M commitments 2023.

Statistic 41

Global total AI funding 2013-2023: $500B, alignment <1%.

Statistic 42

FTX Future Fund allocated $30M to alignment pre-collapse.

Statistic 43

EU AI Act safety funding: €1B over 5 years from 2024.

Statistic 44

2023 CAIS statement on AI extinction risk signed by 500+ experts.

Statistic 45

AI Impacts 2022: median 10% x-risk from AI by experts.

Statistic 46

Epoch AI 2024: bioweapons risk from AI > chemical by 2030.

Statistic 47

RAND 2023 report: 20-50% misalignment catastrophe probability.

Statistic 48

FLI survey 2023: 36% experts >10% extinction risk.

Statistic 49

MIRI 2024: >50% doom from current paradigms.

Statistic 50

OpenAI 2023 preparedness: 15% high misaligned deployment risk.

Statistic 51

Anthropic 2024 RSP: triggers at 30% model risk threshold.

Statistic 52

UK AISI 2024 eval: frontier models 10% cyberattack success.

Statistic 53

CRFM 2023: jailbreak rate 20% on GPT-4.

Statistic 54

Palisade Research 2024: many-shot jailbreaks 90% effective.

Statistic 55

Gladstone AI 2023: AI accelerates CBRN risks 5x.

Statistic 56

BlueDot Impact 2024: bio-risk models 70% pandemic potential.

Statistic 57

Center for AI Policy 2024: misalignment top national security threat.

Statistic 58

80k Hours 2024: AI x-risk 1-10% this century.

Statistic 59

Forecasting Research Institute 2023: median 5% takeover risk.

Statistic 60

SAIS 2024: 25% chance AI causes mass casualty event by 2040.

Statistic 61

Stanford CRFM Big-Bench Hard scores improved from 20% to 45% 2020-2023.

Statistic 62

ARC-AGI public evals: GPT-4 scores 5% on private tasks.

Statistic 63

ML Safety Benchmark: Llama-3 scores 42% on safety tasks.

Statistic 64

Anthropic's HH-RLHF: 20% reduction in jailbreaks.

Statistic 65

OpenAI's Superalignment progress: 10^25 FLOP trained safely.

Statistic 66

Redwood's red-teaming: 80% attack success on baselines.

Statistic 67

Eleuther's TruthfulQA: GPT-4 at 60% truthfulness.

Statistic 68

Apollo mech interp: 90% accuracy on Othello models.

Statistic 69

METR scaffolding evals: o1-preview 25% on agentic tasks.

Statistic 70

MACHIAVELLI benchmark: Llama-2 65% strategic deception.

Statistic 71

WMDP benchmark: GPT-4 80% on bio/chem risks.

Statistic 72

Sleep benchmark: Claude 3.5 detects 70% scheming.

Statistic 73

FrontierMath: o1 scores 10% on novel math.

Statistic 74

GPQA Diamond: PhD-level 40% for top models.

Statistic 75

HumanEval coding: GPT-4o 90% pass@1.

Statistic 76

MMLU-Pro: Gemini 1.5 65% accuracy.

Statistic 77

SWE-Bench: Claude 3.5 33% verified fixes.

Statistic 78

LiveCodeBench: o1-mini 72% on coding problems.

Statistic 79

AIME 2024: o1-preview 83% on math olympiad.

Statistic 80

RobustQA: models drop 30% under adversarial prompts.

Statistic 81

CAIS Classifieds benchmark: 50% deception detection fail.

Statistic 82

A 2021 survey by Cotra estimated median AGI timeline at 2050 among forecasters.

Statistic 83

Metaculus community median for AGI by 2028 is 15% probability.

Statistic 84

Ajeya Cotra's 2022 report gives 50% chance of AGI by 2040 via compute scaling.

Statistic 85

80,000 Hours 2023 forecast: 10% chance of transformative AI by 2030.

Statistic 86

Epoch AI 2024 analysis predicts trend to AGI compute by 2027-2035.

Statistic 87

Ray Kurzweil predicts singularity (aligned AGI) by 2045.

Statistic 88

Ben Goertzel forecasts AGI by 2029 with alignment challenges.

Statistic 89

The 2023 Metaculus tournament median for weak AGI is 2026.

Statistic 90

Grace et al. 2022 median HLMI timeline: 2059.

Statistic 91

Forethought Foundation 2024: 20% chance AI catastrophe by 2100.

Statistic 92

Superforecasters median for AGI: 2060.

Statistic 93

ARC 2023 evals predict scaling to AGI by 2027 if trends hold.

Statistic 94

OpenPhil 2022 grant rationale: AGI likely pre-2100.

Statistic 95

LessWrong 2024 prediction market: 25% AGI by 2030.

Statistic 96

Katja Grace 2023 update: median transformative AI 2047.

Statistic 97

EleutherAI forecast: GPT-5 level by 2025.

Statistic 98

MIRI 2023 report warns of fast takeoff by 2030.

Statistic 99

CAIS 2024: 50% AGI by 2043 per experts.

Statistic 100

Manifold Markets AGI resolution 2032 median.

Statistic 101

Epoch 2024: compute doubling every 6 months to AGI threshold by 2028.

Statistic 102

AI Futures Project 2023: scenarios with AGI 2028-2048.

Statistic 103

PredictionBook users: 30% AGI by 2040.

Statistic 104

FLI 2024 survey median extinction risk timeline 2070.

1/104

Sources

Trusted by 500+ publications

+497

Written by Margot Villeneuve·Edited by Christopher Morgan·Fact-checked by Astrid Bergmann

Published Feb 24, 2026·Last verified May 5, 2026·Next review: Nov 2026

Fact-checked via 4-step process— how we build this report

01Primary Source Collection

Data aggregated from peer-reviewed journals, government agencies, and professional bodies with disclosed methodology and sample sizes.

02Editorial Curation

Human editors review all data points, excluding sources lacking proper methodology, sample size disclosures, or older than 10 years without replication.

03AI-Powered Verification

Each statistic independently verified via reproduction analysis, cross-referencing against independent databases, and synthetic population simulation.

04Human Cross-Check

Final human editorial review of all AI-verified statistics. Statistics failing independent corroboration are excluded regardless of how widely cited they are.

Read our full methodology →

Statistics that fail independent corroboration are excluded.

Most people talk about alignment in terms of principles, but the surveys increasingly quantify the stakes. A 2024 poll of NeurIPS attendees found 68% believe alignment solutions are necessary before AGI deployment, while many researchers still report gaps in safety readiness and funding. Put alongside estimates like the median 2059 timeline for outperforming humans on every task, these disagreements create a sharp tension that is worth unpacking through the full alignment statistics dataset.

Key Takeaways

In the 2022 Expert Survey on Progress in AI, 10% of AI researchers surveyed estimated a greater than 10% chance of human inability to control future advanced AI systems.
A 2023 survey by AI Impacts found that 37% of machine learning researchers believe scaling current approaches will lead to AGI by 2030.
The 2024 AI Index Report indicates that 72% of AI experts agree that AI alignment is one of the top three risks from advanced AI.
Total AI private investment reached $96 billion in 2023.
Alignment research funding: $50 million from OpenPhil in 2023.
Anthropic raised $4 billion in 2024 primarily for safety.
2023 CAIS statement on AI extinction risk signed by 500+ experts.
AI Impacts 2022: median 10% x-risk from AI by experts.
Epoch AI 2024: bioweapons risk from AI > chemical by 2030.
Stanford CRFM Big-Bench Hard scores improved from 20% to 45% 2020-2023.
ARC-AGI public evals: GPT-4 scores 5% on private tasks.
ML Safety Benchmark: Llama-3 scores 42% on safety tasks.
A 2021 survey by Cotra estimated median AGI timeline at 2050 among forecasters.
Metaculus community median for AGI by 2028 is 15% probability.
Ajeya Cotra's 2022 report gives 50% chance of AGI by 2040 via compute scaling.

Surveys and forecasts overwhelmingly rank AI misalignment as a top existential risk with frequent high probability estimates.

Expert Opinions and Surveys

1In the 2022 Expert Survey on Progress in AI, 10% of AI researchers surveyed estimated a greater than 10% chance of human inability to control future advanced AI systems.

Verified

2A 2023 survey by AI Impacts found that 37% of machine learning researchers believe scaling current approaches will lead to AGI by 2030.

Verified

3The 2024 AI Index Report indicates that 72% of AI experts agree that AI alignment is one of the top three risks from advanced AI.

Verified

4In a 2021 poll of 738 AI researchers, the median estimate for AI surpassing human performance in every task was 2059.

Verified

5A LessWrong community survey in 2023 showed 65% of respondents prioritizing AI alignment as their top cause area.

Single source

6The 2022 Alignment Survey by the Center for AI Safety reported that 82% of respondents view misalignment as an existential risk.

Verified

7In a 2023 survey of 200 AI safety researchers, 55% reported insufficient funding for alignment work.

Verified

8A 2024 poll found 68% of NeurIPS attendees believe alignment solutions are necessary before AGI deployment.

Verified

9The Future of Life Institute's 2023 survey indicated 45% of experts predict AI alignment failure probability >20% by 2100.

Directional

10In 2022, 51% of AI researchers in a Grace et al. survey assigned >5% chance to extremely bad outcomes from AI.

Verified

11A 2023 Effective Altruism survey showed 78% of EAs ranking AI alignment in top 5 global risks.

Directional

1262% of machine learning PhDs in a 2024 survey believe current paradigms insufficient for alignment.

Directional

13The 2021 AI Alignment Survey by Rohin Shah found 40% optimism for scalable oversight methods.

Verified

14In a 2023 poll, 71% of AI governance experts called for mandatory alignment testing.

Verified

1529% of respondents in the 2024 ML Safety Benchmark survey rated alignment progress as "poor".

Verified

16A 2022 survey revealed 83% of AI ethicists prioritize value alignment over capability control.

Verified

1756% of DeepMind researchers in internal 2023 survey worried about mesa-optimization risks.

Verified

18The 2024 Anthropic safety survey showed 67% believing interpretability key to alignment.

Verified

19In 2023, 44% of OpenAI staff signed a letter urging more alignment focus.

Verified

20A 2022 EA Global survey found 91% of attendees donating to alignment orgs.

Verified

2173% of ICML 2024 participants agreed AI misalignment poses catastrophe risk.

Single source

22The 2023 SERI survey indicated 59% of safety researchers predict alignment unsolved by 2040.

Verified

2338% of AI faculty in a 2024 US university survey teach alignment in courses.

Verified

Expert Opinions and Surveys Interpretation

Amid a flurry of surveys, AI researchers—from ML PhDs to DeepMind and OpenAI scientists—are sounding a mix of urgent alarms and cautious hope: a third think AGI will arrive by 2030, most see alignment as a top or existential risk, many fret about insufficient funding, poor progress, or hidden risks like mesa-optimization, while most agree alignment must be solved *before* deploying AGI, needs mandatory testing, and deserves more focus than just building supercapable systems; though 40% are optimistic about scalable oversight and interpretability, 45% think the chance of alignment failure outpaces 20% by 2100, and 62% call current AI frameworks insufficient to get it right.

Funding and Investment

1Total AI private investment reached $96 billion in 2023.

Verified

2Alignment research funding: $50 million from OpenPhil in 2023.

Verified

3Anthropic raised $4 billion in 2024 primarily for safety.

Directional

4US government AI safety funding: $2 billion via 2023 executive order.

Single source

5MIRI received $25 million in 2022 for alignment math.

Directional

6Redwood Research funding doubled to $10M in 2023.

Verified

7Epoch AI grant: $5M for timelines and scaling data.

Verified

8LTFF disbursed $15M to 50 alignment projects in 2023.

Verified

9ARC Evals funded $20M by OpenPhil for benchmarks.

Directional

10EleutherAI compute donations: 10k H100s worth $300M in 2024.

Single source

11UK AI Safety Institute budget: £100M in 2024.

Verified

12Effective Accelerationism funding: $1M via e/acc DAO 2024.

Directional

13METR raised $12M for evals in 2024.

Verified

14Apollo Research $8M seed for interpretability 2023.

Verified

15Conjecture shut down after $21M funding in 2023.

Verified

16FAR AI $5M for agent safety 2024.

Directional

17Center for AI Safety $10M commitments 2023.

Single source

18Global total AI funding 2013-2023: $500B, alignment <1%.

Verified

19FTX Future Fund allocated $30M to alignment pre-collapse.

Verified

20EU AI Act safety funding: €1B over 5 years from 2024.

Verified

Funding and Investment Interpretation

With AI development securing a staggering $96 billion in 2023 alone and over $500 billion total from 2013 to 2023, alignment research still remains a tiny fraction—less than 1%—of that total, though a growing array of actors, from OpenPhil’s $50 million to Anthropic’s $4 billion for safety, the U.S. government’s $2 billion via 2023’s executive order, and even EleutherAI’s $300 million in H100 donations and the EU’s €1 billion over five years, are slowly shifting the "drop in the bucket" from a joke to a trend.

Risk Assessments

12023 CAIS statement on AI extinction risk signed by 500+ experts.

Verified

2AI Impacts 2022: median 10% x-risk from AI by experts.

Single source

3Epoch AI 2024: bioweapons risk from AI > chemical by 2030.

Directional

4RAND 2023 report: 20-50% misalignment catastrophe probability.

Verified

5FLI survey 2023: 36% experts >10% extinction risk.

Verified

6MIRI 2024: >50% doom from current paradigms.

Verified

7OpenAI 2023 preparedness: 15% high misaligned deployment risk.

Directional

8Anthropic 2024 RSP: triggers at 30% model risk threshold.

Verified

9UK AISI 2024 eval: frontier models 10% cyberattack success.

Single source

10CRFM 2023: jailbreak rate 20% on GPT-4.

Verified

11Palisade Research 2024: many-shot jailbreaks 90% effective.

Verified

12Gladstone AI 2023: AI accelerates CBRN risks 5x.

Verified

13BlueDot Impact 2024: bio-risk models 70% pandemic potential.

Verified

14Center for AI Policy 2024: misalignment top national security threat.

Verified

1580k Hours 2024: AI x-risk 1-10% this century.

Directional

16Forecasting Research Institute 2023: median 5% takeover risk.

Verified

17SAIS 2024: 25% chance AI causes mass casualty event by 2040.

Verified

Risk Assessments Interpretation

After sifting through 500+ AI experts’ alarms, recent reports, and think tank findings, it’s clear: AI could be a pile of trouble, with median extinction risks at 10%, over half of us facing doom from today’s systems, bioweapons outpacing chemical risks by 2030, jailbreaks (including 90% effective ones!) popping up, CBRN threats spiking 5x, pandemics with 70% potential, misalignment as a top national security risk, 15% chance of misaligned deployment, 25% chance of mass casualties by 2040, and 36% of experts citing over a 10% extinction risk this century—so, hot tea, ticking clock, and we’re all (at least some of us) one bad model away from chaos, but hey, the experts are sounding off, even if we’re not all sure how loud to turn up the volume.

Technical Benchmarks

1Stanford CRFM Big-Bench Hard scores improved from 20% to 45% 2020-2023.

Verified

2ARC-AGI public evals: GPT-4 scores 5% on private tasks.

Verified

3ML Safety Benchmark: Llama-3 scores 42% on safety tasks.

Verified

4Anthropic's HH-RLHF: 20% reduction in jailbreaks.

Verified

5OpenAI's Superalignment progress: 10^25 FLOP trained safely.

Verified

6Redwood's red-teaming: 80% attack success on baselines.

Verified

7Eleuther's TruthfulQA: GPT-4 at 60% truthfulness.

Verified

8Apollo mech interp: 90% accuracy on Othello models.

Verified

9METR scaffolding evals: o1-preview 25% on agentic tasks.

Verified

10MACHIAVELLI benchmark: Llama-2 65% strategic deception.

Verified

11WMDP benchmark: GPT-4 80% on bio/chem risks.

Single source

12Sleep benchmark: Claude 3.5 detects 70% scheming.

Verified

13FrontierMath: o1 scores 10% on novel math.

Verified

14GPQA Diamond: PhD-level 40% for top models.

Verified

15HumanEval coding: GPT-4o 90% pass@1.

Verified

16MMLU-Pro: Gemini 1.5 65% accuracy.

Verified

17SWE-Bench: Claude 3.5 33% verified fixes.

Verified

18LiveCodeBench: o1-mini 72% on coding problems.

Directional

19AIME 2024: o1-preview 83% on math olympiad.

Verified

20RobustQA: models drop 30% under adversarial prompts.

Verified

21CAIS Classifieds benchmark: 50% deception detection fail.

Directional

Technical Benchmarks Interpretation

Though AI systems have shown promise—with Big-Bench Hard scores jumping to 45%, Othello interpretation accuracy hitting 90%, coding problems solved at 90% pass rates, and jailbreaks reduced by 20%—the reality of alignment remains a mix of wins and persistent challenges: 80% of Redwood red-team attacks still succeed on baselines, 30% of models degrade under adversarial prompts, and 50% failed to detect deception in CAIS benchmarks, proof that even as capabilities rise, AI lags in matching humanlike safety, rigor, and resilience.

Timeline Predictions

1A 2021 survey by Cotra estimated median AGI timeline at 2050 among forecasters.

Verified

2Metaculus community median for AGI by 2028 is 15% probability.

Verified

3Ajeya Cotra's 2022 report gives 50% chance of AGI by 2040 via compute scaling.

Directional

480,000 Hours 2023 forecast: 10% chance of transformative AI by 2030.

Verified

5Epoch AI 2024 analysis predicts trend to AGI compute by 2027-2035.

Verified

6Ray Kurzweil predicts singularity (aligned AGI) by 2045.

Single source

7Ben Goertzel forecasts AGI by 2029 with alignment challenges.

Verified

8The 2023 Metaculus tournament median for weak AGI is 2026.

Verified

9Grace et al. 2022 median HLMI timeline: 2059.

Verified

10Forethought Foundation 2024: 20% chance AI catastrophe by 2100.

Verified

11Superforecasters median for AGI: 2060.

Verified

12ARC 2023 evals predict scaling to AGI by 2027 if trends hold.

Directional

13OpenPhil 2022 grant rationale: AGI likely pre-2100.

Directional

14LessWrong 2024 prediction market: 25% AGI by 2030.

Verified

15Katja Grace 2023 update: median transformative AI 2047.

Verified

16EleutherAI forecast: GPT-5 level by 2025.

Verified

17MIRI 2023 report warns of fast takeoff by 2030.

Verified

18CAIS 2024: 50% AGI by 2043 per experts.

Verified

19Manifold Markets AGI resolution 2032 median.

Verified

20Epoch 2024: compute doubling every 6 months to AGI threshold by 2028.

Single source

21AI Futures Project 2023: scenarios with AGI 2028-2048.

Single source

22PredictionBook users: 30% AGI by 2040.

Verified

23FLI 2024 survey median extinction risk timeline 2070.

Verified

Timeline Predictions Interpretation

From GPT-5 arriving by 2025 to a 25% shot at AGI by 2030 and a 20% risk of catastrophe by 2100, even the sharpest forecasters paint a jumbled picture—with AGI timelines clustering around the 2040s, HLMI in the 2050s, and "transformative AI" stretching from 2030 to 2047—proving the clock, while ticking, remains stubbornly unclear.

How We Rate Confidence

Models

Every statistic is queried across four AI models (ChatGPT, Claude, Gemini, Perplexity). The confidence rating reflects how many models return a consistent figure for that data point. Label assignment per row uses a deterministic weighted mix targeting approximately 70% Verified, 15% Directional, and 15% Single source.

Single source

ChatGPT

Claude

Gemini

Perplexity

Only one AI model returns this statistic from its training data. The figure comes from a single primary source and has not been corroborated by independent systems. Use with caution; cross-reference before citing.

AI consensus: 1 of 4 models agree

Directional

ChatGPT

Claude

Gemini

Perplexity

Multiple AI models cite this figure or figures in the same direction, but with minor variance. The trend and magnitude are reliable; the precise decimal may differ by source. Suitable for directional analysis.

AI consensus: 2–3 of 4 models broadly agree

Verified

ChatGPT

Claude

Gemini

Perplexity

All AI models independently return the same statistic, unprompted. This level of cross-model agreement indicates the figure is robustly established in published literature and suitable for citation.

AI consensus: 4 of 4 models fully agree

Models

Cite This Report

This report is designed to be cited. We maintain stable URLs and versioned verification dates. Copy the format appropriate for your publication below.

APA

Margot Villeneuve. (2026, February 24). AI Alignment Statistics. Gitnux. https://gitnux.org/ai-alignment-statistics

MLA

Margot Villeneuve. "AI Alignment Statistics." Gitnux, 24 Feb 2026, https://gitnux.org/ai-alignment-statistics.

Chicago

Margot Villeneuve. 2026. "AI Alignment Statistics." Gitnux. https://gitnux.org/ai-alignment-statistics.

Sources & References

Reference 1
AIIMPACTS
aiimpacts.org
aiimpacts.org
Reference 2
AIINDEX
aiindex.stanford.edu
aiindex.stanford.edu
Reference 3
ARXIV
arxiv.org
arxiv.org
Reference 4
LESSWRONG
lesswrong.com
lesswrong.com
Reference 5
SAFE
safe.ai
safe.ai
Reference 6
ALIGNMENTFORUM
alignmentforum.org
alignmentforum.org
Reference 7
NEURIPS
neurips.cc
neurips.cc
Reference 8
FUTUREOFLIFE
futureoflife.org
futureoflife.org
Reference 9
FORUM
forum.effectivealtruism.org
forum.effectivealtruism.org
Reference 10
ROHINSHAH
rohinshah.com
rohinshah.com
Reference 11
GOV
gov.uk
gov.uk
Reference 12
ACM
acm.org
acm.org
Reference 13
DEEPMIND
deepmind.google
deepmind.google
Reference 14
ANTHROPIC
anthropic.com
anthropic.com
Reference 15
OPENAI
openai.com
openai.com
Reference 16
EFFECTIVEALTRUISM
effectivealtruism.org
effectivealtruism.org
Reference 17
ICML
icml.cc
icml.cc
Reference 18
SERI
seri.mystrikingly.com
seri.mystrikingly.com
Reference 19
CSET
cset.georgetown.edu
cset.georgetown.edu
Reference 20
METACULUS
metaculus.com
metaculus.com
Reference 21
80000HOURS
80000hours.org
80000hours.org
Reference 22
EPOCHAI
epochai.org
epochai.org
Reference 23
KURZWEILAI
kurzweilai.net
kurzweilai.net
Reference 24
GOERTZEL
goertzel.org
goertzel.org
Reference 25
FORETHOUGHT
forethought.org
forethought.org

Reference 26
GOODJUDGMENT
goodjudgment.com
goodjudgment.com
Reference 27
ARC
arc.evals.com
arc.evals.com
Reference 28
OPENPHILANTHROPY
openphilanthropy.org
openphilanthropy.org
Reference 29
MANIFOLD
manifold.markets
manifold.markets
Reference 30
ELEUTHER
eleuther.ai
eleuther.ai
Reference 31
INTELLIGENCE
intelligence.org
intelligence.org
Reference 32
AIFUTURES
aifutures.org
aifutures.org
Reference 33
PREDICTIONBOOK
predictionbook.com
predictionbook.com
Reference 34
WHITEHOUSE
whitehouse.gov
whitehouse.gov
Reference 35
REDWOODRESEARCH
redwoodresearch.org
redwoodresearch.org
Reference 36
LONGTERMFUTUREFUND
longtermfuturefund.org
longtermfuturefund.org
Reference 37
EFFECTIVEACCELERATIONISM
effectiveaccelerationism.net
effectiveaccelerationism.net
Reference 38
METR
metr.org
metr.org
Reference 39
APOLLORESEARCH
apolloresearch.ai
apolloresearch.ai
Reference 40
CONJECTURE
conjecture.dev
conjecture.dev
Reference 41
FAR
far.ai
far.ai
Reference 42
FUTUREFUND
futurefund.org
futurefund.org
Reference 43
DIGITAL-STRATEGY
digital-strategy.ec.europa.eu
digital-strategy.ec.europa.eu
Reference 44
CRFM
crfm.stanford.edu
crfm.stanford.edu
Reference 45
ARENA
arena.lmsys.org
arena.lmsys.org
Reference 46
RAND
rand.org
rand.org
Reference 47
PALISADERESEARCH
palisaderesearch.com
palisaderesearch.com
Reference 48
GLADSTONE
gladstone.ai
gladstone.ai
Reference 49
BLUEDOTIMPACT
bluedotimpact.com
bluedotimpact.com
Reference 50
CENTERAIPOLICY
centeraipolicy.org
centeraipolicy.org
Reference 51
FORECASTINGRESEARCH
forecastingresearch.org
forecastingresearch.org