Key Takeaways
- Exploratory factor analysis of the SCL-90 supported a 9-factor structure explaining 58% of the variance (N=1,018)
- Confirmatory factor analysis (CFA) of the NEO-FFI Big Five factors showed a fit of CFI=0.92, RMSEA=0.06 (N=1,500)
- A hierarchical BDI-II model with a second-order depression factor showed a fit of CFI=0.95 (N=360)
- Concurrent validity between BDI-II and HRSD was r=0.72 (N=135 depressed patients)
- Against SCID diagnosis, the PHQ-9 had a sensitivity of 88% and a specificity of 88% (N=580)
- The AUDIT alcohol screen correlated with DSM-IV AUD at r=0.81 (N=7,000)
- Cronbach's alpha for the Beck Anxiety Inventory was 0.92 in a general population sample of 1,000
- Big Five Inventory (BFI) subscales had alpha coefficients from 0.79 to 0.87 (N=1,810 undergraduates)
- The PHQ-9 depression screener had an alpha of 0.89 (N=6,000 primary care patients)
- Kappa for interrater reliability on SCID-I diagnoses was 0.78 (95% CI 0.68-0.88, N=562)
- HAM-D rater agreement on the total score was ICC=0.89 (N=120 patients, 2 raters)
- ADOS-2 Module 1 interrater agreement was ICC=0.88 (N=438 children)
- In a 2018 meta-analysis of personality inventories, average test-retest reliability for Big Five traits over 1-month intervals was r=0.82 (95% CI: 0.79-0.85, k=45 studies)
- Beck Depression Inventory showed test-retest reliability of r=0.93 over 1 week in 200 psychiatric outpatients (SD=12.4)
- MMPI-2 clinical scales had test-retest correlations ranging from 0.67 to 0.92 over 1 week (mean r=0.79, N=486)
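Across the clinical and community samples listed above, reported reliability coefficients ranged from 0.67 to 0.93, criterion validity correlations from 0.72 to 0.81, and CFI values for the factor models from 0.92 to 0.95.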
01 · Category
Construct Validity · 19 stats
Construct Validity Interpretation
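The CFI and RMSEA values quoted in the key takeaways are derived from model chi-square statistics. The following is a minimal sketch of the commonly used formulas, not the computation used by any of the cited studies. The function names and the input values are hypothetical, and some software divides by N rather than N-1 in the RMSEA denominator.

```python
from math import sqrt


def rmsea(chi2, df, n):
    """Root mean square error of approximation for a fitted model.

    Assumes the common N-1 denominator; some packages use N instead.
    """
    if df <= 0 or n <= 1:
        raise ValueError("df must be positive and n must exceed 1")
    return sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


def cfi(chi2_model, df_model, chi2_base, df_base):
    """Comparative fit index: model misfit relative to a baseline model."""
    d_model = max(chi2_model - df_model, 0.0)
    d_base = max(chi2_base - df_base, d_model)
    if d_base == 0:
        return 1.0  # neither model shows misfit beyond its degrees of freedom
    return 1.0 - d_model / d_base


# Hypothetical chi-square values, for illustration only (not from the report).
example_rmsea = rmsea(chi2=150.0, df=50, n=501)
example_cfi = cfi(chi2_model=150.0, df_model=50, chi2_base=1300.0, df_base=66)
print(f"RMSEA={example_rmsea:.3f}, CFI={example_cfi:.3f}")
```

Both indices depend on how far the chi-square exceeds its degrees of freedom, which is why they tend to move together across models.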
02 · Category
Criterion Validity · 20 stats
Criterion Validity Interpretation
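Sensitivity and specificity figures such as those reported for the PHQ-9 against SCID diagnosis come from a 2x2 table of screen results versus the reference diagnosis. The sketch below shows the arithmetic. The function name and the counts are hypothetical, chosen only so that both rates equal 0.88, and they are not the data of the cited study.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from 2x2 confusion counts.

    tp/fn: reference-positive cases the screen flagged / missed.
    tn/fp: reference-negative cases the screen cleared / flagged.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Hypothetical counts, for illustration only (not from the report).
sens, spec = sensitivity_specificity(tp=88, fn=12, tn=440, fp=60)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Sensitivity is computed only over cases with the diagnosis and specificity only over cases without it, so neither depends on how common the condition is in the sample.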
03 · Category
Internal Consistency · 21 stats
Internal Consistency Interpretation
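The alpha coefficients in the key takeaways (for example 0.92 for the Beck Anxiety Inventory) summarize how strongly the items of a scale covary. A minimal sketch of the standard Cronbach's alpha formula follows, using only the Python standard library. The function name and the response matrix are hypothetical and are not taken from any cited dataset.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    if any(len(row) != k for row in scores):
        raise ValueError("every respondent must answer all k items")
    # Sum of the sample variances of the individual items (columns).
    item_var_sum = sum(variance(col) for col in zip(*scores))
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)


# Hypothetical responses: 5 respondents x 3 items (illustration only).
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]
alpha = cronbach_alpha(responses)
print(f"alpha={alpha:.3f}")
```

Alpha rises when the variance of the total score is large relative to the summed item variances, which happens when items move together across respondents.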
04 · Category
Interrater Reliability · 18 stats
Interrater Reliability Interpretation
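Kappa values such as the 0.78 reported for SCID-I diagnoses correct raw agreement between two raters for the agreement expected by chance. The following sketch implements Cohen's kappa for two raters with categorical labels. The function name, labels, and ratings are hypothetical, and the cited study may have used a different kappa variant.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("raters must label the same non-empty set of cases")
    n = len(rater_a)
    # Proportion of cases on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Hypothetical diagnoses for 10 cases (illustration only).
a = ["MDD", "MDD", "none", "none", "MDD", "none", "MDD", "none", "none", "MDD"]
b = ["MDD", "MDD", "MDD", "none", "MDD", "none", "MDD", "none", "none", "MDD"]
kappa = cohens_kappa(a, b)
print(f"kappa={kappa:.2f}")
```

In this illustration the raters agree on 9 of 10 cases, but because half of that agreement would be expected by chance, kappa is lower than the raw agreement rate.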
05 · Category
Test-Retest Reliability · 20 stats
Test-Retest Reliability Interpretation
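Test-retest coefficients like those listed in the key takeaways are typically Pearson correlations between scores from the same people at two time points. A minimal sketch follows. The function name and the score lists are hypothetical, and individual studies may instead report an intraclass correlation.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between paired scores from two administrations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two paired observations")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    dx = [xi - mean_x for xi in x]
    dy = [yi - mean_y for yi in y]
    sxx = sum(d * d for d in dx)
    syy = sum(d * d for d in dy)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant scores")
    sxy = sum(a * b for a, b in zip(dx, dy))
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for 5 people at two time points (illustration only).
time1 = [10, 14, 20, 25, 31]
time2 = [12, 13, 22, 24, 33]
r = pearson_r(time1, time2)
print(f"test-retest r={r:.2f}")
```

A high r indicates that people keep roughly the same rank order across administrations. It does not detect a uniform shift in scores, since adding a constant to every retest score leaves r unchanged.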
Cite This Report
This report is designed to be cited. We maintain stable URLs and versioned verification dates. Copy the format appropriate for your publication below.
Henrik Dahl. (2026, February 27). Reliability And Validity Statistics. Gitnux. https://gitnux.org/reliability-and-validity-statistics
Henrik Dahl. "Reliability And Validity Statistics." Gitnux, 27 Feb 2026, https://gitnux.org/reliability-and-validity-statistics.
Henrik Dahl. 2026. "Reliability And Validity Statistics." Gitnux. https://gitnux.org/reliability-and-validity-statistics.
Sources & references
16 datasets cited across this report · attribution is report-level

