GITNUXREPORT 2025

Data Type Statistics

Most global data in two years, unstructured, user-driven, growing rapidly.

Jannik Lindner

Jannik Linder

Co-Founder of Gitnux, specialized in content and tech since 2016.

First published: April 29, 2025

Our Commitment to Accuracy

Rigorous fact-checking • Reputable sources • Regular updatesLearn more

Key Statistics

Statistic 1

Over 80% of global data was created in just the last two years

Statistic 2

The global datasphere is expected to reach 175 zettabytes by 2025

Statistic 3

90% of the world's data has been generated in the past two years

Statistic 4

The average data volume per company is projected to be 163 zettabytes by 2025

Statistic 5

80% of data in organizations is unstructured data

Statistic 6

The median size of data files for modern AI applications is around 16 terabytes

Statistic 7

The fastest growing data type is video data, increasing by 15% annually

Statistic 8

90% of data stored today has been generated in the last two years

Statistic 9

The average number of data sources per enterprise has increased to 1,200

Statistic 10

Structured data accounts for approximately 20% of all data in enterprises

Statistic 11

60% of global data traffic is generated by video streaming

Statistic 12

The volume of image data is expected to grow at a CAGR of 21% through 2025

Statistic 13

Globally, data-related jobs are expected to grow by 23% between 2021 and 2031

Statistic 14

By 2025, the number of connected IoT devices is expected to reach 30.9 billion, generating vast amounts of data

Statistic 15

85% of cloud data workloads are stored in multi-cloud environments

Statistic 16

The average size of data warehouses has increased by 45% over the past three years

Statistic 17

Data stored in cloud data lakes increased by 28% in 2022

Statistic 18

The retail industry generates over 2.5 quintillion bytes of data daily

Statistic 19

Digital transformation initiatives are responsible for a 33% increase in enterprise data volume

Statistic 20

By 2023, 90% of new enterprise data will be stored in a multi-cloud or hybrid cloud environment

Statistic 21

The average number of data breaches per year has increased by 67% over the past five years

Statistic 22

32% of global data traffic is generated by enterprise IoT devices

Statistic 23

The total number of digital payments worldwide reached 521 billion in 2022, generating massive transaction data

Statistic 24

The data management market size is expected to reach $145 billion by 2027

Statistic 25

68% of organizations report data quality issues affecting their analytics

Statistic 26

Nearly 60% of data scientists say that data quality issues impact model performance

Statistic 27

45% of organizations lack a clear data strategy

Statistic 28

Only 29% of organizations report having a high level of data literacy among employees

Statistic 29

By 2024, 70% of enterprise data will be processed at the edge

Statistic 30

55% of data engineers report challenges with data governance

Statistic 31

The proportion of sensitive data being encrypted in transit is about 87%

Statistic 32

77% of organizations cite data privacy regulations as a major data management challenge

Statistic 33

The average cost of a data breach in 2023 is estimated at $4.45 million

Statistic 34

65% of organizations are investing in data catalog tools to improve data discovery

Statistic 35

50% of data science projects fail to deliver expected ROI due to poor data management

Statistic 36

72% of organizations are increasing their data security investments

Statistic 37

The global data integration market is projected to grow to $23.7 billion by 2025

Statistic 38

65% of companies are adopting data governance frameworks to ensure data quality

Statistic 39

Only 17% of data within organizations is stored in a standardized, easily accessible format

Statistic 40

The average cost of managing and storing a single petabyte of data is approximately $1.5 million annually

Statistic 41

Nearly 80% of data used in AI models is unstructured

Statistic 42

The most common data format for big data is JSON, followed by XML and CSV

Statistic 43

80% of global data is unstructured, such as video, social media posts, and emails

Statistic 44

78% of organizations use some form of machine-readable data formats for automation

Statistic 45

The adoption rate of blockchain technology for data integrity is projected to reach 50% by 2024

Statistic 46

70% of organizations plan to implement real-time data analytics solutions within the next two years

Statistic 47

The use of GraphQL for data querying is increasing by 50% annually among API developers

Statistic 48

76% of companies using big data technologies reported competitive advantages

Statistic 49

Approximately 92% of data users believe data literacy is essential for effective data use

Statistic 50

85% of organizations believe that data-driven decision-making improves business performance

Slide 1 of 50
Share:FacebookLinkedIn
Sources

Our Reports have been cited by:

Trust Badges - Publications that have cited our reports

Key Highlights

  • Over 80% of global data was created in just the last two years
  • The global datasphere is expected to reach 175 zettabytes by 2025
  • 90% of the world's data has been generated in the past two years
  • The average data volume per company is projected to be 163 zettabytes by 2025
  • 68% of organizations report data quality issues affecting their analytics
  • 80% of data in organizations is unstructured data
  • The median size of data files for modern AI applications is around 16 terabytes
  • 76% of companies using big data technologies reported competitive advantages
  • The adoption rate of blockchain technology for data integrity is projected to reach 50% by 2024
  • Nearly 60% of data scientists say that data quality issues impact model performance
  • The fastest growing data type is video data, increasing by 15% annually
  • 90% of data stored today has been generated in the last two years
  • The average number of data sources per enterprise has increased to 1,200

Did you know that over 80% of the world’s data has been generated in just the last two years, fueling a data explosion that is reshaping industries and challenging organizations to master the complex world of data types?

Data Growth and Volume Trends

  • Over 80% of global data was created in just the last two years
  • The global datasphere is expected to reach 175 zettabytes by 2025
  • 90% of the world's data has been generated in the past two years
  • The average data volume per company is projected to be 163 zettabytes by 2025
  • 80% of data in organizations is unstructured data
  • The median size of data files for modern AI applications is around 16 terabytes
  • The fastest growing data type is video data, increasing by 15% annually
  • 90% of data stored today has been generated in the last two years
  • The average number of data sources per enterprise has increased to 1,200
  • Structured data accounts for approximately 20% of all data in enterprises
  • 60% of global data traffic is generated by video streaming
  • The volume of image data is expected to grow at a CAGR of 21% through 2025
  • Globally, data-related jobs are expected to grow by 23% between 2021 and 2031
  • By 2025, the number of connected IoT devices is expected to reach 30.9 billion, generating vast amounts of data
  • 85% of cloud data workloads are stored in multi-cloud environments
  • The average size of data warehouses has increased by 45% over the past three years
  • Data stored in cloud data lakes increased by 28% in 2022
  • The retail industry generates over 2.5 quintillion bytes of data daily
  • Digital transformation initiatives are responsible for a 33% increase in enterprise data volume
  • By 2023, 90% of new enterprise data will be stored in a multi-cloud or hybrid cloud environment
  • The average number of data breaches per year has increased by 67% over the past five years
  • 32% of global data traffic is generated by enterprise IoT devices
  • The total number of digital payments worldwide reached 521 billion in 2022, generating massive transaction data

Data Growth and Volume Trends Interpretation

With over 80% of global data crafted in just the past two years and video content fueling a 15% annual surge, it's clear that in our relentless quest for digital dominance, organizations are swimming in an ocean of unstructured data—highlighting that managing this exponential growth isn't just a challenge but the new business frontier.

Data Management

  • The data management market size is expected to reach $145 billion by 2027

Data Management Interpretation

With the data management market projected to hit $145 billion by 2027, it's clear that managing data isn't just a backend task—it's a billion-dollar affair shaping the future of business innovation.

Data Management, Quality, and Security

  • 68% of organizations report data quality issues affecting their analytics
  • Nearly 60% of data scientists say that data quality issues impact model performance
  • 45% of organizations lack a clear data strategy
  • Only 29% of organizations report having a high level of data literacy among employees
  • By 2024, 70% of enterprise data will be processed at the edge
  • 55% of data engineers report challenges with data governance
  • The proportion of sensitive data being encrypted in transit is about 87%
  • 77% of organizations cite data privacy regulations as a major data management challenge
  • The average cost of a data breach in 2023 is estimated at $4.45 million
  • 65% of organizations are investing in data catalog tools to improve data discovery
  • 50% of data science projects fail to deliver expected ROI due to poor data management
  • 72% of organizations are increasing their data security investments
  • The global data integration market is projected to grow to $23.7 billion by 2025
  • 65% of companies are adopting data governance frameworks to ensure data quality
  • Only 17% of data within organizations is stored in a standardized, easily accessible format
  • The average cost of managing and storing a single petabyte of data is approximately $1.5 million annually

Data Management, Quality, and Security Interpretation

Despite nearly two-thirds of organizations wrestling with data quality issues and a sizable chunk lacking clear strategies, the escalating reliance on edge processing, encryption, and governance frameworks underscores that in the data-driven landscape, mastering quality, privacy, and accessibility remains both a pressing challenge and a costly necessity.

Data Types and Formats

  • Nearly 80% of data used in AI models is unstructured
  • The most common data format for big data is JSON, followed by XML and CSV
  • 80% of global data is unstructured, such as video, social media posts, and emails
  • 78% of organizations use some form of machine-readable data formats for automation

Data Types and Formats Interpretation

With nearly 80% of data remaining unstructured—think videos, emails, and social media posts—it's no wonder that organizations are increasingly turning to machine-readable formats like JSON and XML to tame this chaotic digital wilderness.

Emerging Technologies and Adoption

  • The adoption rate of blockchain technology for data integrity is projected to reach 50% by 2024
  • 70% of organizations plan to implement real-time data analytics solutions within the next two years
  • The use of GraphQL for data querying is increasing by 50% annually among API developers

Emerging Technologies and Adoption Interpretation

With half of organizations embracing blockchain for data integrity by 2024, over two-thirds eye real-time analytics as the future’s pulse, while GraphQL’s rising 50% annual climb signals a rapidly evolving landscape where data querying is becoming smarter and faster—but perhaps more competitive.

Industry and Organizational Data Insights

  • 76% of companies using big data technologies reported competitive advantages
  • Approximately 92% of data users believe data literacy is essential for effective data use
  • 85% of organizations believe that data-driven decision-making improves business performance

Industry and Organizational Data Insights Interpretation

With over three-quarters of companies gaining a competitive edge through big data, and nearly all data users recognizing data literacy as essential, it’s clear that mastering data is no longer optional but the digital currency of business success in the modern landscape.