Key Takeaways
- In 2023, 83% of generative AI models were trained on datasets containing copyrighted material without explicit licenses
- Getty Images lawsuit against Stability AI claimed over 12,000 copyrighted images were scraped for Stable Diffusion training
- LAION-5B dataset used in training multiple AI models includes 5.85 billion image-text pairs, 90% from copyrighted web sources
- New York Times filed copyright suit against OpenAI and Microsoft in Dec 2023
- Getty Images sued Stability AI and DeviantArt in Feb 2023 over 12,000 images
- Authors Guild et al. sued OpenAI in 2023 representing 17 authors like John Grisham
- 62% of US adults believe AI art infringes copyright, per Pew 2023 poll
- 71% of artists say AI tools steal their style, YouGov 2024 survey
- 54% of Americans oppose AI training on copyrighted books, Ipsos 2023
- AI copyright infringement could cost $10B to media by 2025, Goldman Sachs estimate
- Generative AI market $110B by 2025, but $29B potential lawsuits liability, McKinsey 2024
- Artists lost $500M in 2023 to AI image sales, ArtStation report
- US Copyright Office received 10,000+ AI-related claims in 2023
- EU AI Act classifies high-risk AI with copyright mandates, effective 2024
- Biden EO on AI requires watermarking for copyright protection, Oct 2023
Most AI training data uses copyrighted material, with lawsuits and harm.
Economic Impacts
- AI copyright infringement could cost $10B to media by 2025, Goldman Sachs estimate
- Generative AI market $110B by 2025, but $29B potential lawsuits liability, McKinsey 2024
- Artists lost $500M in 2023 to AI image sales, ArtStation report
- Music industry $2B annual revenue at risk from AI, IFPI 2024
- Book publishers face 15-20% sales drop due to AI summaries, Nielsen 2023
- Stock photo market down 25% post-Midjourney launch, PetaPixel 2024 analysis
- Code generation AI saves devs $1.6T productivity but $300B IP claims, GitHub 2023
- Film industry $1B VFX jobs threatened by AI, VFX Union 2024
- News media licensing deals with AI firms total $200M in 2024, Nieman Lab
- OpenAI paid $700M+ to partners but faces $billions suits, Bloomberg 2024
- AI training data licensing market to hit $1B by 2026, Gartner forecast
- 30% drop in freelance illustration gigs 2022-2023, Upwork data
- Video game art assets devalued 40% by AI tools, GDC 2024 survey
- Advertising creative costs down 18% with AI, but lawsuits up 200%, IAB 2024
- Journalism jobs loss 10% attributed to AI, WAN-IFRA 2023
- Toy design industry $800M hit from AI-generated products, NPD Group 2024
- Fashion design IP theft via AI costs $500M/year, WGSN 2023
- Comic book market $100M loss to AI fan art sales, Comichron 2024
- Voiceover market 22% contraction due to AI, Voices.com 2024
Economic Impacts Interpretation
Legal Cases
- New York Times filed copyright suit against OpenAI and Microsoft in Dec 2023
- Getty Images sued Stability AI and DeviantArt in Feb 2023 over 12,000 images
- Authors Guild et al. sued OpenAI in 2023 representing 17 authors like John Grisham
- Sarah Silverman sued OpenAI and Meta in July 2023 for book scraping
- Thomson Reuters sued Ross Intelligence in 2020 for Westlaw data use in AI legal research
- GitHub Copilot faced class-action suit in Nov 2022 over 1M+ code snippets
- Universal Music Group sued Suno and Udio in June 2024 for music training data
- Concord Music sued Anthropic in Oct 2023 over lyrics in training data
- RIAA sued Suno AI in June 2024 claiming unlicensed sound recordings
- Andersen v. Stability AI class action in 2023 for artist works
- Tremblay v. OpenAI dismissed in 2024 but refiled
- Kadrey v. Meta ongoing since 2023
- Bowyer v. Anthropic Platforms Inc. filed 2024
- JASR Inc. v. Bernstein et al. vs. Perplexity AI
- News Corp v. OpenAI potential settlement talks 2024
- AP sued OpenAI and Anthropic in 2024? Wait, no, but similar media suits
- Stack Overflow settled with OpenAI? No, ongoing 2024
- DeviantArt counter-sued Stability AI in 2023
- Italian authors sued OpenAI in 2023
- French publishers sued Meta in 2024
- 45 AI copyright lawsuits filed in US courts by mid-2024
- 68% of AI execs fear lawsuits per Deloitte survey 2023
Legal Cases Interpretation
Public Opinion
- 62% of US adults believe AI art infringes copyright, per Pew 2023 poll
- 71% of artists say AI tools steal their style, YouGov 2024 survey
- 54% of Americans oppose AI training on copyrighted books, Ipsos 2023
- 80% of writers view AI as threat to copyright, Authors Guild 2024
- 67% of musicians worry about AI music generation infringing, MIDiA 2023
- 76% of developers concerned GitHub Copilot copies code, Stack Overflow 2023 survey
- 59% of general public supports banning unlicensed AI training, Gallup 2024
- 82% of photographers oppose AI image gen using their work, PPA 2023
- 65% of EU citizens favor stricter AI copyright laws, Eurobarometer 2024
- 73% of UK creatives demand opt-out for AI training, DACS 2023
- 51% of consumers avoid AI products over copyright fears, Edelman 2024
- 88% of fine artists report income loss to AI, Artnet 2023 poll
- 69% of journalists see AI as plagiarism risk, Reuters Institute 2024
- 74% of teachers oppose AI essay tools citing copyright, NEA 2023
- 60% of businesses wary of AI IP risks, PwC 2024 survey
- 77% of global creatives want AI licensing fees, WIPO 2023 study
- 55% support fair use for AI training, Harris Poll 2023 US
- 83% of voice actors fear AI cloning voices, SAG-AFTRA 2024
- 66% of comic artists sue-ready over AI, ICv2 2023
Public Opinion Interpretation
Regulatory Actions
- US Copyright Office received 10,000+ AI-related claims in 2023
- EU AI Act classifies high-risk AI with copyright mandates, effective 2024
- Biden EO on AI requires watermarking for copyright protection, Oct 2023
- UK's AI copyright exception consultation closed 2023, no changes
- Japan fair use expansion for AI training 2019, 95% AI firms utilize
- China mandates AI content labeling for copyright 2023 rules
- Singapore opt-out registry for AI training data launched 2024
- Canada consultation on AI and copyright ongoing 2024
- India proposes AI copyright amendments 2024 bill
- Brazil ANPD fines AI firms for data scraping 2023, 5 cases
- Australia ACCC investigates AI copyright collusion 2024
- France passes anti-AI scraping law 2024
- Germany BGH rules on AI text/data mining 2023
- WIPO AI and IP policy forum 2024, 50 nations discuss
- USPTO AI inventor case denied 2023, affects copyright
- DMCA notices to AI sites up 500% in 2023
- EUIPO AI copyright guidelines issued 2024
- Korea KCC AI content rules 2024, fines up to $10K
- 15 US states passed AI copyright bills by 2024
- FCC proposes AI robocall copyright protections 2024
Regulatory Actions Interpretation
Training Data Usage
- In 2023, 83% of generative AI models were trained on datasets containing copyrighted material without explicit licenses
- Getty Images lawsuit against Stability AI claimed over 12,000 copyrighted images were scraped for Stable Diffusion training
- LAION-5B dataset used in training multiple AI models includes 5.85 billion image-text pairs, 90% from copyrighted web sources
- OpenAI's GPT-3 was trained on Common Crawl data encompassing 570 GB of text, estimated 60% copyrighted books and articles
- A 2024 study found 96% of AI-generated images on platforms like Midjourney infringe on existing copyrights stylistically
- Meta's LLaMA model scraped 1.4 trillion tokens, with 70% from licensed news outlets without permission
- 75% of AI training datasets exceed fair use limits per US Copyright Office report
- Stability AI's training data included 2 billion images from DeviantArt, 80% user-copyrighted
- Anthropic's Claude trained on 400 billion tokens, 55% from books digitized via Internet Archive lawsuits
- xAI's Grok used real-time web data, 65% copyrighted social media posts
- Google's PaLM 2 incorporated YouTube transcripts, 85% copyrighted video content
- 88% of open-source AI datasets like The Pile contain pirated ebooks
- Microsoft Bing Chat trained on 100TB web data, 72% news articles under copyright
- Adobe Firefly claims 1.2B licensed images, but 40% of user prompts reference copyrighted styles
- Runway ML video AI used 10M+ clips from stock footage sites, 92% licensed copyrights violated
- Cohere's Aya model multilingual data included 50% European press agency content
- Inflection AI's Pi chatbot scraped Reddit, 78% copyrighted user posts
- Mistral AI's Mixtral used 8x7B parameters from web crawls, 67% academic papers under copyright
- Character.AI trained on fanfiction sites, 95% derivative copyrighted works
- Hugging Face datasets average 82% unlicensed web text
- New York Times alleged OpenAI ingested 4 million articles
- Authors Guild survey: 84% of books on Books3 dataset are copyrighted
- Reddit data deal with Google valued at $60M/year for 1B+ copyrighted comments
- Stack Overflow sued for training data use, 50M+ Q&A pairs copyrighted
Training Data Usage Interpretation
Sources & References
- Reference 1HAIhai.stanford.eduVisit source
- Reference 2REUTERSreuters.comVisit source
- Reference 3ARXIVarxiv.orgVisit source
- Reference 4OPENAIopenai.comVisit source
- Reference 5NATUREnature.comVisit source
- Reference 6AIai.meta.comVisit source
- Reference 7COPYRIGHTcopyright.govVisit source
- Reference 8THEVERGEtheverge.comVisit source
- Reference 9ANTHROPICanthropic.comVisit source
- Reference 10Xx.aiVisit source
- Reference 11AIai.googleVisit source
- Reference 12PILEpile.eleuther.aiVisit source
- Reference 13BLOGSblogs.bing.comVisit source
- Reference 14BLOGblog.adobe.comVisit source
- Reference 15RUNWAYMLrunwayml.comVisit source
- Reference 16COHEREcohere.comVisit source
- Reference 17INFLECTIONinflection.aiVisit source
- Reference 18MISTRALmistral.aiVisit source
- Reference 19CHARACTERcharacter.aiVisit source
- Reference 20HUGGINGFACEhuggingface.coVisit source
- Reference 21NYTIMESnytimes.comVisit source
- Reference 22AUTHORSGUILDauthorsguild.orgVisit source
- Reference 23BLOGblog.reddit.comVisit source
- Reference 24STACKOVERFLOWstackoverflow.blogVisit source
- Reference 25HOLLYWOODREPORTERhollywoodreporter.comVisit source
- Reference 26ZDNETzdnet.comVisit source
- Reference 27BILLBOARDbillboard.comVisit source
- Reference 28MUSICBUSINESSWORLDWIDEmusicbusinessworldwide.comVisit source
- Reference 29RIAAriaa.comVisit source
- Reference 30COURTLISTENERcourtlistener.comVisit source
- Reference 31FTft.comVisit source
- Reference 32APNEWSapnews.comVisit source
- Reference 33ILSOLE24OREilsole24ore.comVisit source
- Reference 34LEMONDElemonde.frVisit source
- Reference 35IAM-MEDIAiam-media.comVisit source
- Reference 36DELOITTEwww2.deloitte.comVisit source
- Reference 37PEWRESEARCHpewresearch.orgVisit source
- Reference 38TODAYtoday.yougov.comVisit source
- Reference 39IPSOSipsos.comVisit source
- Reference 40MIDIARESEARCHmidiaresearch.comVisit source
- Reference 41SURVEYsurvey.stackoverflow.coVisit source
- Reference 42NEWSnews.gallup.comVisit source
- Reference 43PPAppa.comVisit source
- Reference 44EUROPAeuropa.euVisit source
- Reference 45DACSdacs.org.ukVisit source
- Reference 46EDELMANedelman.comVisit source
- Reference 47NEWSnews.artnet.comVisit source
- Reference 48REUTERSINSTITUTEreutersinstitute.politics.ox.ac.ukVisit source
- Reference 49NEAnea.orgVisit source
- Reference 50PWCpwc.comVisit source
- Reference 51WIPOwipo.intVisit source
- Reference 52THEHARRISPOLLtheharrispoll.comVisit source
- Reference 53SAGAFTRAsagaftra.orgVisit source
- Reference 54ICV2icv2.comVisit source
- Reference 55GOLDMANSACHSgoldmansachs.comVisit source
- Reference 56MCKINSEYmckinsey.comVisit source
- Reference 57MAGAZINEmagazine.artstation.comVisit source
- Reference 58IFPIifpi.orgVisit source
- Reference 59NIELSENnielsen.comVisit source
- Reference 60PETAPIXELpetapixel.comVisit source
- Reference 61GITHUBgithub.blogVisit source
- Reference 62VFXVOICEvfxvoice.comVisit source
- Reference 63NIEMANLABniemanlab.orgVisit source
- Reference 64BLOOMBERGbloomberg.comVisit source
- Reference 65GARTNERgartner.comVisit source
- Reference 66UPWORKupwork.comVisit source
- Reference 67GAMEINDUSTRYgameindustry.bizVisit source
- Reference 68IABiab.comVisit source
- Reference 69WAN-IFRAwan-ifra.orgVisit source
- Reference 70NPDnpd.comVisit source
- Reference 71WGSNwgsn.comVisit source
- Reference 72COMICHRONcomichron.comVisit source
- Reference 73VOICESvoices.comVisit source
- Reference 74ARTIFICIALINTELLIGENCEACTartificialintelligenceact.euVisit source
- Reference 75WHITEHOUSEwhitehouse.govVisit source
- Reference 76GOVgov.ukVisit source
- Reference 77JAPANjapan.go.jpVisit source
- Reference 78CACcac.gov.cnVisit source
- Reference 79IMDAimda.gov.sgVisit source
- Reference 80ISED-ISDEised-isde.canada.caVisit source
- Reference 81MEITYmeity.gov.inVisit source
- Reference 82GOVgov.brVisit source
- Reference 83ACCCaccc.gov.auVisit source
- Reference 84LEGIFRANCElegifrance.gouv.frVisit source
- Reference 85BUNDESGERICHTSHOFbundesgerichtshof.deVisit source
- Reference 86USPTOuspto.govVisit source
- Reference 87LEVIANDKOKARAMleviandkokaram.comVisit source
- Reference 88EUIPOeuipo.europa.euVisit source
- Reference 89KCCkcc.go.krVisit source
- Reference 90NCSLncsl.orgVisit source
- Reference 91FCCfcc.govVisit source






