CNN Sues Perplexity Over 17,000 Scraped Stories
Key insights
- CNN is the first TV network to sue an AI company for copyright, joining NYT and Chicago Tribune suits already active against Perplexity.
- Perplexity allegedly scraped 17,000+ CNN stories, photos, and videos without permission to power its real-time answer engine.
- The case specifically targets live scraping for inference, a legal question distinct from AI training-data copyright disputes.
Why this matters
The lawsuit is the first to directly challenge the live-scraping-for-inference pipeline that powers AI search products, rather than the historical training datasets most prior suits have targeted. If courts accept CNN's theory, Perplexity's core product would require licensing deals or structural redesign, with similar exposure for Google AI Overviews and Bing Copilot. CNN's entry as the first broadcast network broadens the plaintiff coalition beyond print publishers, signaling that legal pressure on AI search companies will expand across every content category.
Summary
CNN filed a federal copyright suit against Perplexity AI on May 28 in the Southern District of New York, the first television network to take this kind of legal action against an AI search company.
Perplexity is accused of scraping 17,000+ CNN stories, photos, and videos to power its real-time answer engine without payment or permission. Perplexity's response: 'you can't copyright facts.'
Essentially: (CNN, Perplexity AI) are testing whether live scraping for AI inference crosses the same copyright line as training-data collection.
- CNN joins active suits from the New York Times and Chicago Tribune in the same court.
- The targeted behavior is real-time scraping for inference, a model Perplexity's entire product depends on.
- A ruling here could set liability standards for all AI search products that answer questions using scraped publisher content.
Potential risks and opportunities
Risks
- A preliminary injunction against Perplexity's scraping pipeline, if granted, could disable core product functionality within months while the full case proceeds.
- Google AI Overviews, You.com, and Bing Copilot face expanded legal exposure if CNN's live-scraping liability theory survives a motion to dismiss and other publishers follow.
- Perplexity investors face valuation pressure if a ruling requires licensing all scraped publisher content at scale, restructuring the unit economics of its answer engine entirely.
Opportunities
- Publishers without active suits against Perplexity, including AP, Reuters, and regional broadcasters, gain leverage to demand licensing terms before a court ruling forces the issue.
- Content-licensing platforms and wire services positioned to supply pre-cleared news data pipelines to AI search companies will see accelerated inbound interest from Perplexity competitors seeking legal cover.
- IP litigation firms with established AI copyright practices are likely to see case volume grow as CNN's complaint, if it survives early motions, encourages additional plaintiff coalitions across media categories.
What we don't know yet
- Whether Perplexity's 'facts aren't copyrightable' defense has been tested in any prior AI scraping case, or whether this will produce the first ruling on that specific argument.
- The damages theory CNN is pursuing, including whether it seeks a preliminary injunction that could disrupt Perplexity's live-inference model before trial concludes.
- Whether the 17,000+ scraped items include video content and, if so, whether those claims will be treated separately given DMCA and broadcast copyright distinctions.
Originally reported by bloomberglaw.com
Read the original article →Original headline: CNN Files First TV Network AI Copyright Suit Against Perplexity, Alleging 17,000-Plus Stories Scraped