AI The Courts News

BBC Threatens Legal Action Against Perplexity AI Over Content Scraping

Ancient Slashdot reader Alain Williams shares a report from The Guardian: The BBC is threatening legal action against Perplexity AI, in the corporation's first move to protect its content from being scraped without permission to build artificial intelligence technology. The corporation has sent a letter to Aravind Srinivas, the chief executive of the San Francisco-based startup, saying it has gathered evidence that Perplexity's model was "trained using BBC content." The letter, first reported by the Financial Times, threatens an injunction against Perplexity unless it stops scraping all BBC content to train its AI models, and deletes any copies of the broadcaster's material it holds unless it provides "a proposal for financial compensation."

The legal threat comes weeks after Tim Davie, the director general of the BBC, and the boss of Sky both criticised proposals being considered by the government that could let tech companies use copyright-protected work without permission. "If we currently drift in the way we are doing now we will be in crisis," Davie said, speaking at the Enders conference. "We need to make quick decisions now around areas like ... protection of IP. We need to protect our national intellectual property, that is where the value is. What do I need? IP protection; come on, let's get on with it."
"Perplexity's tool [which allows users to choose between different AI models] directly competes with the BBC's own services, circumventing the need for users to access those services," the corporation said.

Perplexity told the FT that the BBC's claims were "manipulative and opportunistic" and that it had a "fundamental misunderstanding of technology, the internet and intellectual property law."

  • by Shades72 ( 6355170 ) on Friday June 20, 2025 @09:42PM (#65464877)

    Everyone's IP is up for grabs, until everyone starts to grab Perplexity's IP, in the eyes of Perplexity. Sure, they can hide behind "not understanding IP law", or "not understanding internet" or whatever. What Perplexity and all other LLM providers do not grasp is "content ain't free, so either pay up or shut up".

    You would think that people smart enough to build and/or deploy LLMs would have no trouble understanding that basic truth. Of course, that is not what they promised to their shareholders, who want to see returns quickly for the ungodly amounts of money the LLM builders/deployers usurp from their shareholders/investors and/or their own greed.

    Because all of these LLM builders have shown, when the rubber hits the road, that they are as greedy as any psychopath dares to dream of being. And more.

    And they have no qualms about using the argument "But China will...", in much the same way that police and politicians overuse "Against terrorism" and "Think of the children...". It is getting a bit long in the tooth. These companies are absolutely unwilling to play by anyone's book but their own, while copying all their homework from your homework, and then claiming you copied their work instead.

    Articles and comments like the ones mentioned here, uttered by Perplexity personnel, give me the impression that they are exactly like petulant children, flabbergasted at receiving demands for (fair) compensation from the people/organizations whose work they steal as their reason for existing.

    All of the commercial LLM providers can (and should) be accused of the same behavior, as Perplexity sure isn't the only one.

    Also, they didn't take the hint they got from DeepSeek R1 either: be more efficient, instead of demanding more power (energy) and liberty (copyright) for their data hunger. Efficiency has never been the forte of U.S. companies, only "more, more, more". LLM providers from the U.S. can't help themselves; this is too ingrained in their DNA, I suppose.

    And yet, if they would be inclined to go the way of efficiency, then all the money they would save on hardware and the energy this requires could then be used to (fairly) compensate those making the content/"AI feed". That would be a win-win for everyone involved. Probably too simplistic of an idea for these LLM providers, especially since no AI/LLM was used to think of it, just a tiny lick of common sense.

    • Let BBC hold copyright over their expression while Perplexity serves an analysis from multiple sources, which is not identical to any one of them and provides a transformative output.
    • What Perplexity and all other LLM providers do not grasp is "content ain't free, so either pay up or shut up".

      It's a good reminder though, that copyright is an artificial construct created by governments.

      It's been around so long, and grown to such monstrous proportions, that we tend to think of it as a natural, real thing. But it's really more like tax policy than (say) the strictures against murder.

      There's nothing intrinsically wrong with the UK government contemplating adjusting it in the public interest. UK subjects do benefit from using tools like Perplexity, you know.

      • The two aren't actually so different. You do get to make economic arguments a lot more openly about copyright (while, when it comes to killing, we normally make them relatively quietly and circumspectly when the unpleasant matter of what risks to the public are just part of The March of Progress and which ones are negligent or reckless comes up. We prefer not to talk about it; and have some proxies like 'VSL/ICAF' to help; but we do it); but the classifications are ultimately a policy thing and open to amend
  • I guess. (Score:4, Funny)

    by NewtonsLaw ( 409638 ) on Friday June 20, 2025 @10:23PM (#65464911)

    They should have paid their TV license eh?

    Hahah!

  • by cstacy ( 534252 ) on Saturday June 21, 2025 @01:04AM (#65465067)

    I thought those folks at the BBC loved regeneration?

  • AI leeching off other people's work and not providing attribution or compensation
  • Free account for human, personal, non-commercial usage, negotiable subscription price for all businesses. Give Google search a subscription for $1 a year if you can properly license the usage for search only and not AI. Perplexity price will be much, much higher of course. This would be no different than a lot of products nowadays. It doesn't even have to be ultra secure enforcement, any scraper hacking it can be sued under DMCA for some outrageous compensation (say $1M per scraped page, maybe more). Sue th
  • Meta's AI memorized 42% of first Harry Potter book [perplexity.ai]

    “A new study reveals that Meta's latest artificial intelligence model can reproduce nearly half of the first Harry Potter book from memory, raising fresh concerns about copyright infringement in AI training as the company faces mounting legal pressure from authors and publishers.”
    • Meta's AI memorized 42% of first Harry Potter book [perplexity.ai]

      “A new study reveals that Meta's latest artificial intelligence model can reproduce nearly half of the first Harry Potter book from memory, raising fresh concerns about copyright infringement in AI training as the company faces mounting legal pressure from authors and publishers.”

      So ... the argument here is that somebody isn't going to buy the book - or check it out of the library - because Grok might be induced to cough up 42% of it? (Presumably in suspect chunks - how would the user know whether they are accurate anyway?)

      Seriously?

  • Lol, misunderstanding of the internet? BBC.co.uk was an early domain, surviving many a DDoS.

    Think the BBC has a very large domain portfolio, so many dramas mention site names and they have to register it before someone puts a goatse on it.

    BBC peer with a lot of ISPs, they know the internet pretty damn well.

    • At least half the fun of being a 'disruptive innovator' is being able to treat your abject ignorance of history as a strength rather than a deficiency.
  • Perplexity uses a few external models and a model that is based on R1. They do not train on BBC content, but they process fetched data to create summaries. That's covered by copyright law (and possibly free speech laws) as long as they do not provide originals verbatim.

  • So, you can teach a university course to train humans using BBC's content, but ... doing the same with an AI is illegal?
