subscribe Support our award-winning journalism. The Premium package (digital only) is R30 for the first month and thereafter you pay R129 p/m now ad-free for all subscribers.
Subscribe now
The News Corporation building in New York, the US, September 21 2023. Picture: REUTERS/BING GUAN
The News Corporation building in New York, the US, September 21 2023. Picture: REUTERS/BING GUAN

Los Angeles/New York — Media baron Rupert Murdoch’s Dow Jones and New York Post filed a lawsuit against Perplexity AI on Monday, claiming the artificial intelligence start-up engages in a “massive amount of illegal copying” of their copyrighted work.

The lawsuit is the latest salvo in a bitter ongoing battle between publishers and tech companies over how the latter may use copyrighted content without authorisation to build and operate their AI systems.

“This suit is brought by news publishers who seek redress for Perplexity’s brazen scheme to compete for readers while simultaneously freeriding on the valuable content the publishers produce,” according to the lawsuit filed in New York by Wall Street Journal parent Dow Jones and the NY Post.

Perplexity did not immediately respond to emails from Reuters seeking comment.

The AI company is among the leading start-ups attempting to uproot the search engine market dominated by Alphabet’s Google. It assembles information from web pages it deems to be authoritative, then provides a summary directly within Perplexity’s own tool.

Perplexity uses a variety of large language models (LLMs) to generate its summaries, from OpenAI to Meta’s open-source model Llama. It provides citations in those results, though Perplexity’s own marketing promotes the notion that its interface enables users to “skip the links”.

Google likewise now shows AI-generated summaries similar to those offered by Perplexity, though most publishers grudgingly accept that arrangement because opting out would also mean having their content removed from Google’s search results, which would render them virtually invisible online.

The news publishers seek to differentiate Perplexity from search engines, which they argue allow for the discovery of their work, not a substitution for it, according to the lawsuit.

In the suit, the News Corp-owned publishers say their journalists investigate and write stories under tight deadlines and unpredictable circumstances. There is high demand for high-quality news presented in a timely, digestible format, and these publications rely on the sale of advertising and subscriptions to underwrite the cost of journalism, they argue.

The news organisations allege Perplexity’s AI-generated “answer machine” has ingested its copyrighted news stories, analysis and opinion in an internal database used to generate responses to users’ questions.

In its quest to provide answers, Perplexity copied “vast” quantities of the publishers’ work into a database, which uses an AI technique known as retrieval-augmented generation (RAG) to provide answers to users’ queries, the suit alleges.

Perplexity formulates its responses in a way that, at times, reproduces the content verbatim, the news organisations claim. The suit alleges these actions constitute an unlawful copyright infringement.

“Perplexity perpetrates an abuse of intellectual property that harms journalists, writers, publishers and News Corp,” News Corp CEO Robert Thomson said in a statement.

In July, Dow Jones and the New York Post sent a letter to Perplexity notifying it of the legal issues raised by its unauthorised use of copyrighted works, and offering to discuss a potential licensing deal. The company did not respond, according to the suit.

The news organisations are asking the court to stop Perplexity from using its news articles as the basis for providing answers to questions, and to order the destruction of any database using its copyrighted work.

With its lawsuit, News Corp is joining the ranks of multiple publishers that have sued AI companies for copyright infringement over their use of content without authorisation, both to train algorithms and to generate summaries of real-time information.

Earlier this month, New York Times sent Perplexity a “cease and desist” notice demanding it to stop using the newspaper’s content for generative AI purposes.

Perplexity has also faced accusations from media organisations such as Forbes and Wired for plagiarising their content, but has since launched a revenue-sharing program to address some concerns put forward by publishers.

Some publishers are signing licensing agreements with AI companies open to paying for content, though the sides often disagree over the value of the materials. Many AI developers argue they have broken no laws in accessing them for free.

In May, News Corp announced it had struck a multiyear partnership with OpenAI, with Thomson applauding the tech company for understanding “that integrity and creativity are essential” to realise the potential of AI.

While Perplexity has drawn the most scrutiny for its practices, it is not alone among AI companies in circumventing a common web standard used by publishers to block the scraping of their content, content licensing start-up TollBit told publishers over the summer.

Reuters 

subscribe Support our award-winning journalism. The Premium package (digital only) is R30 for the first month and thereafter you pay R129 p/m now ad-free for all subscribers.
Subscribe now

Would you like to comment on this article?
Sign up (it's quick and free) or sign in now.

Speech Bubbles

Please read our Comment Policy before commenting.