Perplexity accused of violating website scraping restrictions

Bitcoin reaches new all-time high of over $118,000 within 24 hours

Bitcoin reached a new all-time high of $118,900 on Friday, surpassing its previous record of $113,822 set on Thursday. As Read more

Conveyor Revolutionizes Vendor Security Reviews and RFPs with AI

Selling software to companies can be a daunting task, especially when it comes to meeting security requirements. Chas Ballew, founder Read more

Ready-made Stem Cell Therapies in Development for Pets

Earlier this week, San Diego startup Gallant announced $18 million in funding to bring the first FDA-approved ready-to-use stem cell Read more

Elon Musk’s Dodgy Election Claims Have Gone Viral with 2 Billion Views on X

The world’s richest man buys out one of the most popular social media platforms and uses it as a propaganda Read more

AI startup Perplexity has been found to be crawling and scraping content from websites that explicitly stated they do not want to be scraped, as reported by internet infrastructure provider Cloudflare.

Cloudflare’s Research Findings

According to Cloudflare’s research published on Monday, Perplexity was observed ignoring blocks and attempting to hide its crawling and scraping activities. The accusations claim that Perplexity obscured its identity while trying to scrape web pages in order to bypass the websites’ preferences.

Circumventing Blocks

Perplexity is accused of circumventing blocks by changing its bots’ “user agent” and autonomous system networks (ASN). Cloudflare observed this activity across tens of thousands of domains and millions of requests per day, using a combination of machine learning and network signals to identify the crawler.

Response and Actions Taken

See also  AI Coding Tools Might Not Make Every Developer Faster, Study Finds

Perplexity’s spokesperson dismissed Cloudflare’s claims as a “sales pitch” and denied that the bot mentioned in the blog post belonged to them. Despite this, Cloudflare took action by delisting Perplexity’s bots and implementing new techniques to block them.

Cloudflare’s Stance on AI Crawlers

Cloudflare has recently taken a public stance against AI crawlers, launching a marketplace to allow website owners to charge AI scrapers for visiting their sites. The company’s CEO, Matthew Prince, has expressed concerns about AI disrupting the internet’s business model, particularly for publishers. Additionally, Cloudflare has released a free tool to prevent bots from scraping websites to train AI.

Previous Allegations Against Perplexity

This is not the first time Perplexity has been accused of unauthorized scraping. Last year, news outlets, including Wired, accused Perplexity of plagiarizing their content. During an interview at the Disrupt 2024 conference, Perplexity’s CEO was unable to provide a clear definition of plagiarism when questioned by TechCrunch.

California AI Student Loses Visa – Exclusive Interview with the Candidate

EVgo secures $1.25B loan for expanding EV charging network