Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

How OpenAI's bot crushed this seven-person company's web site ‘like a DDoS attack’ | TechCrunch

Jan 10, 2025 - techcrunch.com
Triplegangers, a company specializing in 3D image files of human models, experienced a distributed denial-of-service attack when OpenAI's bot attempted to scrape its extensive ecommerce site, which hosts over 65,000 products. The bot made tens of thousands of server requests, overwhelming the site and causing it to crash. Despite having terms of service prohibiting unauthorized data scraping, Triplegangers had not configured its robot.txt file to block OpenAI's bots, which led to the incident. The company faced increased AWS costs due to the bot's activity and struggled to determine what data was taken, as OpenAI did not respond to requests for comment.

After configuring a proper robot.txt file and setting up a Cloudflare account to block unwanted bots, Triplegangers managed to stabilize its site. However, CEO Oleksandr Tomchuk expressed concerns about the lack of communication from OpenAI and the absence of an opt-out tool for data scraping. The incident highlights the challenges small businesses face in protecting their data from AI crawlers, which often exploit loopholes in the system. Tomchuk warns other businesses to actively monitor server logs to detect unauthorized scraping, as AI bots can significantly impact site performance and incur additional costs.

Key takeaways:

  • Triplegangers experienced a distributed denial-of-service attack due to OpenAI's bot scraping their site, causing significant server strain and potential financial costs.
  • The company lacked a properly configured robot.txt file, which allowed OpenAI's bot to scrape their data without restriction, highlighting the importance of correctly setting up these files.
  • Despite implementing measures to block unwanted bots, Triplegangers still struggles to determine what data was taken and how to contact OpenAI for resolution.
  • The incident underscores the broader issue of AI bots scraping data without explicit permission, placing the burden on businesses to protect their content.
View Full Article

Comments (0)

Be the first to comment!