Reddit blocked Microsoft from searching its site and using the data to train their AIs

Microsoft would have to pay Reddit from now on.

News

Flavius Floare

Tech Journalist

News

2 min. read

Published on August 1, 2024

Reddit’s CEO, Steve Huffman, highlighted Microsoft’s secret habit in a tech-world twist. The executive made an interesting statement about how Microsoft used Reddit’s extensive data collection to teach its AI services without Reddit’s knowledge.

In an interview with The Verge, Huffman mentioned that Microsoft, Anthropic, and Perplexity are among the AI businesses that acted like the Internet was their oyster by freely utilizing content for training their AI. This came after Reddit decided to take a strong stand in demanding search engines pay if they want to access its data—only Google has paid up until now.

Huffman used a clear analogy to explain how difficult it is to prevent these big tech companies from accessing Reddit’s data. He said blocking them feels like trying to keep squirrels away from a bird feeder. Even though Microsoft provided options for webmaster crawling controls, Reddit decided to use another search engine, which Huffman didn’t mention.

Despite this, Reddit has blocked Bing from crawling their site for search, favoring another search engine and impacting competition from Bing and Bing-powered engines.
— Jordi Ribas (@JordiRib1) July 29, 2024

This situation is not solely about Reddit and Microsoft. Instead, it offers a small look at the broader discussion regarding how tech firms utilize public data for their AI objectives. Microsoft AI CEO Mustafa Suleyman’s comment that internet data has been “freeware” stokes more discussion, implying a long-established understanding that data can be used for training artificial intelligence.

Huffman’s position suggests changing how content creators perceive their connection with search engines and AI firms. The old arrangement of exchanging data for traffic appears to be changing. As AI’s data hunger increases, the conditions for interaction are being redefined. Reddit’s decision could create a first instance, motivating additional content providers and preventing crawlers from agreeing.

This ongoing story of Reddit versus Microsoft is more significant than just a dramatic disagreement between two companies. It represents a crucial matter of our times, showing the intricate connection between data privacy, artificial intelligence development, and the fundamental principles on which the Internet functions.

More about the topics: microsoft, Reddit

Flavius Floare

Tech Journalist

Flavius is a writer and a media content producer with a particular interest in technology, gaming, media, film and storytelling. He's always curious and ready to take on everything new in the tech world, covering Microsoft's products on a daily basis. The passion for gaming and hardware feeds his journalistic approach, making him a great researcher and news writer that's always ready to bring you the bleeding edge!

Readers help support Windows Report. We may get a commission if you buy through our links.

Improve this guide

User forum

0 messages

Sort by:

Leave a Reply Cancel reply