Last year Stack Overflow became one of the first websites to announce it would charge AI giants for access to content used to train chatbots. Now the popular Q&A service for coders has signed up its first customer, Google, in what CEO Prashanth Chandrasekar says is the start of a “significant” new stream of revenue.
The deal is significant because it remains unclear how widely Google and other AI developers will pay for content needed for AI projects. Millions of books and websites have fueled the development of AI systems, but most publishers haven’t been compensated, and some are suing over what they allege is misuse. Many publishers, including Stack Overflow, appear threatened by ChatGPT and other generative AI products, which can answer queries that would previously have sent coders their way.
The deal will see Google’s cloud division use questions and answers from Stack Overflow about Google Cloud services to provide coding assistance and technical support through a version of Google’s Gemini chatbot. Google’s cloud computing customers will also be able to ask questions through Google Cloud’s command-line interface. “Their AI may not have all the answers, and so we have a huge ability to help complete that loop,” Chandrasekar says. “We’re the biggest place where community knowledge is curated and validated.”
Gemini will summarize answers drawn from Stack Overflow in its own words but include the company’s logo, a link back to the original material, and the username of the site contributor who supplied it. The companies plan to show the system at Google Cloud Next, the search company’s annual cloud conference in April, and launch it soon after.
Chandrasekar says there are no significant restrictions on how Google Cloud can use Stack Overflow data, meaning it can be used to train large language models and other AI systems. “Where we want to stand firm on is, nonnegotiable things for us, trust, accuracy, quality, and attribution back to the sources of these AI outputs,” he says.
He declined to say how much Google is paying Stack Overflow for the data. “This might be a significant commercial offering for us in the near term, medium term, and long term,” Chandrasekar says.
Covert Scraping
Google and other AI developers have previously gathered data from Stack Overflow and other websites without much notice. As demand for generative AI technologies has surged, and the valuations of the companies developing them have rocketed, the websites supplying the foundational text have begun demanding what they view as their fair share. Fortunately for Stack Overflow, potential customers have heeded the message, Chandrasekar says. “We’re not having to chase people,” he says.
Stack Overflow data is especially valuable to AI systems that generate computer code, which have proven popular with software engineers and have become a big source of revenue for Microsoft and OpenAI.
The new Stack Overflow deal comes just a week after Google reached a licensing agreement to vacuum up data from Reddit, the discussion forums operator, whose content has helped chatbots’ ability to converse. Reddit had unveiled plans to start charging for data access just before Stack Overflow did last year.