Massive language fashions (LLMs) depend on having plenty of good-quality knowledge on which to coach. Concerning builders, few organizations have as a lot knowledge as Stack Overflow, a number one on-line knowledge-sharing platform utilized by greater than 100 million builders each month.
In the present day Stack Overflow introduced a partnership with Google Cloud to convey superior synthetic intelligence (AI) capabilities to tens of millions of builders worldwide. A key a part of the partnership includes integrating Stack Overflow’s information base into Google Cloud’s AI instruments like Gemini and the Cloud Console. This can give builders entry to related solutions, code snippets and documentation surfaced by Stack Overflow’s neighborhood. The partnership is indicative of a rising pattern amongst LLM distributors, together with OpenAI to strike up offers with content material suppliers to assist inform generative AI coaching efforts.
The combination of the information base is enabled through the brand new OverflowAPI, which sooner or later may additionally be utilized by different LLM suppliers.
“Today Stack Overflow is launching a new program that will give AI companies access to its knowledge base through a new API,” Prashanth Chandrasekar, CEO of Stack Overflow, informed VentureBeat. “The launch partner for this is Google, which will use Stack Overflow’s data to enrich Gemini for Google Cloud and provide validated Stack Overflow answers in the Google Cloud console.”
VB Occasion
The AI Influence Tour – NYC
We’ll be in New York on February 29 in partnership with Microsoft to debate the way to steadiness dangers and rewards of AI functions. Request an invitation to the unique occasion beneath.
Request an invitation
What the Overflow API will convey to each Google and Stack Overflow
Google having access to the large quantities of knowledge out there on Stack Overflow is a worthwhile alternative, although it’s not fully clear simply how worthwhile. Chandrasekar declined to touch upon the monetary phrases of the Google Cloud partnership.
Chandrasekar defined that by means of the OverflowAPI, Google now has steady entry to the APIs that pull public knowledge from Stack Overflow. These APIs allow entry to the identical knowledge out there to the Stack Overflow neighborhood through its public APIs. This contains over 58 million questions and solutions, tens of millions of person feedback and put up metadata equivalent to votes and edits.
The partnership just isn’t a one-way avenue both. Stack Overflow can be adopting Google Cloud know-how extra broadly transferring ahead. Stack Overflow will now be utilizing Google Cloud as “the platform of choice” in response to the corporate as a bunch for its public-facing platform. Precisely what applied sciences and providers are being adopted remains to be being labored out.
It’s additionally vital to notice that the Google partnership and entry to the OverflowAPI don’t preclude Stack Overflow from working with different LLM suppliers.
“This is not exclusive to Google nor does Google have access to proprietary Stack Overflow data, customer data on any product at Stack, or any user personal information as part of this partnership,” Chandrasekar mentioned.
How the brand new OverflowAPI compliments OverflowAI
The brand new partnership with Google is hardly Stack Overflow’s first foray into the world of gen AI.
In July 2023, Stack Overflow introduced its OverflowAI effort. Chandrasekar mentioned that the brand new API enhances the OverflowAI know-how. He defined that OverflowAI is the overarching time period utilized by Stack Overflow to explain initiatives that introduce new AI/machine studying (ML) capabilities and options to Stack Overflow for Groups and the general public platform. Examples of OverflowAI initiatives which can be a part of the Stack Overflow for Groups providing embody Stack Overflow for Visible Studio Code, Enhanced Search and Auto-answer App for Slack.
In distinction, OverflowAPI is an API service that gives steady entry to Stack Overflow’s public dataset to coach and fine-tune massive language fashions.
“Our goal with the introduction of OverflowAI last summer was to ensure developers are not only contributing to the foundation of what GenAI is today, they are also an integral part of building its future,” Chandrasekar mentioned. “For today’s news, this is about the most developer friendly cloud joining forces with the most popular developer knowledge platform in the world.”
VentureBeat’s mission is to be a digital city sq. for technical decision-makers to achieve information about transformative enterprise know-how and transact. Uncover our Briefings.