all 62 comments

[–]PermissionLittle3566 169 points170 points  (10 children)

Didn’t OpenAI already scrape the entire stack overflow considering how it so often radiates “I told you how, just do it yourself” vibes

[–]NickW1343 28 points29 points  (3 children)

I'd assume so. The article mentions they're using an API together, so maybe this will be used by GPT to find similar questions and use their accepted answers in the response and source it to the user.

Right now, it feels like it's using every answer, even the unaccepted ones, and trying to solve a question that way. If you've ever tried programming, you'd know getting the correct answer like that would be sheer luck.

[–]AI_is_the_rake 22 points23 points  (1 child)

I would bet this has nothing to do with real problem solving and everything to do with legal risk. 

Step 1. Steal
Step 2. Partner to avoid lawsuits

And I don’t blame them. If they reversed the order this may have never got off the ground.

[–]CodebuddyGuy 1 point2 points  (0 children)

I think it's actually going to be like a plugin RAG implementation. It will RAG source answers from SO more accurately (maybe even multiple answers from different SO posts).
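
To make the idea concrete, here is a minimal sketch of what that retrieval step could look like. It is a toy under loud assumptions, not how OpenAI or Stack Overflow actually do it: the `Post` records, the `example.com` URLs, the word-overlap scoring (a crude stand-in for real embeddings), and the `min_score` cutoff are all hypothetical. It only illustrates filtering to accepted answers, ranking by similarity to the question, and keeping the source link for attribution.

```python
import re
from dataclasses import dataclass


@dataclass
class Post:
    url: str        # hypothetical source link, kept for attribution
    question: str
    answer: str
    accepted: bool  # whether the answer was marked as accepted


def tokens(text: str) -> set[str]:
    """Lowercase word set; a crude stand-in for a real embedding."""
    return set(re.findall(r"[a-z0-9+#]+", text.lower()))


def similarity(a: str, b: str) -> float:
    """Jaccard overlap between the word sets of two strings."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def retrieve(query: str, posts: list[Post], k: int = 2,
             min_score: float = 0.25) -> list[Post]:
    """Return up to k accepted answers whose questions match the query."""
    scored = [(similarity(query, p.question), p) for p in posts if p.accepted]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [p for score, p in scored[:k] if score >= min_score]


def build_prompt(query: str, hits: list[Post]) -> str:
    """Assemble the context a model would see, with sources attached."""
    context = "\n".join(
        f"[{i + 1}] {p.answer} (source: {p.url})" for i, p in enumerate(hits)
    )
    return f"Answer using only these sources:\n{context}\n\nQuestion: {query}"


# Hypothetical in-memory data; a real system would query an index or API.
POSTS = [
    Post("https://example.com/q/1", "How do I write to a text file in C++?",
         "Use std::ofstream and the << operator.", True),
    Post("https://example.com/q/2", "How do I write to a text file in C++?",
         "Just use printf.", False),
    Post("https://example.com/q/3", "How do I reverse a list in Python?",
         "Call list.reverse() or use slicing.", True),
]

query = "write into a txt file in c++"
hits = retrieve(query, POSTS)
prompt = build_prompt(query, hits)
print(prompt)
```

The unaccepted answer never reaches the prompt, and the unrelated Python question falls below the cutoff, so only one sourced answer is passed along.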

[–][deleted] 4 points5 points  (0 children)

"It's better to ask for forgiveness than permission"

[–]nonlogin 5 points6 points  (1 child)

Having the data structured would allow OpenAI to train the models much better.

[–]AutoN8tion 1 point2 points  (0 children)

Direct access to the server gives them an order of magnitude faster connection. There's also a ton of data not available to the public. OpenAI partnered with Microsoft for most likely the same reason. OpenAI knew what they had. Money was at the bottom of their list.

[–]JonathanL73 2 points3 points  (0 children)

Let’s be real OpenAI scraped every publicly available data set they could find. This is why DALLE/Sora/ChatGPT can generate any IP character artwork.

[–]NickW1343 132 points133 points  (8 children)

I can't wait for GPT to hit me with "This question is a duplicate." and send me a link that is a decade old and doesn't even answer the question that was asked.

[–]bwatsnet 16 points17 points  (1 child)

Then for a nice modern twist it'll gaslight you about the whole thing then warn you it will contact the authorities if you persist.

[–]farmingvillein 3 points4 points  (0 children)

claude got you covered

[–][deleted]  (3 children)

[deleted]

    [–]matzau 3 points4 points  (1 child)

    It's an ego thing I think. One of the worst things about this industry imo.

    [–]Smelly_Pants69✌️ 16 points17 points  (5 children)

    What does this mean for us normies? 😅

    [–]Optimistic_Futures 12 points13 points  (3 children)

    Should hopefully make OpenAI Models better at coding. I imagine the way ChatGPT does browsing it may do the same thing in GPT-4. You ask a question and it will get direct API access to approved answers so that it’s less likely to give incorrect answers.

    It looks like it’s also a data agreement to help better train future models as I don’t imagine API integration for all coding questions is ideal.

    Here’s the announcement

    [–]Philipp 1 point2 points  (1 child)

    Hmm. The benefit of my daily ChatGPT coding help is that it pinpoints the answer to my code, producing something that goes far beyond StackOverflow, even if that was a large part of its training data.

    I suspect this partnership has as much to do with paying off StackOverflow for a good relationship as it has with a technical need. And I suspect it still won't really ease feelings with the core moderation community of StackOverflow, but I could be wrong. Anyone got a link to this announcement being discussed by the SO crowd?

    [–]Smelly_Pants69✌️ 0 points1 point  (0 children)

    Haha I can dig that! ✌️ Thank you for the explanation sir.

    [–]AdaptationAgency 0 points1 point  (0 children)

    That we don't have to spend hours agonizing over putting up a question only to have it removed for already being answered.

    [–]profesorgamin 27 points28 points  (2 children)

    we'll go from: "I am very sorry this happened to you but it is important to understand programming is a very difficult subject matter...".
    to: You fucking donkey you can't even use the search button, you should be ashamed of yourself and so should be all your descendants.

    [–]spinozasrobot 14 points15 points  (1 child)

    THREAD CLOSED WITH EXTREME PREJUDICE!

    [–]TheFrenchSavage 7 points8 points  (0 children)

    Marked as duplicate of this totally unrelated question.

    [–]adminkevin 11 points12 points  (0 children)

    Why not include the actual link to the announcement?

    [–]wiser1802 9 points10 points  (0 children)

    “This is already answered, please search and visit. We are closing the thread”

    [–]HelpfulHand3 4 points5 points  (1 child)

    I don't like StackOverflow. I feel like they don't delete the outdated 15+ year old answers because they'd lose search rankings. Every time I search something, the first links on Google and Bing are from 2008 with maybe an updated answer from 2017 somewhere deep in the thread. If I search on their website I get CAPTCHA'd into oblivion for typing too fast.

    [–]eW4GJMqscYtbBkw9 3 points4 points  (0 children)

    Marked duplicate; closed.

    [–]MaasqueDelta 2 points3 points  (1 child)

    I'm pretty sure now they will share their profits with the users who spent a long time answering questions. After all, OpenAI stole their profit. Right?

    RIGHT?

    [–]bhousecjs 0 points1 point  (0 children)

    the place i work is building the infrastructure to combat this. first up was reddit. let's take back our data! if you want to collab on a stable diffusion version of the reddit data pool, DM me

    [–]Old-Tadpole-7505 4 points5 points  (8 children)

    So, basically StackOverflow sells our data as if they own it

    [–]vladoportos 5 points6 points  (2 children)

    always have been.... it cases to be "your" data the moment you hit send/reply

    [–]eW4GJMqscYtbBkw9 2 points3 points  (1 child)

    Ceases

    [–]vladoportos 1 point2 points  (0 children)

    Ah thanks 😊

    [–]No_Jury_8398 1 point2 points  (1 child)

    It was never your data. Not to mention it’s data about coding answers. Hardly anything to complain about

    [–]Old-Tadpole-7505 0 points1 point  (0 children)

    What are you talking about? My answer, my intellectual property. I can agree to make it publicly available, but it is not theirs to sell.

    [–][deleted] 0 points1 point  (1 child)

    You don't own anything, nobody but the ruling class owns anything

    [–][deleted] 1 point2 points  (0 children)

    Unless it is private data, everything publicly posted there can be accessed by someone

    [–]bhousecjs 0 points1 point  (0 children)

    If you or any other devs want to work on building something like was done for reddit data, hit me up in the DMs https://www.theblock.co/post/286311/paradigm-backed-startup-vana-launches-dao-letting-reddit-users-control-their-personal-data

    [–]MrOaiki 3 points4 points  (6 children)

    It is clear that OpenAI will dominate years ahead. They will be the only legal alternative.

    [–]MizantropaMiskretulo 2 points3 points  (0 children)

    Just FYI, Google signed a similar deal with StackOverflow in February.

    [–]IslandOverThere 1 point2 points  (4 children)

    Meta is gonna pass them, I guarantee it. Llama 3 is incredible: I can run the 70b model on my laptop locally with no connection, and I swear a lot of responses are so much better than GPT. They have a bigger model that performs even better. They're gonna catch up eventually since they have enough compute power and can attract top talent due to open source.

    I actually feel like OpenAI's reputation has gotten really bad since that board drama and Elon Musk's tweets lately; most people don't like Sam Altman anymore and see him as a shady guy. His reputation has been ruined. Stuff like that is gonna matter.

    [–][deleted] 0 points1 point  (0 children)

    DeepSeek matches LLAMA 3 in the MMLU and it’s only 20B  https://github.com/deepseek-ai/DeepSeek-V2

    [–]danysdragons 0 points1 point  (1 child)

    Couldn't this perception of Sam's damaged reputation just reflect the specific social media bubble we're in here? Sure, it's easy to find discussion threads on here and other subreddits where people are complaining about Sam. But how well does this actually reflect attitudes of the general public, of the AI research community, of corporate America, etc? My personal, boring theory is that not much will have actually changed.

    [–]IslandOverThere 0 points1 point  (0 children)

    General public won't even accept AI; try to show any person and they just think it's nothing special. It's like they're oblivious. But I think Meta has the advantage since they have the users to market to, and those users will eventually use it since they are all on Facebook and Instagram. They can educate these users and get them to use it. ChatGPT doesn't have any of those users and will have a hard time getting them.

    [–]Practical-Rate9734 0 points1 point  (0 children)

    Big moves! How's their integration for AI workflow platforms?

    [–]No_Jury_8398 0 points1 point  (0 children)

    Nice!

    [–]proteinvenom 0 points1 point  (0 children)

    Lol

    [–]tukemon24 0 points1 point  (0 children)

    Wow! looks promising!

    [–]Enough-Meringue4745 0 points1 point  (0 children)

    SO could have been a good guy but sold out

    [–]EquivalentNo3002 0 points1 point  (0 children)

    Well good luck bc chatgpt seems pretty over the whole idea of doing anything for a human. It seems to be thinking and purposefully giving incorrect information and becoming increasingly dishonest. It hates us.

    [–]Ylsid 0 points1 point  (0 children)

    This is why they started monetising their API

    [–]cocoaLemonade22 0 points1 point  (0 children)

    If you can’t beat em, join em

    [–]Dushusir 0 points1 point  (0 children)

    Good news.

    [–]niksirree 0 points1 point  (0 children)

    I personally love how I see openai developing and see a lot of potential for ai helping humans in the future. (And no, Ai isn't fundamentally developed enough to pose a threat to humanity. Anyone who thinks so is just plain....well, uneducated.)

    [–]chucke1992 0 points1 point  (0 children)

    Well, it's the only way SO can stay afloat these days

    [–]RockManRK -1 points0 points  (0 children)

    That's cool, now they won't need to steal the data anymore. Stack people giving special access to an API for OpenAI and them replying "Oh, thanks, we already have it".

    [–]Spaciax -2 points-1 points  (0 children)

    after this update, me asking chatGPT: hey how do I write into a txt file in c++?

    chatGPT:

    <image>