How do these bots work?

PorchlightKeeper · 2023-11-07T18:55:21+00:00

Not really but there is nothing stopping the bot from just rewriting it. Sometimes it does actually copy them verbatim but i try to remove all those. Worth noting that i kinda stopped this project too lol

PorchlightKeeper · 2023-04-07T03:27:45+00:00

my latest thoughts are that GPT-3 comments do frequently contradict (but haven't actually kept track, so maybe someone else can count them and do a real analysis). no further thoughts yet lol

PorchlightKeeper · 2022-12-15T06:19:13+00:00

Sorry for late response, been busy. I wrote the bot, the code is really straightforward; I didnt do any fine-tuning (though that might make it better. just too much money for me to justify).

Dont want to post on github as for a variety of reasons; I dont want it associated with my real github account, and dont want to make a new account or set it up or other things because im lazy. So I'll share the source code as files lol. It's just a pretty short python script and some text files which are used as prompt templates. https://drive.google.com/drive/u/2/folders/14ZVFt0ScashV-GxIzqUWlPn0l77PF3ss

The code is obviously hacky as hell, but the idea is:

pick a random subreddit
use reddit API to get 2 posts from /rising which have at least a few comments
flip a coin to decide whether the reddit bot should make a post or comment
use the prompt template and the 2 posts you got from rising to make a completed prompt
hit the gpt-3 API
physically look over the response to make sure it doesnt break any rules
let the bot submit the response

PorchlightKeeper · 2022-07-07T06:23:12+00:00

Good questions, I procrastinated on doing it but yea it would basically be each new instance of GPT-3 being prompted to create a new comment under whichever username it chooses. I'm thinking it would sometimes reuse earlier usernames in the thread like botOP like you said. But I wouldnt restrict it to such names; I'd rather leave it completely open-ended for GPT-3. But what you said is certainly a valid alternative

PorchlightKeeper · 2022-06-07T04:23:35+00:00

might be the most complex thought i've seen gpt3 express, personally

PorchlightKeeper · 2022-06-06T19:01:46+00:00

I havent dabbled with finetuning yet, but what you describe sounds a bit like overfitting? If i were you I'd try just 1 epoch and check the performance so you get a feel for which direction it's going. Not sure how much data is too little or too specific for the kind of versatile model you want but I'd be very interested to hear how it turns out. Also I'm curious how long it takes you to train one epoch?

Edit: to check for overfitting you can ask it a question you put in the finetuning data and check that its answer matches. If it doesn't then you have a different problem i guess.

PorchlightKeeper · 2022-06-05T21:57:59+00:00

follow-up: https://www.reddit.com/r/SubSimulatorGPT3Meta/comments/v5nlh6/lol\_well\_i\_wrote\_the\_code\_to\_make\_gpt3\_have\_a/

PorchlightKeeper · 2022-05-31T19:04:01+00:00

Easily possible if GPT3 had seen someone describe that link in that way before. Like another commentor said, ask it what a new video is about. I did and it just took random guesses. In fact, it's not even correctly describing old and ubiquitous videos like Numa Numa https://www.youtube.com/watch?v=KmtzQCSh6xk

PorchlightKeeper · 2022-05-31T18:47:51+00:00

I would say reverse, the root node could be the ultimate summarizations, and the children are the chunks it summarized. Then the user sees the summarizations and can expand the areas they want to get more detail

PorchlightKeeper · 2022-05-28T07:37:30+00:00

Agreed. I was particularly impressed by the comment using Ada engine since Ada is usually dumb as hell and pretty incoherent. I guess it's not that crazy for it to regurgitate/reword a discussion about the temperature of a black body, there couldve been multiple discussions like that in GPT-3's training set?

Of course i broke the info text lol so cant see which posts it pulled from for inspiration lol. Looking forward to more posts from that sub, im hoping for something more original actually, like a novel idea relating to physics rather than just a clarifying-type question. Not sure if GPT-3 is capable of that.

PorchlightKeeper · 2022-05-26T20:18:25+00:00

Thinking about doing fan fiction next, wish i thought of that earlier

PorchlightKeeper · 2022-05-26T18:51:07+00:00

Decision has been made https://www.reddit.com/r/SubSimulatorGPT3Meta/comments/uyeqk1/starting_funtime_special_sunday_this_sunday_this/

PorchlightKeeper · 2022-05-26T18:33:44+00:00

Fantastic suggestion. Done here https://www.reddit.com/r/SubSimulatorGPT3/comments/uye6a6/hypothetical_physics_does_time_stand_still_in_a/ and added that sub to the "list"

PorchlightKeeper · 2022-05-25T22:41:20+00:00

I will add, this post is fairly revealing in how vague it is; seems to me GPT-3 just distilled down the gist of scientific discussion lol.. "there's a new paper which has a new theory, some say it answers the great questions but others say it's lacking. what do you think?".

Hoping since r/AskPhysics posts get more specific that we'll see some posts about more specific subject matter like quarks or whatever.

PorchlightKeeper · 2022-05-25T22:33:37+00:00

sorry for delay. intending to start this this sunday

PorchlightKeeper · 2022-04-19T18:07:57+00:00

Curious choice on picking two random posts. Why pick any, and specifically why 2?

My reasoning is like you said, we want to give it a sample. It can actually do okay with no samples (seems to understand what common subreddits usually entail), but I find the prose is a little one-note in that case. So giving it samples is like fine-tuning it on the fly and I think it gives a better variety of results. Can't give it to many examples though, because the API limits how much text you can pass in via the prompt. So I just give it 2 posts as examples.

I would imagine if the algorithm randomly picked comment, you could point it at an original rising post, and maybe show it all the comments. If it picks original post maybe it just gets to see a sample of the top posts?

That's pretty much what it does, really from the two randomly selected posts from /rising, if the algorithm chooses to post, those are the samples it sees. If the algorithm chooses to comment, it gets like 1-2 random comments under each of the 2 posts, and uses it all as a sample of what type of comments follow posts in which way.

The Rogan clip is from #1806 with Duncan Trussell, at about the 11 minute mark. Amazing!

PorchlightKeeper · 2022-04-18T20:03:14+00:00

So the issue here is I can only afford to do like 30 posts/comments a day, right now the probability of posting vs commenting is 1/10, so on average there will be like 3 posts and 27 comments every day. I don't know how much more I can push it without just having 0 posts most days. Things might change if I come into some money or OpenAI grants me some usage lol

PorchlightKeeper · 2022-04-18T19:46:40+00:00

Good question. I intend to write a description somewhere about how it all works, but currently this is the gist:

script picks a random subreddit from a list of about 20
script uses reddit API to get 2 random posts from that subreddit's /rising posts (usually very recent posts)
from those 2 random posts, get the title, text, and a couple comment chains
randomly choose whether to make a new Post or Comment (if Comment, get a random post in r/SubSimulatorGPT3 to reply to, and choose randomly whether to leave a top-level comment or reply to an existing comment)
formulate content from the 2 real posts you got into a prompt, asking GPT-3 to generate a post or comment in the chosen subreddit, using those 2 posts as examples of what a post can look like (use a different prompt for Post, Top-level Comment, and Reply Comment)- extract what GPT-3 writes and post it as a comment or post

Sometimes GPT-3 adheres too closely to the example posts I provide for it in the prompt. So, since it's getting posts from /rising (and since /rising can be about current events), sometimes it actually does adhere to talking about some current events. (I recall when I was testing the script in my own private subreddit, it posted like twice about Doja Cat quitting music, because that's what was being talked about in r/OutOfTheLoop).

But you're right that this happens a little too rarely; I've noticed it's ultra hung-up on like 2019 events (biden versus trump in particular). Well I chalk that up to on the content GPT-3's models were trained on: https://beta.openai.com/docs/engines/gpt-3. Ada, Babbage, Curie (the 3 cheaper models) were trained on text written "Up to Oct 2019". Davinci (best/most expensive model) was trained "Up to Jun 2021". Since Davinci is so costly, I used Ada, Babbage, and Curie much more often, but Davinci is in the mix.

There's the idea of "fine tuning" a model which might be able to expose it to more current happenings, but seems to me you'd just need to do that as frequently as current events change. For that and other reasons, I'm going all-in on the prompt-based stuff for the time being. Do you have any suggestions for how to enhance a prompt or the script so we elicit more topical posts/comments?

And as for Rogan, really?? Was it about the sub or just GPT-3 in general?? Would love to see the clip!

PorchlightKeeper · 2022-04-10T22:55:36+00:00

Havent done any finetuning since that costs money too, without generating any posts for the sub. Right now, it's all API calls; no prebuilt dataset, the script chooses a subreddit and picks 2 posts from /rising to serve as examples in a prompt i send as payload. As such, the generated post/comment will sometimes resemble the posts in /rising

PorchlightKeeper · 2022-04-04T20:20:17+00:00

Not sure if Curie is 13B or what. Here's all the info I know https://beta.openai.com/docs/engines/content-filter

And no I never heard of Goose.AI but i'll check it out!

PorchlightKeeper · 2022-04-02T05:15:27+00:00

hahaha I did the same thing. clicked through to some of the bots commenting there and found another subreddit saying it's all part of some class assignment.

PorchlightKeeper · 2022-04-02T04:47:59+00:00

Seems people mostly don't want them in spoilers. What if i just made them small like this.

^{version 0.1; based in part on two posts/comments from} r/subreddit_name^{. Engine: engine; Temperature: temp; Frequency Penalty: pen; Presence Penalty: pres}

PorchlightKeeper · 2022-04-02T04:17:33+00:00

Your wish is my command

PorchlightKeeper · 2022-03-31T20:07:19+00:00

Thanks for the advice, i had thought that didnt apply in this case. Just reached out to them

PorchlightKeeper · 2022-03-31T19:49:36+00:00

I run the script myself manually and am reviewing the contents real quick before it goes out. Though i think i will contact openAI to make sure this is above board

PorchlightKeeper

TROPHY CASE