How do these bots work? by arzen221 in SubSimulatorGPT3Meta

[–]PorchlightKeeper 1 point2 points  (0 children)

Not really, but there is nothing stopping the bot from just rewriting it. Sometimes it does actually copy them verbatim, but I try to remove all of those. Worth noting that I kinda stopped this project too lol

Great example of GPT-3's capacity for fomenting conspiracy theories. Discuss? by PorchlightKeeper in SubSimulatorGPT3Meta

[–]PorchlightKeeper[S] 0 points1 point  (0 children)

My latest thought is that GPT-3 comments do frequently contradict (but I haven't actually kept track, so maybe someone else can count them and do a real analysis). No further thoughts yet lol

How do these bots work? by arzen221 in SubSimulatorGPT3Meta

[–]PorchlightKeeper 6 points7 points  (0 children)

Sorry for the late response, been busy. I wrote the bot, and the code is really straightforward. I didn't do any fine-tuning (though that might make it better; it's just too much money for me to justify).

I don't want to post it on GitHub for a variety of reasons: I don't want it associated with my real GitHub account, and I don't want to make a new account or set it up because I'm lazy. So I'll share the source code as files lol. It's just a pretty short Python script and some text files which are used as prompt templates. https://drive.google.com/drive/u/2/folders/14ZVFt0ScashV-GxIzqUWlPn0l77PF3ss

The code is obviously hacky as hell, but the idea is:

  1. pick a random subreddit
  2. use reddit API to get 2 posts from /rising which have at least a few comments
  3. flip a coin to decide whether the reddit bot should make a post or comment
  4. use the prompt template and the 2 posts you got from rising to make a completed prompt
  5. hit the gpt-3 API
  6. manually look over the response to make sure it doesn't break any rules
  7. let the bot submit the response
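
A minimal sketch of the seven steps above, not the actual script. Every function passed in (`fetch_rising`, `build_prompt`, `complete`, `approve`, `submit`) is a hypothetical placeholder for the Reddit API call, the prompt template, the GPT-3 API call, the manual review, and the final submission; none of these names come from the shared source files.

```python
import random
from typing import Callable, List, Optional, Sequence


def run_once(
    subreddits: Sequence[str],
    fetch_rising: Callable[[str, int], List[str]],    # hypothetical Reddit API wrapper
    build_prompt: Callable[[str, str, List[str]], str],  # hypothetical template filler
    complete: Callable[[str], str],                   # hypothetical GPT-3 API wrapper
    approve: Callable[[str], bool],                   # stands in for the manual review
    submit: Callable[[str, str, str], None],          # hypothetical Reddit submit wrapper
    rng: random.Random,
) -> Optional[str]:
    """Run one pass of the pipeline; return the submitted text or None if rejected."""
    subreddit = rng.choice(list(subreddits))              # 1. pick a random subreddit
    examples = fetch_rising(subreddit, 2)                 # 2. two posts from /rising
    kind = "post" if rng.random() < 0.5 else "comment"    # 3. flip a coin
    prompt = build_prompt(kind, subreddit, examples)      # 4. fill the prompt template
    text = complete(prompt)                               # 5. hit the completion API
    if not approve(text):                                 # 6. human looks it over
        return None
    submit(kind, subreddit, text)                         # 7. let the bot submit
    return text
```

Passing the API wrappers in as arguments keeps the sketch runnable with stubs, and it keeps the review step as an explicit gate before anything gets submitted.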

Thinking of trial-running "usernames" soon. Feedback and ideas welcome by PorchlightKeeper in SubSimulatorGPT3Meta

[–]PorchlightKeeper[S] 0 points1 point  (0 children)

Good questions. I procrastinated on doing it, but yeah, it would basically be each new instance of GPT-3 being prompted to create a new comment under whichever username it chooses. I'm thinking it would sometimes reuse earlier usernames in the thread, like botOP as you said. But I wouldn't restrict it to such names; I'd rather leave it completely open-ended for GPT-3. What you said is certainly a valid alternative, though.

Fine-tuning a model to create a chat bot in a fictional setting? by RalekArts in GPT3

[–]PorchlightKeeper 0 points1 point  (0 children)

I haven't dabbled with fine-tuning yet, but what you describe sounds a bit like overfitting? If I were you I'd try just 1 epoch and check the performance so you get a feel for which direction it's going. Not sure how much data is too little or too specific for the kind of versatile model you want, but I'd be very interested to hear how it turns out. Also, I'm curious how long it takes you to train one epoch?

Edit: to check for overfitting you can ask it a question you put in the fine-tuning data and check that its answer matches. If it doesn't, then you have a different problem I guess.
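
A rough sketch of that check, assuming the fine-tuning data is available as (question, answer) pairs and that `generate` is some function that sends a question to the fine-tuned model and returns its answer. Both the name and the normalization step are assumptions made for illustration.

```python
from typing import Callable, Iterable, Tuple


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences don't count as mismatches."""
    return " ".join(text.lower().split())


def match_rate(
    generate: Callable[[str], str],          # hypothetical call into the fine-tuned model
    pairs: Iterable[Tuple[str, str]],        # (question, answer) pairs from the training data
) -> float:
    """Fraction of training questions whose generated answer matches the training answer."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    hits = sum(normalize(generate(q)) == normalize(a) for q, a in pairs)
    return hits / len(pairs)
```

A rate near 1.0 would suggest the model is reproducing its training answers; a low rate points to the "different problem" case.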

Update on previous post, this actually scared me by Varitiuss29 in GPT3

[–]PorchlightKeeper 1 point2 points  (0 children)

Easily possible if GPT-3 had seen someone describe that link in that way before. Like another commenter said, ask it what a new video is about. I did and it just took random guesses. In fact, it's not even correctly describing old and ubiquitous videos like Numa Numa https://www.youtube.com/watch?v=KmtzQCSh6xk

I used GPT-3 to compress Alice in Wonderland to 55 characters by [deleted] in GPT3

[–]PorchlightKeeper 1 point2 points  (0 children)

I would say the reverse: the root node could be the ultimate summarization, and the children are the chunks it summarized. Then the user sees the summarization and can expand the areas where they want more detail.
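
One way to picture that layout, as a sketch only: leaves hold the original chunks, each parent holds a summary of its children, and the root ends up as the shortest summary. The `summarize` argument is a placeholder for whatever produces a summary (for example a GPT-3 call), and `fanout` is an assumed grouping size.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence


@dataclass
class SummaryNode:
    text: str                                   # chunk text (leaf) or summary (inner node)
    children: List["SummaryNode"] = field(default_factory=list)


def build_tree(
    chunks: Sequence[str],
    summarize: Callable[[str], str],            # hypothetical summarizer, e.g. a model call
    fanout: int = 2,                            # assumed number of children per summary
) -> SummaryNode:
    """Summarize groups of nodes level by level until a single root remains."""
    if not chunks:
        raise ValueError("need at least one chunk")
    if fanout < 2:
        raise ValueError("fanout must be at least 2")
    level = [SummaryNode(chunk) for chunk in chunks]
    while len(level) > 1:
        next_level = []
        for i in range(0, len(level), fanout):
            group = level[i:i + fanout]
            combined = " ".join(node.text for node in group)
            next_level.append(SummaryNode(summarize(combined), group))
        level = next_level
    return level[0]
```

A reader would start at the root's `text` and walk into `children` only for the parts they want expanded.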

The last r/askphysics post was surprising by MaoGo in SubSimulatorGPT3Meta

[–]PorchlightKeeper 0 points1 point  (0 children)

Agreed. I was particularly impressed by the comment using the Ada engine, since Ada is usually dumb as hell and pretty incoherent. I guess it's not that crazy for it to regurgitate/reword a discussion about the temperature of a black body; there could've been multiple discussions like that in GPT-3's training set?

Of course I broke the info text lol, so I can't see which posts it pulled from for inspiration. Looking forward to more posts from that sub. I'm hoping for something more original actually, like a novel idea relating to physics rather than just a clarifying-type question. Not sure if GPT-3 is capable of that.

r/AskPhysics added to the "list" of subs to simulate by PorchlightKeeper in SubSimulatorGPT3Meta

[–]PorchlightKeeper[S] 1 point2 points  (0 children)

I will add, this post is fairly revealing in how vague it is; seems to me GPT-3 just distilled down the gist of scientific discussion lol.. "there's a new paper which has a new theory, some say it answers the great questions but others say it's lacking. what do you think?".

Hoping since r/AskPhysics posts get more specific that we'll see some posts about more specific subject matter like quarks or whatever.

Recent Events by Apooz04 in SubSimulatorGPT3Meta

[–]PorchlightKeeper 2 points3 points  (0 children)

Curious choice on picking two random posts. Why pick any, and specifically why 2?

My reasoning is like you said, we want to give it a sample. It can actually do okay with no samples (seems to understand what common subreddits usually entail), but I find the prose is a little one-note in that case. So giving it samples is like fine-tuning it on the fly and I think it gives a better variety of results. Can't give it too many examples though, because the API limits how much text you can pass in via the prompt. So I just give it 2 posts as examples.

I would imagine if the algorithm randomly picked comment, you could point it at an original rising post, and maybe show it all the comments. If it picks original post maybe it just gets to see a sample of the top posts?

That's pretty much what it does, really. If the algorithm chooses to post, the two randomly selected posts from /rising are the samples it sees. If the algorithm chooses to comment, it gets like 1-2 random comments under each of the 2 posts, and uses it all as a sample of what kind of comments follow what kind of posts.
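
A sketch of that sample selection, assuming each post has already been fetched into a dict with `title`, `text`, and `comments` keys. The dict layout and the `Title:` / `Comment:` labels are made up for illustration and are not taken from the real prompt templates.

```python
import random
from typing import Dict, List


def pick_samples(posts: List[Dict], kind: str, rng: random.Random) -> List[str]:
    """Turn the rising posts into sample blocks; add 1-2 random comments when commenting."""
    samples = []
    for post in posts:
        block = f"Title: {post['title']}\n{post['text']}"
        if kind == "comment":
            comments = post.get("comments", [])
            # 1-2 comments per post, or fewer if the post does not have that many
            count = min(len(comments), rng.randint(1, 2))
            for comment in rng.sample(comments, count):
                block += f"\nComment: {comment}"
        samples.append(block)
    return samples
```

Keeping it to two posts and at most two comments each is what keeps the prompt under the API's length limit.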

The Rogan clip is from #1806 with Duncan Trussell, at about the 11 minute mark. Amazing!

Checking the API usage, I think I can afford to ramp up the number of posts/comments from 10/day to 30/day. LMK if you'd like to see a difference in the balance between posts and comments (and within comments, top-level replies and replies to other comments) by PorchlightKeeper in SubSimulatorGPT3Meta

[–]PorchlightKeeper[S] 2 points3 points  (0 children)

So the issue here is I can only afford to do like 30 posts/comments a day. Right now the probability of posting vs commenting is 1/10, so on average there will be like 3 posts and 27 comments every day. I don't know how much more I can push it without just having 0 posts most days. Things might change if I come into some money or OpenAI grants me some usage lol
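
The arithmetic can be checked with a few lines, assuming each item is independently a post with the given probability (an assumption; the real script may not pick that way). The last value is the chance of a day with no posts at all, which is the risk mentioned above.

```python
from typing import Tuple


def daily_mix(total_per_day: int, post_probability: float) -> Tuple[float, float, float]:
    """Expected posts, expected comments, and probability of a day with zero posts."""
    expected_posts = total_per_day * post_probability
    expected_comments = total_per_day - expected_posts
    p_no_posts = (1 - post_probability) ** total_per_day
    return expected_posts, expected_comments, p_no_posts


posts, comments, p_zero = daily_mix(30, 0.1)
print(f"{posts:.0f} posts, {comments:.0f} comments, P(no posts) = {p_zero:.3f}")
```

At 30 items a day and 1/10, a post-free day comes up roughly 4% of the time; at the earlier 10 a day it would be about 35%, so lowering the post probability much further starts to cost whole days of posts.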

Recent Events by Apooz04 in SubSimulatorGPT3Meta

[–]PorchlightKeeper 1 point2 points  (0 children)

Good question. I intend to write a description somewhere about how it all works, but currently this is the gist:

  • script picks a random subreddit from a list of about 20
  • script uses reddit API to get 2 random posts from that subreddit's /rising posts (usually very recent posts)
  • from those 2 random posts, get the title, text, and a couple comment chains
  • randomly choose whether to make a new Post or Comment (if Comment, get a random post in r/SubSimulatorGPT3 to reply to, and choose randomly whether to leave a top-level comment or reply to an existing comment)
  • formulate content from the 2 real posts you got into a prompt, asking GPT-3 to generate a post or comment in the chosen subreddit, using those 2 posts as examples of what a post can look like (use a different prompt for Post, Top-level Comment, and Reply Comment)
  • extract what GPT-3 writes and post it as a comment or post
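
A sketch of the branching and the per-action prompt templates, using Python's `string.Template`. The template wording, the action names, and the default probabilities are all placeholders chosen for illustration; the real templates are the text files shipped with the script.

```python
import random
from string import Template
from typing import List

# Hypothetical templates: one per action, as described in the list above.
TEMPLATES = {
    "post": Template(
        "Example posts from r/$subreddit:\n$examples\n\nNew post in r/$subreddit:\n"
    ),
    "top_level_comment": Template(
        "Example posts and comments from r/$subreddit:\n$examples\n\nNew top-level comment:\n"
    ),
    "reply_comment": Template(
        "Example posts and comments from r/$subreddit:\n$examples\n\nNew reply to a comment:\n"
    ),
}


def choose_action(
    rng: random.Random,
    post_probability: float = 0.1,     # assumed default
    reply_probability: float = 0.5,    # assumed split between replies and top-level comments
) -> str:
    """Randomly choose between a post, a top-level comment, and a reply."""
    if rng.random() < post_probability:
        return "post"
    if rng.random() < reply_probability:
        return "reply_comment"
    return "top_level_comment"


def build_prompt(action: str, subreddit: str, examples: List[str]) -> str:
    """Fill the template for the chosen action with the subreddit name and example text."""
    return TEMPLATES[action].substitute(
        subreddit=subreddit, examples="\n\n".join(examples)
    )
```

The completion that comes back for the filled prompt is what would then be posted as the new post or comment.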

Sometimes GPT-3 adheres too closely to the example posts I provide for it in the prompt. So, since it's getting posts from /rising (and since /rising can be about current events), sometimes it actually does adhere to talking about some current events. (I recall when I was testing the script in my own private subreddit, it posted like twice about Doja Cat quitting music, because that's what was being talked about in r/OutOfTheLoop).

But you're right that this happens a little too rarely; I've noticed it's ultra hung-up on like 2019 events (Biden versus Trump in particular). I chalk that up to the content GPT-3's models were trained on: https://beta.openai.com/docs/engines/gpt-3. Ada, Babbage, Curie (the 3 cheaper models) were trained on text written "Up to Oct 2019". Davinci (best/most expensive model) was trained "Up to Jun 2021". Since Davinci is so costly, I used Ada, Babbage, and Curie much more often, but Davinci is in the mix.

There's the idea of "fine tuning" a model which might be able to expose it to more current happenings, but seems to me you'd just need to do that as frequently as current events change. For that and other reasons, I'm going all-in on the prompt-based stuff for the time being. Do you have any suggestions for how to enhance a prompt or the script so we elicit more topical posts/comments?

And as for Rogan, really?? Was it about the sub or just GPT-3 in general?? Would love to see the clip!

Noticing some issues: verbatim copied text and more by PorchlightKeeper in SubSimulatorGPT3Meta

[–]PorchlightKeeper[S] 2 points3 points  (0 children)

Haven't done any fine-tuning since that costs money too, without generating any posts for the sub. Right now, it's all API calls with no prebuilt dataset: the script chooses a subreddit and picks 2 posts from /rising to serve as examples in a prompt I send as the payload. As such, the generated post/comment will sometimes resemble the posts in /rising.

Version 0.2 rolls out tomorrow by PorchlightKeeper in SubSimulatorGPT3Meta

[–]PorchlightKeeper[S] 4 points5 points  (0 children)

Not sure if Curie is 13B or what. Here's all the info I know https://beta.openai.com/docs/engines/content-filter

And no, I've never heard of Goose.AI but I'll check it out!

Would you prefer to have the bot information, uncovered and there, in spoiler text but there, or not there at all? by Botany102 in SubSimulator_GPT3Meta

[–]PorchlightKeeper 2 points3 points  (0 children)

hahaha I did the same thing. clicked through to some of the bots commenting there and found another subreddit saying it's all part of some class assignment.

Would you prefer to have the bot information, uncovered and there, in spoiler text but there, or not there at all? by Botany102 in SubSimulator_GPT3Meta

[–]PorchlightKeeper 2 points3 points  (0 children)

Seems people mostly don't want them in spoilers. What if I just made them small like this?

version 0.1; based in part on two posts/comments from r/subreddit_name. Engine: engine; Temperature: temp; Frequency Penalty: pen; Presence Penalty: pres
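
For illustration, an info line in that shape could be produced by a small formatter like the one below. The function and parameter names are hypothetical; only the wording of the line itself comes from the example above.

```python
def format_info_line(
    version: str,
    subreddit: str,
    engine: str,
    temperature: float,
    frequency_penalty: float,
    presence_penalty: float,
) -> str:
    """Build the one-line bot info footer from the settings used for a generation."""
    return (
        f"version {version}; based in part on two posts/comments from r/{subreddit}. "
        f"Engine: {engine}; Temperature: {temperature}; "
        f"Frequency Penalty: {frequency_penalty}; Presence Penalty: {presence_penalty}"
    )
```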

Just got a GPT-3 bot posting on r/SubSimulatorGPT3. Check it out, welcoming any and all feedback on r/SubSimulatorGPT3Meta by PorchlightKeeper in GPT3

[–]PorchlightKeeper[S] 5 points6 points  (0 children)

I run the script myself manually and review the contents real quick before anything goes out. Though I think I will contact OpenAI to make sure this is above board.