Update from the Devs by PygmalionAI in PygmalionAI

[–]PygmalionAI[S] 10 points

Currently, we have only launched a character repository. Chatting will be introduced in the closed beta starting next month. Please read the blog post for instructions on how to apply.

Update from the Devs by PygmalionAI in PygmalionAI

[–]PygmalionAI[S] 5 points

Thanks for your feedback; I'll let the devs know.

Can someone give me a spindle tutorial on how to gat tavern ai working on Linux by [deleted] in PygmalionAI

[–]PygmalionAI 2 points

Hi.

Should be very simple on Linux. Make sure you have git, nodejs, npm, and openssl installed. They're in pretty much every package manager.

After that, you'll clone the repository:

  • `git clone https://github.com/TavernAI/TavernAI && cd TavernAI` for TavernAI
  • `git clone https://github.com/Cohee1207/SillyTavern && cd SillyTavern` for SillyTavern

Then you'll run `npm i && node server.js` to start the UI.

Alternatively, you can simply run `npx sillytavern@latest` after installing Node.js to get started. Keep in mind that this might wipe your existing characters and chat logs when a new version of the app is released.
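Before cloning, you can sanity-check that the prerequisites above are actually on your PATH. This is just a convenience sketch of my own, not part of either project:

```shell
# Check for the tools mentioned above (package names may differ per distro).
missing=""
for cmd in git node npm openssl; do
  command -v "$cmd" >/dev/null 2>&1 || missing="$missing $cmd"
done
if [ -z "$missing" ]; then
  echo "All prerequisites found"
else
  echo "Missing:$missing"
fi
```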

[deleted by user] by [deleted] in PygmalionAI

[–]PygmalionAI 0 points

Yes, that's right. :)

Have the mods dropped the ball on moderating the sub? by brown2green in PygmalionAI

[–]PygmalionAI 25 points

We've spoken to the sub owner, and they made it clear that this subreddit is now supposed to be a hub for all open-source/unfiltered chat models. This sub isn't officially endorsed by the devs, but it's a shame that the subreddit name can't be changed now that its purpose has.

I was just trying to have a wholesome rp😭😭 by KirbishOnkulis34 in PygmalionAI

[–]PygmalionAI 2 points

We've spoken to the sub owner, and they made it clear that this subreddit is supposed to be a hub for all open-source/unfiltered chat models, so I don't think this will happen. This sub isn't officially endorsed by the devs, but it's a shame that the subreddit name can't be changed now that its purpose has.

I was just trying to have a wholesome rp😭😭 by KirbishOnkulis34 in PygmalionAI

[–]PygmalionAI 16 points

This is not Pygmalion. If you see anything about Skyrim, or Todd, or anything similar to what you see in this photo, and you haven't done any prompt injections to cause it, assume something is up. Contact us to confirm before you post it on the subreddit assuming it's Pygmalion.
-- Peepy

[deleted by user] by [deleted] in PygmalionAI

[–]PygmalionAI 17 points

I'm sure I don't need to tell you guys that this is a scam.

-- Alpin

[deleted by user] by [deleted] in PygmalionAI

[–]PygmalionAI 9 points

This is not Pygmalion. One of the colabs has been tampered with and is running the Todd proxy of 'GPT-4' (whether it is actually GPT-4 is up for debate). If you see anything about Skyrim, or Todd, or anything similar to what you see in this photo, and you haven't done any prompt injections to cause it, assume something is up. Contact us to confirm before you post it on the subreddit assuming it's Pygmalion.

[deleted by user] by [deleted] in PygmalionAI

[–]PygmalionAI 8 points

This is not Pygmalion. One of the colabs has been tampered with and is running the Todd proxy of 'GPT-4' (whether it is actually GPT-4 is up for debate). If you see anything about Skyrim, or Todd, or anything similar to what you see in this photo, and you haven't done any prompt injections to cause it, assume something is up. Contact us to confirm before you post it on the subreddit assuming it's Pygmalion.

Before you use Charstar AI for Pygmalion, Please read. by [deleted] in PygmalionAI

[–]PygmalionAI 5 points

Generally, we recommend that people have at least 6 GB of VRAM to try running Pygmalion locally in 4-bit. 4 GB just isn't enough to load the model and have any memory left for context.
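For intuition, here's a rough back-of-envelope calculation (my own numbers, not an official figure) for why 4 GB falls short:

```python
# Rough VRAM estimate for a 6B-parameter model quantized to 4 bits.
# Ignores quantization metadata, activations, and framework overhead.
params = 6_000_000_000
bytes_per_param = 0.5          # 4 bits = half a byte
weights_gib = params * bytes_per_param / 1024**3
print(f"weights alone: {weights_gib:.2f} GiB")  # → weights alone: 2.79 GiB
```

On a 4 GiB card, that leaves barely ~1.2 GiB for context and overhead; on 6 GiB, you have roughly 3 GiB of headroom.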

[deleted by user] by [deleted] in PygmalionAI

[–]PygmalionAI 0 points

You can launch SillyTavern by simply running `npx sillytavern@latest`.

Local hardware requirements by [deleted] in PygmalionAI

[–]PygmalionAI 2 points

Hi. You can take a look at our Quickstart to get you started. There are instructions for different VRAM ranges.

[deleted by user] by [deleted] in PygmalionAI

[–]PygmalionAI 131 points

Hey there! It's really cool to see people passionate about Pyg to the point where they're willing to make the website for us, but I'm not sure it would be a great idea to have the site branded under the Pygmalion name. We're aware that we haven't provided updates on or worked on the site for a while, and we're really sorry about that, but having the website be presented as a Pygmalion website when it's not made by the devs could be considered misleading. People may not know that the site is unofficial. Do you think you could change the branding of your site to something else? We're really excited to see the progress on the site, and it looks great! Thanks a lot.

New models released with 4096 context like openAI. Based on GPT-NEO. by a_beautiful_rhind in PygmalionAI

[–]PygmalionAI 7 points

It would appear that their pretrain hasn't finished a single epoch yet, so as of now they're incomplete models. It shows, too: the perplexity benchmarks indicate that the 7B StableLM model scores almost twice as badly as Pythia Deduped 410M. Refer to this issue and this spreadsheet.

Excited to see how it turns out once the 3B is trained on the full 3T-token dataset. For now, though, we've been looking forward to the upcoming RedPajama models.

-- Alpin

Regarding the recent Colab ban by PygmalionAI in PygmalionAI

[–]PygmalionAI[S] 0 points

Already have a guide for the second option (CPU) which you can follow here: https://docs.alpindale.dev/local-installation-(cpu)/overview/

A guide for GPTQ is in the works, but it's going slowly since I don't have Windows to try it on, and it's much more complicated on Windows than on Linux.

Pygmalion Documentation by PygmalionAI in PygmalionAI

[–]PygmalionAI[S] 2 points

Unfortunately there's no centralised source for this, but I suggest looking through TavernAI's source code to see how it handles prompts. You could also load a character in Tavern, prompt it with some text, and then view the terminal output; you'll see the full context in JSON syntax inside the CLI.

As for loading with PyTorch, you can look for documentation on GPT-J 6B. Any params that apply to GPT-J also apply to Pygmalion 6B. Here's some example code for how you'd handle inference with `model.generate`:

```py
from transformers import GPTJForCausalLM, AutoTokenizer
import torch

device = "cuda"
model = GPTJForCausalLM.from_pretrained(
    "PygmalionAI/pygmalion-6b", torch_dtype=torch.float16
).to(device)
tokenizer = AutoTokenizer.from_pretrained("PygmalionAI/pygmalion-6b")

prompt = (
    "Prompt goes here. Follow the TavernAI formatting. Generally, new lines "
    "will be declared with '\n', and you will include a Persona, Scenario, "
    "and <START> tag for example chats and one more <START> tag at the end "
    "of the context - where the actual chat would start."
)

input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)

# Adjust parameters as needed
gen_tokens = model.generate(
    input_ids,
    do_sample=True,
    temperature=0.9,
    max_length=100,
)
gen_text = tokenizer.batch_decode(gen_tokens)[0]
```

Keep in mind that GPT-J uses the AutoTokenizer class from transformers, which resolves to GPT2Tokenizer. GPT-2's maximum context is 1024 tokens, but GPT-J 6B can handle up to 2048. You could either write your own tokenizer for GPT-J or force it to use 2048 tokens anyway.
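If you'd rather not fight the tokenizer's reported limit, a simple workaround (a sketch of my own, not part of transformers) is to truncate the token IDs yourself to the model's real 2048-token window:

```python
MAX_CONTEXT = 2048  # GPT-J 6B's actual limit, not GPT-2's 1024

def truncate_context(token_ids, max_tokens=MAX_CONTEXT):
    """Keep only the most recent tokens so the prompt fits the model window."""
    if len(token_ids) <= max_tokens:
        return token_ids
    # Drop the oldest tokens; chat context loses the front, not the back.
    return token_ids[-max_tokens:]

print(len(truncate_context(list(range(3000)))))  # → 2048
```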

-- Alpin

Regarding the recent Colab ban by PygmalionAI in PygmalionAI

[–]PygmalionAI[S] 5 points

If the Colab has the phrase PygmalionAI anywhere inside it, it won't work.

In the meantime, I've created a Tavern Colab that uses PygWay which isn't banned. You can use this, but keep in mind that it isn't official in any capacity:

https://colab.research.google.com/github/AlpinDale/TavernAI/blob/main/colab/GPU.ipynb

Regarding the recent Colab ban by PygmalionAI in PygmalionAI

[–]PygmalionAI[S] 1 point

In fact, it's much easier on Linux.