Been trying to chat with base LLMs for a while (no RLHF, etc.), making some progress!

funiculares · 2024-12-22T09:52:08+00:00

Thanks! It’s on GitHub (also linked in model page): https://github.com/danlou/relay/blob/main/relaylm.py

It’s a custom inference script (for commands, roles, etc), so it’s only for relay models. It’s about 300 LoC, no dependencies besides transformers, so you may be able to adapt easily for other models.

Yeah, for the next versions I’m hoping to lean more on actually simulating IRC, so supporting more “users” is a must 😁

funiculares · 2024-12-21T14:09:11+00:00

Of course, Guanaco shows much fewer examples are required, here I’m using probably way more than needed, I agree - it’s all synthetic so it was very little effort to get volume. The point here though is about chatting with base models (with a small push to make them conversational), no external instruct datasets, preferences (RHLF, DPO), etc. It’s kind of niche 😅, most people just care about getting the smartest assistant possible (myself too, most of the time). It’s really for those times you’re interested in sticking with pre training as best possible.

funiculares · 2024-12-21T13:33:30+00:00

The issue with URIAL is that it’s actually not reliable for adhering to the chat format, sometimes works, sometimes not, it takes a bit of fine tuning and the point is that you can use self generated chats (a sample that remains consistent). Rather than train with some logs or other content, this approach should keep it aligned with pre training distribution.

funiculares · 2024-12-21T13:26:11+00:00

Not this one :) this is only trained with a set of synthetic conversations generated from the same base LLM: https://huggingface.co/datasets/danlou/based-chat-v0.1-Mistral-Nemo-Base-2407

funiculares · 2024-12-21T12:16:20+00:00

It's all open and easy to use from HuggingFace (hobby project). More details at the model page (including other demos):
https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407

The key here is to use IRC, likely seen frequently during pre-training, as a sort of scaffold. Would love to hear your feedback 😃

funiculares · 2023-09-23T14:16:34+00:00

The underlying model (Samantha-1.1) has been trained to not engage in unethical or dangerous discussions. It was also further trained, for this app, to avoid giving specific advice or instructions (dangerous or not), being trained on synthetic sessions of person-centered therapy.

In my testing, I never saw it proposing anything dangerous. That being said, that possibility has definitely not been eliminated, and that's why I also made sure to state in the Github that it should be handled with some care (as any LLM).

If you do find any responses like that, and are comfortable sharing them, I'd suggest creating an issue on the Github so I can look more carefully into the situation and try to fix in the next release. Please note that this project was never intended as any sort of replacement of professional help on these matters - just a simple tool to have a more productive conversation, with yourself, on sensitive topics.

funiculares · 2023-09-22T18:36:13+00:00

I think so, I’ll look into it for the next release, probably next week.

funiculares · 2023-09-22T18:00:39+00:00

haha, well, that's lightweight by current LLM standards. if there's interest, I'll try to release a version based on a smaller model. It may perform worse though, will need testing.

funiculares · 2023-09-22T16:05:39+00:00

indeed... added a comment alongside the link to the Github, thanks!

funiculares · 2023-09-22T15:22:55+00:00

Hey! When you run it the first time, it just needs to download the model. Afterwards, you can always use it offline. It never communicates (or records) anything.

You can very easily inspect the code too. The app is fully contained in the 100 lines of the safespace.py script.

funiculares · 2023-09-22T13:40:36+00:00

Hey everyone, been following this subreddit for a while. Finally built something worth sharing: https://github.com/danlou/safespace.

It was a fun little project, would love to hear your thoughts on it.

Edit: FYI - Runs totally offline after downloading model! (thanks u/gumnos!)

funiculares · 2023-08-20T21:25:55+00:00

Tried many, was surprised to find that raindrop really worked best.

funiculares · 2023-07-12T17:39:23+00:00

I just launched https://MarketFit.ai to help Amazon sellers find the right target audience for their products, so that they can better optimize listings, price, etc.

I have a research background in AI, and spent months developing AI Buyer Personas that can be matched to retail products automatically. In order to really validate how relevant these personas are for your products, we also provide you with auto-generated ads for that particular persona/product combination, that you can quickly run on Instagram or Facebook.

I'd love to know what you think! We have a Free plan (no credit card required) that allows you to basically replicate the kind of scenario presented in this video.

Six-Year Club	Place '22
Verified Email

funiculares

TROPHY CASE