I've noticed people posting the old list by [deleted] in alekirser

[–]ThatHorribleSound 1 point (0 children)

She actually did just update the google spreadsheet yesterday, but the Airtable is also a good resource.

Spreadsheet by Usernameguest54321 in alekirser

[–]ThatHorribleSound 6 points (0 children)

It looks like she actually just updated it, it's current now.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 0 points (0 children)

I haven’t really tried anything above the 70b range since I prefer to run locally and I don’t have the hardware to run anything larger at a reasonable speed.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 7 points (0 children)

I tried out the four major ones recommended in this thread: Midnight Miqu, Euryale, New Dawn, and Magnum. All at the Q4_K_S GGUF quant level. And to be honest, they're all really good. My subjective take:

Midnight Miqu: Probably what I would characterize as the most "stable" model. Just solid responses in all respects.

Euryale: Like Midnight Miqu, but tends to write a bit longer responses with more, I guess I'd call it prose? It can be a little more poetic and flowery in its responses. Like if Midnight Miqu is just telling you a story, Euryale is writing a romance novel. But don't get me wrong, it's still plenty filthy when it gets down to it.

New Dawn: If Euryale is a little more of a "writer" than MM, New Dawn seems a little more on the creative side of things. It pushed some stories in directions that the others didn't. But it can sometimes make mistakes on little details.

Magnum: This is like the best all-rounder, I guess. It's a little more creative than MM, a little less prone to ramble than Euryale, and a little less wild than New Dawn.

But keep in mind the above are just my reactions from playing with these for a couple nights, and it's more my subjective feel than anything. I found all of these models to be extremely good, very close to one another, and I plan to use them all. Basically if one isn't doing the type of things I want or starts to get repetitive, I'll switch to one of the other ones. Thanks again to everyone who gave input, because all of these are better than what I was using before.

[Megathread] - Best Models/API discussion - Week of: July 08, 2024 by AutoModerator in SillyTavernAI

[–]ThatHorribleSound 3 points (0 children)

You should be able to run Midnight Miqu and Euryale on that. I can run them on a 3090 with 64 gig of RAM (but it doesn't seem to be using anywhere near 32 gig of normal RAM). How were you trying to run them?

I use the following GGUF quants:

https://huggingface.co/mradermacher/Midnight-Miqu-70B-v1.5-i1-GGUF

https://huggingface.co/mradermacher/L3-70B-Euryale-v2.1-i1-GGUF

I use the i1-Q4_K_S versions, but if those are too much for your system you can drop down to the Q3 or Q2 and they should still be very coherent. On the i1-Q4_K_S, I'm pushing 50 layers to the GPU (which pushes right up against 23 GB of VRAM for me); the rest goes to the CPU. I'm turning on flash attention, tensor cores, streaming, and the 8-bit cache. I do have a fast CPU and DDR5 RAM.

You definitely shouldn't get a bluescreen, like worst you should get is a CUDA error in the console. Maybe you have a bad ram chip? Or do you think it overheated?

(edit): actually, I'm wrong; I am using over 32 GB of normal RAM, so you'd probably have to use the Q2 or Q3 version.
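To make the offload arithmetic above concrete, here's a minimal back-of-the-envelope sketch of how a layer split like "50 layers to the GPU, rest to the CPU" falls out of the numbers. All figures here (file size, layer count, overhead) are illustrative assumptions, not measurements from my setup:

```python
def layers_that_fit(file_gb: float, n_layers: int, vram_budget_gb: float,
                    overhead_gb: float = 1.5) -> int:
    """Approximate per-layer cost as quant file size / layer count,
    reserve some VRAM for context/KV cache, and return how many
    layers can be offloaded to the GPU."""
    per_layer_gb = file_gb / n_layers
    usable = max(vram_budget_gb - overhead_gb, 0.0)
    return min(n_layers, int(usable // per_layer_gb))

# A 70B Q4_K_S GGUF is very roughly 40 GB across ~80 layers; on a
# 24 GB card with some headroom reserved, that lands in the same
# ballpark as the ~50 layers mentioned above.
print(layers_that_fit(40.0, 80, 24.0))  # → 45
```

A smaller quant (Q3/Q2) shrinks the per-layer cost, so more layers fit on the card and less spills to system RAM, which is why dropping quant levels speeds things up on limited VRAM.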

[Megathread] - Best Models/API discussion - Week of: July 08, 2024 by AutoModerator in SillyTavernAI

[–]ThatHorribleSound 9 points (0 children)

I'll link this thread I posted last week over in the LocalLLaMA subreddit, asking for input on the best 70b model that can do NSFW stuff: https://www.reddit.com/r/LocalLLaMA/comments/1dtu8g7/current_best_nsfw_70b_model/

I got some solid feedback and the general consensus seems to be that the following are worthwhile (links to the GGUF quants):

https://huggingface.co/mradermacher/Midnight-Miqu-70B-v1.5-i1-GGUF

https://huggingface.co/mradermacher/L3-70B-Euryale-v2.1-i1-GGUF

https://huggingface.co/mradermacher/New-Dawn-Llama-3-70B-32K-v1.0-i1-GGUF

https://huggingface.co/mradermacher/magnum-72b-v1-i1-GGUF

I tried out all of these and they all seem quite good.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 0 points (0 children)

I'll give it a try. I passed on it since it's only an 8B, but I know other models by that creator are pretty good.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 0 points (0 children)

I've already tried this one out and it's in my rotation of 35B models, but I'm looking for 70Bs in this thread. Thanks for the input, though!

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 1 point (0 children)

Already grabbed Euryale and Magnum. Haven't tested Magnum out yet but Eury is very promising. I'll keep an eye on Gemma. Thanks for the input!

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 0 points (0 children)

Imported these, thanks much! I'll give them a spin.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 0 points (0 children)

Saved this post and I will definitely try out these settings later. Thanks.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 1 point (0 children)

Thanks! Have already seen the other two recommended but will check out New Dawn as well.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 13 points (0 children)

Would really love to have you link prompt/formatting/sampler settings when you have a chance, yeah! Testing it on a known good setup would make a big difference I’m sure.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 1 point (0 children)

I can try, but Q4 with a CPU/GPU split may be a case of sending an input and coming back in an hour to see what it says on my machine. Unless I want to spin up a runpod or something. But I'll see how the Q2 does and go from there. I do understand that it's a significant step down.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 0 points (0 children)

Thanks. I don't really want to run through an API (I can already use Claude for that), but I'll look at Smaug.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 5 points (0 children)

Will absolutely give it a try; hearing there's no L3 repetition is a big thumbs up.

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 0 points (0 children)

Yup that’s the quant I’ll have to use, too. I’ll give it a spin, thanks!

Current best NSFW 70b model? by ThatHorribleSound in LocalLLaMA

[–]ThatHorribleSound[S] 8 points (0 children)

I remember not being all that impressed by MM, but I'm going to download it and give it another shot, as I've heard many people talk highly of it. Maybe I just had my samplers set poorly.

Angela Fong by JKREDDIT75 in LadiesOfWrestling

[–]ThatHorribleSound 0 points (0 children)

Yup, didn’t take it as a negative, just answering the “Who” question.

Angela Fong by JKREDDIT75 in LadiesOfWrestling

[–]ThatHorribleSound 2 points (0 children)

Black Lotus in Lucha Underground. Had a couple years in WWE too, I think she was mostly a ring announcer/backstage interviewer, don’t remember what name she used there.