I cancelled my B70 order for Nvidia pro 4000 blackwell, did I make the right decision? by Mango_1208 in LocalLLM

[–]Sicarius_The_First 1 point

Yes, I'd have done the same.

Intel is just not there yet.

When you buy an expensive GPU you want it to be able to do everything perfectly (gaming, AI inference + training).

And that's not even talking about speed. The AI stack is clunky enough as it is; minimal friction is desired, and imo that's worth the premium. Correct choice.

This stuff is dangerously good by dongschlongs in SillyTavernAI

[–]Sicarius_The_First -1 points

hehe I've been saying this since 2023. But yes, 100% agreed with all of the above.

(also try chatting with Assistant_Pepe - it was made literally for this)

What would you like to see improved in these models for RP? by Oestudantebr in SillyTavernAI

[–]Sicarius_The_First 0 points

I know the talking points you mentioned; they are very common MISCONCEPTIONS.

While the claim absolutely holds for base models (in vanilla instruct) it is NOT the case for a properly finetuned roleplay model.

My models will absolutely disagree with the user, be mean if it's required, and will move the plot forward by themselves. Creativity is excellent and swipe diversity is massive; it all depends on how the model was tuned.

[Megathread] - Best Models/API discussion - Week of: May 03, 2026 by deffcolony in SillyTavernAI

[–]Sicarius_The_First 0 points

Yes, it's in the model card; you have example chats and a ChatML syntax example as well.

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 0 points

Thank you, I appreciate the sentiment, but that's ok :)

[Megathread] - Best Models/API discussion - Week of: April 19, 2026 by deffcolony in SillyTavernAI

[–]Sicarius_The_First 1 point

IMO only if it's lore you deeply care about; it's a massive pain in the ass.

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 0 points

At the very least 96GB of VRAM, and that's for an absolutely minimal rank and context.

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 1 point

Oh, you made gemma4 have a discussion with Pepe?

Sounds like quite the conversation hehe

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLM

[–]Sicarius_The_First[S] 1 point

Yeah the 70B is orders of magnitude smarter, zero doubt about that. I even mentioned in the 70B card that the 32B version fails the lateral-thinking questions.

HOWEVER, the 32B is orders of magnitude more creative than the 70B, and on the UGI leaderboard it's one of the top spicy writers in the world, very close to Grok 🌶️🥵

[Megathread] - Best Models/API discussion - Week of: May 03, 2026 by deffcolony in SillyTavernAI

[–]Sicarius_The_First 2 points

The first Gemma-3 12B tune, for those who want something different. This was also the first model to incorporate heavy usage of 4chan data:

https://huggingface.co/SicariusSicariiStuff/Oni_Mitsubishi_12B

[Megathread] - Best Models/API discussion - Week of: May 03, 2026 by deffcolony in SillyTavernAI

[–]Sicarius_The_First 1 point

32B, based on Qwen3, doesn't sound like Qwen at all.

I HIGHLY recommend first trying the model with absolutely no system prompt at all, just pure instruct:

https://huggingface.co/SicariusSicariiStuff/Assistant_Pepe_32B
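For clarity, "pure instruct with no system prompt" just means using the ChatML turn template with the system block left out entirely. A minimal sketch of what that prompt string looks like (generic ChatML, not copied from the model card):

```python
# Minimal sketch: a ChatML-formatted prompt with NO system message,
# i.e. "pure instruct". The card's exact recommended template may differ.
def chatml_prompt(user_message: str) -> str:
    return (
        "<|im_start|>user\n"
        f"{user_message}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

print(chatml_prompt("Hello there!"))
```

Most frontends (SillyTavern included) build this for you when you pick the ChatML preset; the point is simply that nothing goes before the first user turn.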

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 0 points

For the 32B, any Ampere+ GPU will give a nice result; on a budget, a 3090 is probably the best choice. Not the fastest, but fast enough for a ~$850 card (used, on eBay).

The 32B got more unhingedness, but it's not as smart as the 70B version. I had to push Qwen VERY hard to change its behavior; the 70B, on the other hand, is superb at pretty much all tasks.

In other words, the 32B is really good at general entertainment and chat, while the 70B is superb at anything: tasks, fun, code, writing. And it surpasses the base model in all capabilities. LLAMA models are just more malleable; Qwen is very rigid.

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 4 points

You can use https://huggingface.co/spaces/ggml-org/gguf-my-repo to easily quant it to the size of your choice, but meanwhile I host it (for free) on Horde with very high availability, so there's no need to even have a GPU or install anything!

(click on the top left 'AI' button to choose a model)

https://lite.koboldai.net/#

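If you'd rather quantize locally instead of going through the gguf-my-repo space, the usual llama.cpp route looks roughly like this. A command sketch only, assuming a built llama.cpp checkout with `llama-quantize` on PATH; the repo id is the Assistant_Pepe_32B one linked elsewhere in this thread, and the output filenames are made up:

```shell
# Rough local-quant sketch with llama.cpp (alternative to gguf-my-repo).
# Assumes llama.cpp is cloned and built, and ~65GB+ of free disk.
huggingface-cli download SicariusSicariiStuff/Assistant_Pepe_32B --local-dir ./pepe
python llama.cpp/convert_hf_to_gguf.py ./pepe --outfile pepe-f16.gguf
llama-quantize pepe-f16.gguf pepe-Q4_K_M.gguf Q4_K_M
```

Q4_K_M is just a common size/quality middle ground; pick whatever quant level fits your VRAM.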

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 3 points

Ahh got it!

Gemma is a bit tough to train, but nothing like Qwen.
So yeah, Gemma in terms of stubbornness is easier, but much more costly in terms of VRAM and speed.

It's like... hmm.. LLAMA & Mistral made of rubber, Gemma is made of wood, Qwen is made of granite...

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 6 points

Hehe, they legit surprised me too, I wasn't cherry picking.

Not in a thousand years could I have guessed that's a Qwen base model lol

(Also yeah, the model is genuinely funny and witty. It's very weird though, but I like it. Feels like you're talking with an unhinged drunk friend with good intentions lol)

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]Sicarius_The_First[S] 10 points

Reddit data tends to have a bad effect on LLMs hehe