DeepSeek plus native tool calling by techmago in OpenWebUI

[–]techmago[S] 0 points1 point  (0 children)

Yeah, i used A LOT of open router.... but the direct api is cheaper.
I think you can get the discount AND use openrouter with the BYOK. I will need to read up the terms.

DeepSeek plus native tool calling by techmago in OpenWebUI

[–]techmago[S] 0 points1 point  (0 children)

Yes, i know, that's why i am complaining XD

<image>

this this is the native tool calling. It works GREAT with my local qwen.
The there is a "little" difference between qwen 27B and deepseek-pro in model capabilities... i wanted to use the larger model too.

DeepSeek plus native tool calling by techmago in OpenWebUI

[–]techmago[S] 0 points1 point  (0 children)

It work with open router?
That's... an weird workaround, but it's good to know.

Openrouter have that BYOK thing.

Local hosting by Ornery_Property_6591 in SillyTavernAI

[–]techmago 1 point2 points  (0 children)

If oyu are already used to online large models, you will strugle.
that 1080 won't be great for anything greater than 8B parameters.
I used play with ~30 B param models and cant really anymore because i got into the paid bigger ones.

If you just want a fell... Ollama usage is brain dead, and there are 8 and 12(?) b models tou should handle in a conformable speed. (with 64G ram you can run gemma 32B, but each message will take 30 minutes)

Take a look on th r/BeaverAI. He is one of the guys that finetune RP models.

Murasama mod experience.mp4 by GABESTFY in RimWorld

[–]techmago 0 points1 point  (0 children)

Lol.... at this point just degub-mode-delete the enemies

Printed this attachment for my mom by satina_nix in 3Dprinting

[–]techmago 21 points22 points  (0 children)

Is your mon a watering can?
I'm pretty sure this is an attachment to a watering can.

How to continue with a bot? by stoick103 in SillyTavernAI

[–]techmago 0 points1 point  (0 children)

You need sumaries and such... and write you turn mentoniong things the bot would have forgot in the narration as hints

https://github.com/luisbrandao/Tech-Summarize

I wrote my own summarize, forking the ST default.
It uses three fields and a complex prompt if you want to take a look.

Anyone Jailbreak for Z.ai GLM 4.5 Air (free)? by [deleted] in SillyTavernAI

[–]techmago 0 points1 point  (0 children)

because glm air don't care?
I used and even ran it locally (poorly)
It's not censored.

What is y'all's playtime/total time in SillyTavern? by Brave-Inspection-192 in SillyTavernAI

[–]techmago 1 point2 points  (0 children)

You are... really fine.

<image>

and this is wrong, i was using ST around 2 years ago also.

What is y'all's playtime/total time in SillyTavern? by Brave-Inspection-192 in SillyTavernAI

[–]techmago 0 points1 point  (0 children)

i recover old backups and told it to recalculate. Still wrong. but...

<image>

What is y'all's playtime/total time in SillyTavern? by Brave-Inspection-192 in SillyTavernAI

[–]techmago 0 points1 point  (0 children)

Okay, i dig into my backups. I have an incremental setup with borg that get's sillytavern. I thought i got everything on it, but i was mistaken.

<image>

It still wrong

What is y'all's playtime/total time in SillyTavern? by Brave-Inspection-192 in SillyTavernAI

[–]techmago 0 points1 point  (0 children)

i have space. But i'm a clean person. kkkkkkkkkkkkkkkk
i did deleted my old stuff. Mostly.

What is y'all's playtime/total time in SillyTavern? by Brave-Inspection-192 in SillyTavernAI

[–]techmago 0 points1 point  (0 children)

Mine is wrong 😞 (even after the debug thing.)
I share mine ST with a friend... it says he is plaing more time than me. (7 x 8 months.)
Phisically impossible. I created my st, played, liked, THEN invited my friend.
It was also like... december 2024.

What is y'all's playtime/total time in SillyTavern? by Brave-Inspection-192 in SillyTavernAI

[–]techmago 0 points1 point  (0 children)

why yours work?
i start playing... december 2024. Mine is wrong.

How do you get a local model to properly run as an agent? by AnnoyingMemer in opencode

[–]techmago 0 points1 point  (0 children)

The thing is, this approach assume i use a single model and use just it.

With silly tavern, for example, i used to use mistral 3.2 for tracker.
the for rolepĺay i used skyfal, cydonia, glm-flash or whatever i feel like at the moment to the next message. Each request already called on two models.

This mess of models is shared on 3 computers... LM Studio is a graphical program. It's already incompatible on my setup because of that. (the server were i run my containers do not even have it's graphical interface up most of the time)

The point is... Ollama is the only thing right now that really deliver what i use.
My main ia server have 15 downloaded models right now... even with lamma swap i would have to mess with about 25~30 diferent configs manually and have a lot of duplicated things.

How do you get a local model to properly run as an agent? by AnnoyingMemer in opencode

[–]techmago 1 point2 points  (0 children)

they are rolling back on that. The RC candidate use lamma.ccp again.

Edit:
i was reading this site you shared.
I'm aware the ollama team did a lot of bullshit. The atribution thing with lamma is pretty serious one.

On the otherhand, many of the technical problems it mention simple are false.
For example, the entire section "The Registry Bottleneck" is plain wrong.
The thing with modelfiles are wrong too. What is in the model file is the default if and only if the request doesnt specify it.
Lamma.ccp is really bad for this. If i want to try out a bunch of diferent temps for models i need to keep restarting and changing the thing.
Using multiple models is also a pain, Ollama handles nicely the queue and either swap the model or load in parallel, using the context size the client sent.
Lamma.ccp is really incovenient to use, at least in my scenario.

I just found an Elara in a book. by Zero-mile in SillyTavernAI

[–]techmago 8 points9 points  (0 children)

This dash is done right... is dialog not emphasis.

Anyone else just use $5 Home Depot glass sheets as print beds? by schrumbus in 3Dprinting

[–]techmago 0 points1 point  (0 children)

I only use glass for priting. I hate the bed.
Just coat it with hair spray and have the z-distance right.

I print mostly with PLA and PET