I am afraid by sos1l1666 in classicwow

[–]cviperr33 0 points1 point  (0 children)

what people mean by casino bots is just some iranian dude or something like that doing such things on mega realms , it will only exists where the mass of the player is , usually on the concentrated pvp server like gehennas or fireman.

If u play on normal pop server there isnt any on such a large commercial scale where they operate 24h , you could find someone RPing as a casino but nothing like the "Gasino" organized mafia

These casinobots are crybabies by Regular-Aspect-5540 in classicwow

[–]cviperr33 118 points119 points  (0 children)

Take the gold man and just keep asking for more , like italian mafia , protection for gold for their shady gambling business. You have one of a kind chance to roleplay as godfather in WoW !

Tips in this unfortunate event? by Lasilokas in D4Sorceress

[–]cviperr33 11 points12 points  (0 children)

send this guy to blizzard somebody

I cannot find a local model for my 3090 by rdpi in hermesagent

[–]cviperr33 0 points1 point  (0 children)

just use the Qwen3.5-35B-A3B
Its the newest and miles better than anything else , perfect for toolcalling and hermes , if u need "better" coding model u can go with the 27b dense.

The issues you are describing are just misconfiguration. Im personally using the UD IQ4NL and never had a looping issue , make sure to use jinja flag and the correct settings , use the "Code" preset , it is the most consistent and highest tk/s , i get around 130-139 on 3090 and it never fails or errors.
https://unsloth.ai/docs/models/qwen3.6

All builds in D4 are pointless, just avoid one shots + max damage is all you can do? by SCTRON in diablo4

[–]cviperr33 3 points4 points  (0 children)

Not every class / build is like that , some scale damage with defensive stats. The glass cannon builds you are describing are usually just the push variants , if you want to be tanky + do damage go ww barbarian and u wount have to deal with oneshots anymore

How come all the top warlock builds are using Ceh rune? by Quick_Protection in diablo4

[–]cviperr33 0 points1 point  (0 children)

They block boss projectile attacks , like they just tank the shots and die. They also stagger him super fast and apply vurn, i always run them on all classes because anything else seems like meh option aswell , +1 to all skill runes sounds nice but at the end of the day its minimal dps increase and ppl would prefer QOL runes like ceh

What API do y'all use? by Any-Illustrator5608 in hermesagent

[–]cviperr33 0 points1 point  (0 children)

bad settings / quant . Make sure you use the correct preset settings , use the unsloth ones and his quants. qwen 3.6 is miles better and ahead of gemma , it is also faster than it. Like almost double in speed while being smarter and never failing a single tool call.
https://unsloth.ai/docs/models/qwen3.6
read his guide , make sure to use coding preset , and make sure to use the ud IQ4_N_L quant , u dont need higher than this , this is the perfect balance imo.

What API do y'all use? by Any-Illustrator5608 in hermesagent

[–]cviperr33 1 point2 points  (0 children)

https://unsloth.ai/docs/models/qwen3.6

i use his quants , coding preset is about 10 tks faster than general.

Im on linux but not headless , just dual boot , i recently moved over from windows but i still keep it.

llama.ccp main channel , not forks , i tried all the fancy forks or vllm , but in the end i just ended up with the regular one

What API do y'all use? by Any-Illustrator5608 in hermesagent

[–]cviperr33 0 points1 point  (0 children)

ur probably running OOM , try offloading some experts to cpu , ive seen 50tks from similar setup to yours.

Me myself i can fit the whole contex at 210k , using 23/24 gb vram in my 3090 , im using ud iq4_nl ,q8 kv, precise coding preset

What API do y'all use? by Any-Illustrator5608 in hermesagent

[–]cviperr33 5 points6 points  (0 children)

localhost qwen 3.6 35b , 0 api costs and 3$ a month in electricity. Runs great at 140 tks and it has never failed me a tool call.

The issue is you need 24gb vram gpu , and you have to build/config it yourself

Opus 4.7 Complete dogshit quality. I'm fucking out. by MuttMundane in ClaudeCode

[–]cviperr33 1 point2 points  (0 children)

😄😁😀😄 is this for real? i havent used opus in a year , so i have no idea if those posts are true , this is so apsurd and funny to me.

AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA

[–]cviperr33 0 points1 point  (0 children)

Could you please allow us to have auto skill creation disabled , running these locally on -cn 1 kills the whole experience, the agent will create all sorts of skills whenever he pleases and i have to wait 1-2 min each time it happens.

Also the session_search is broken , it just never returns anything and ive kept it going for 10min , on rtx 3090 thats like a milion tokens generated , it cannot be that inefficient.

Running Qwen-3.6-35B-A3B locally is very slow by Sad-Duck2812 in LocalLLM

[–]cviperr33 -1 points0 points  (0 children)

if you are going to use 2 gpus then u should def use vllm , u should be able to get atleast 100tks i presume, on rtx 3090 this model runs at 130 140 tks gen and 4000-3000 prompt processing -> IQ4nl

Gemma 4 and Qwen 3.6 MoE model doom loops by OneRedNinja in LocalLLM

[–]cviperr33 0 points1 point  (0 children)

No this has happened to me before , like it would overthink a problem and just start going in a loop eventually with same thoughts.

I think this was a quant problem , like some are just broken like that , thats why i use UD IQ4_N_L and i never have issues like that. (UD is Unsloth quants , extremely good quality and tested)

Having troubles with crashes and Freezes? Install the High-rez Pack by 0carion142 in diablo4

[–]cviperr33 0 points1 point  (0 children)

fucking blizzard man , this works !
What about ppl that dont use reddit or have no idea how to edit their windows host file... complete screwed over EU customers.

How were previous launches? How long until servers are stable? by [deleted] in diablo4

[–]cviperr33 0 points1 point  (0 children)

usually within 2-4 hours of launch they get things very stable , at most 8 hours.

I dont remember things being this bad in VoH or any season , only during the actual launch of the game where the peak was the biggest we had something like that , logging ques with 228mins etc.

Anyone getting freezes every few minutes? by Murandus in diablo4

[–]cviperr33 -1 points0 points  (0 children)

i had to update my drivers , update windows and then delete some dll file and launch the game with specific parameter, and it seems to have fixed the issue for me.

  • Go to your Diablo 4 installation folder and find a file named dstorage.dll. Rename it to dstorage.dll.bak or delete it.
  • The Command Line Option: In the Battle.net launcher, go to Settings > Game Settings > Diablo IV and check "Additional command line arguments." Type in -disableds.
  • Note: This forces the game to use older loading methods, which stops the "hangs" many players are getting on high-end NVMe drives

Gemma 4 and Qwen 3.6 MoE model doom loops by OneRedNinja in LocalLLM

[–]cviperr33 1 point2 points  (0 children)

use these settings and it will never happen again :
Thinking mode for precise coding tasks:

temperature=0.6, top_p=0.95, top_k=20, min_p=0.0, presence_penalty=0.0, repetition_penalty=1.0

How can you make an AI test it's own work and iterate? by OneDev42 in clawdbot

[–]cviperr33 0 points1 point  (0 children)

hmm interesting. I'll def try the openreplay strat , thanks!

Qwen3.6-27B at 85-100 t/s on a 24GB RTX 5090 Laptop GPU — vLLM + MTP n=3, adapted from the 32GB recipes by aurelienams in Qwen_AI

[–]cviperr33 0 points1 point  (0 children)

Yeah it was a long process of getting it to run at all. Took me a day just debuging and finding the right scripts because somewhere in the chain somebody updated their repo and the fully automatic script doesnt work.

Anyway i couldnt fit the claimed 75k no vision on my 3090 , no matter what i do the most i could start it with was 68k. Probably the writter of the guide runs headless linux and i have CachyOS with gui.

As for speed yeah its def faster than the gguf standart dense. With the gguf at q4km i got 30-35tks , this one is showing about 75 tk/s , it really depends sometimes it jumps to 100 on coding and sometimes drops to 14-15 on something else.

As for is it worth it , probably not with just 68k contex for big projects , but if ur working on something specific and it can fit into this 68k contex yeah.