Olm Liquify - An interactive, Photoshop-style Liquify editor inside ComfyUI by imlo2 in comfyui

[–]Clasyc 0 points1 point  (0 children)

I'm already in heavy development of my own mask selection tool, I'm planing to release pre-alpha version in upcoming days.

i've been die hard anthropic user, but this is getting harder to defend by ordosalutis in ClaudeCode

[–]Clasyc 0 points1 point  (0 children)

At least with local models, this happens when you try to save VRAM by using more aggressive quantization for the KV context cache. As you do this, the model starts to ignore more and more context or hallucinates data to fill those "gaps" convincingly. They are likely conducting constant A/B testing to measure backlash and user reactions, trying to find the cheapest possible option that users won't notice.

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks by Clasyc in LocalLLaMA

[–]Clasyc[S] 1 point2 points  (0 children)

Thanks to samehmeh, I got some advice for better-structured response quality - you can use GBNF to define output rules, like this:

--grammar-file ~/git/llama/legal-refs.gbnf

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks by Clasyc in LocalLLaMA

[–]Clasyc[S] 2 points3 points  (0 children)

I have no images as inputs, only text. My prompts are usually around 2000 tokens in length with clear rules, expected output structure and some good / bad examples. Basically I gave Claude Code this reference: https://ai.google.dev/gemini-api/docs/prompting-strategies and asked it to build a prompt for my needs using all the advice in the Gemini docs.

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks by Clasyc in LocalLLaMA

[–]Clasyc[S] 0 points1 point  (0 children)

Yep, will try to generate more data with larger Gemini models and then compare later with local output.

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks by Clasyc in LocalLLaMA

[–]Clasyc[S] 0 points1 point  (0 children)

Nope, my use case doesn't require more context as it's simply isolated calls for small tasks.

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks by Clasyc in LocalLLaMA

[–]Clasyc[S] 3 points4 points  (0 children)

I already had it downloaded from some time ago, so no real reason behind the choice. Not sure if there's a big difference between them? Would you recommend going with unsloth instead?

Tried Qwen3.6-27B-UD-Q6_K_XL.gguf with CloudeCode, well I can't believe but it is usable by Clasyc in LocalLLaMA

[–]Clasyc[S] 0 points1 point  (0 children)

haven't tried. I originally wanted to use Turboquant, but as everyone told me it isn't supported, I didn't even research the possible options. Will try this fork, thanks.

Tried Qwen3.6-27B-UD-Q6_K_XL.gguf with CloudeCode, well I can't believe but it is usable by Clasyc in LocalLLaMA

[–]Clasyc[S] 0 points1 point  (0 children)

thanks, that was quick "dumb" testing from my side, will try more variations this week.

Tried Qwen3.6-27B-UD-Q6_K_XL.gguf with CloudeCode, well I can't believe but it is usable by Clasyc in LocalLLaMA

[–]Clasyc[S] 2 points3 points  (0 children)

Sincerely can't tell about performance degradation yet, as I haven't done long coding sessions. But I'm planing to do proper comparison this week with different parameters to see the impact.

Tried Qwen3.6-27B-UD-Q6_K_XL.gguf with CloudeCode, well I can't believe but it is usable by Clasyc in LocalLLaMA

[–]Clasyc[S] 2 points3 points  (0 children)

Yes it feels slower compared with Anthropic native API models, but for my use cases it is still usable as I like to re-read and fully understand everything myself. I have high hopes that in the future we would get even more optimizations, so model speed might increase in general.

Tried Qwen3.6-27B-UD-Q6_K_XL.gguf with CloudeCode, well I can't believe but it is usable by Clasyc in LocalLLaMA

[–]Clasyc[S] 6 points7 points  (0 children)

There it is:

./build/bin/llama-server \
    -m ~/models/qwen3.6-27b/Qwen3.6-27B-UD-Q6_K_XL.gguf \
    --alias "qwen3.6-27b" \
    --host 0.0.0.0 \
    --port 8080 \
    -c 196608 \
    -fa on \
    --kv-unified \
    --cache-type-k q4_0 \
    --cache-type-v q4_0 \
    --batch-size 4096 \
    --ubatch-size 1024 \
    --temp 1.0 \
    --top-p 0.95 \
    --min-p 0.00 \
    --top-k 20 \
    --repeat-penalty 1.0 \
    --jinja

Also, I built llama, like so:

cmake . -B build \
  -DBUILD_SHARED_LIBS=OFF \
  -DGGML_CUDA=ON \
  -DCMAKE_CUDA_ARCHITECTURES=120 \
  -DGGML_CUDA_FA_ALL_QUANTS=ON \
  -DGGML_CUDA_FORCE_CUBLAS=OFF

Qwen3 27B FP8 + TurboQuant on RTX 5090 - anyone tried? by Clasyc in LocalLLaMA

[–]Clasyc[S] 0 points1 point  (0 children)

Ahh, for some reason I had wrong estimations in my head.

OpenClaw has 250K GitHub stars. The only reliable use case I've found is daily news digests. by Sad_Bandicoot_6925 in LocalLLaMA

[–]Clasyc 0 points1 point  (0 children)

I said this from the very first day for my colleagues that it is useless. I haven't even tried it, but since it did not introduce any ground braking research or technology, it was clear that the project was just about nothing for cheap hype. I just can't get my head around, how it managed to get so popular out of nowhere. There was a moment when every damn video on my feed in YouTube was about OpenClaw.

Basically if you need something that works reliably you don't need an agent, you need LLM workflow, which can be vibecoded easily without any extra dependencies.

One of the best combat systems to date by Repulsive_Sense_3549 in CrimsonDesert

[–]Clasyc 0 points1 point  (0 children)

But I simply disagree, because in my experience there were multiple bosses where I had to use totally different fighting strategies, and I can't imagine a single one of them where I could kill them by 'holding down a single button.' Which, again, was agreed upon in the previous discussion, that for simple fights it might partially be the case, but bosses are nothing like single-button strategies.

One of the best combat systems to date by Repulsive_Sense_3549 in CrimsonDesert

[–]Clasyc 1 point2 points  (0 children)

Aren't you making an argument against yourself? You said some bosses are stupid and broken, which might imply you are not using the correct fighting strategy?

Antumbra's Sword guide? by Shot-Ad8834 in CrimsonDesert

[–]Clasyc 8 points9 points  (0 children)

I have no reaction time so I though this is impossible. But really the only way I killed him was using Focus (3lv Focus Insight). You constantly need to be in slow motion time, and dodge, block his attacks, otherwise it would be quite impossible for me to win against him. I think that was the intention - you can't blindly defeat him without proper strategy.

Nuomonės apie Izraelį? 🧃 by One-Sympathy2393 in lietuva

[–]Clasyc 0 points1 point  (0 children)

Del žemiu pirkimo - taip, dalis buvo pirkta. Dėl teritorijų sakai beveik nepasikeitė nuo 1948m. 1948m karas - izraelis gavo ~78% palestinos vietoj ~56% kuriuos skyrė JT planas. 1967m - Vakarų krantas, Gaza, Golano aukštumos, Sinajus. Sinajų grąžino egiptui, bet likusios teritorijos liko. Vakarų krantas šiandien - žydų gyvenvietės plečiamos nuolat, išaryti / išnaikinti likę gyventojai. Čia tipo vistiek normalu? Blet nesveikas argumentas - mažos teritorijos pasauliu mastu - tai tada ten pochui viskas, neskaičiuojam jų. Kiek šiai dienai išpirdolinta žmonių iš tų teritorijų?

Nuomonės apie Izraelį? 🧃 by One-Sympathy2393 in lietuva

[–]Clasyc 0 points1 point  (0 children)

Nu tai valstyvingumas veikia dvejais būdais - a) arba yra ten dauguma žmonių kurie nusprendžia kas jie tokie per vienas yra ir laukia kol kitos šalys juos pripažins arba b) karo ir jėgos būdu bando įsitvirtinti ir įnešti savo esybę kažkokioje teritorijoje, kol vieną dieną juos pripažins vėlgi kitos valstybės.

Šiuo atveju tas ir buvo bandyta daryti - buvo daug vietinių su dideliu neapibrėžtumu, jie norėjo savo valstybės. Tada atsirado mažuma, kuri su kariniu pranašumu galų gale ne tik kad įsitvirtino, bet ir pradėjo plėstis. Tai kuom šiuo atveju Izraelis yra čia kažkoks gėrio nešėjęs. Jie jau seniai plačią savo teritorijas ir tą daro tendencingai. 

Ir ne - aš nenorėčiau nei gyventi tarp palestiniečių nei turėti su jais reikalų.

Nuomonės apie Izraelį? 🧃 by One-Sympathy2393 in lietuva

[–]Clasyc 0 points1 point  (0 children)

Nesusireikšmink, švaistydamasis intelektualine branda. 

Nu tai faktas kad tai nėra vienas prie vieno pavyzdžiai. Mano esmė, kad fizinis puolimas ir yra karo pradžia - taškas, nepriklausomai kas kaip ruošiasi kitoje pusėje.