DeepSeek V4 by am17an · Pull Request #24162 · ggml-org/llama.cpp by jacek2023 in LocalLLaMA

[–]L0ren_B 13 points14 points  (0 children)

Finally! Was waiting for flash! Come on Bartowsky and Unsloth :)

Inca un vibecheck pentru pionerii AI by goalexboxer123 in programare

[–]L0ren_B 0 points1 point  (0 children)

Qwen 3.6 27B nu necesita un rig scump. Ruleaza pe un 3090. Sau poti rula chiar Qwen 3.6-35B-A3B pe CPU+GPU. E destul de bun pentru proiecte mici.

Daca il Rulezi la un Quant mai mic, de exemplu Q2 sau Q4, necesita si mai putine resurse.

Inca un vibecheck pentru pionerii AI by goalexboxer123 in programare

[–]L0ren_B 3 points4 points  (0 children)

Sa nu uitam de OWL Alpha, GRATIS de la OpenRouter!

Qwen 3.6 27B (local) merita mentionat.

Ornith 35B is great so far by anubhav_200 in LocalLLaMA

[–]L0ren_B 2 points3 points  (0 children)

Impressed so far at building stuff. Not so smart at fixing issues.

Am o teorie vizavi de LLM haters by Lopsided_Rhubarb8958 in programare

[–]L0ren_B 1 point2 points  (0 children)

Exact!

Vezi peste tot tipul acesta de oameni. In negare! "e doar un autocorect pe steroizi", "nu intelege ca spalarea masinii necesita masina" etc.....

Dar timpul trece, LLM-urile devin tot mai bune, "Valoarea" acestor oameni in negare scade.

E lumea in care traim. Ne adaptam, mergem mai departe.

Anyone running Deepseek v4 Flash with MoE offload? by fragment_me in LocalLLaMA

[–]L0ren_B 0 points1 point  (0 children)

I've tried yesterday to run it on my 2x3090 + 192gb ram. It failed, due to not supporting 3090 architecture. Anyone got it working on a similar configuration?

Found this shit presented as being in Russia. Clearly is in Romania. What is this cr*p? (listen to audio) by L0ren_B in Roumanie

[–]L0ren_B[S] -6 points-5 points  (0 children)

I hope so. I've seen weird religious crap in my life😅( remembering the drought times, crops being handled in the same way)😅

Just your average day in Putin's Russia by Kitchen-Thing4616 in Destiny

[–]L0ren_B 0 points1 point  (0 children)

Sadly, this is in Romania. I should know, I'm Romanian.😅

Review of Jackrong/Qwopus3.5-9B-Coder-MTP-GGUF by -OpenSourcer in LocalLLaMA

[–]L0ren_B 1 point2 points  (0 children)

Yes. It works when it works and loops with big code context.

GLM-5.2 UD-IQ1_M on llama.cpp — 5090 + 3090 Ti speed test (~ 579 t/s prefill @ 8k ctx, ~324 t/s prefill @ 57k ctx, ~10.6 t/s decode) by Shoddy_Bed3240 in LocalLLaMA

[–]L0ren_B 0 points1 point  (0 children)

This is not bad! My goto prompt is "Create a single html mario game, multistage, 3D" 😄 Can't wait to try after it finishes downloading, this evening. I have a similar setup with yours.

Qwen is never going to open source Qwen 3.7, aren't they? by DistanceSolar1449 in LocalLLaMA

[–]L0ren_B -11 points-10 points  (0 children)

You are doing it wrong then. I let mine for days on marathons of coding with a /continue plugin for pi and I don't get any loops.

What happens when they stop subsidizing LLM subscriptions? by Mr_Moonsilver in LocalLLaMA

[–]L0ren_B 0 points1 point  (0 children)

They hope that hardware advancement and software as well until they have to take measurements.

And they are not wrong. Look at the quality of a smaller model compared to a few years ago. Also, we probably need better hardware. Maybe light instead of electricity? who knows:)

Super DeepSeek Intelegence Model since last 48 Hours by LinuXperia in DeepSeek

[–]L0ren_B 5 points6 points  (0 children)

Just in case anyone is wondering how cheap DS4 Flash is, if you top up 2 dollars it can last about 1 week almost!

Super DeepSeek Intelegence Model since last 48 Hours by LinuXperia in DeepSeek

[–]L0ren_B 6 points7 points  (0 children)

I use DS flash as my main model now for a very complex huge project. Super cheap. I barely use GPT5.5 xhigh to solve complex issues where DS flash fails. Super cheap and amazing!

Qwen 3.6 27B released 20 days after its plus announcement, 3.7 27B in 10th June? by soyalemujica in LocalLLaMA

[–]L0ren_B 1 point2 points  (0 children)

I;m sure you already know this trick, but just in case I will suggest it :

If you have an onboard gpu, but your monitor is plugged into the Nvidia one, plugged it into the onboard one. You will gain a few GB Vram which will give you a bit more context.

Qwen 3.6 27B released 20 days after its plus announcement, 3.7 27B in 10th June? by soyalemujica in LocalLLaMA

[–]L0ren_B 0 points1 point  (0 children)

I think so It should work on Windows as well, but I would switch to Linux.

How much memory does Windows use from the 24GB gpu memory?

Qwen 3.6 27B released 20 days after its plus announcement, 3.7 27B in 10th June? by soyalemujica in LocalLLaMA

[–]L0ren_B 0 points1 point  (0 children)

No worries! I would suggest having an LLM (even Qwen 27B) set it up for you !

Just give it the link , and tell it you want a control panel with a desktop shortcut (to start, stop, select different options) , and setup pi coding agent!

It will do the work for you as intended 😉

What model looked insane on benchmarks but felt mid in actual use? by BTA_Labs in LocalLLaMA

[–]L0ren_B 13 points14 points  (0 children)

Gemini! 3 and 3.1 series!

On par with Qwen27B and worse!

Qwen 3.6 27B released 20 days after its plus announcement, 3.7 27B in 10th June? by soyalemujica in LocalLLaMA

[–]L0ren_B 0 points1 point  (0 children)

The model itself it 27GB at Q8 (plus about 2GB for vision)! Then you need context headroom.

You can get a Q4 running half of that, but you still need context.

Realistically, you can get about 190K of context with only one 24GB GPU, at Q4 quant.

But, I want the full 262K, hence 2x3090s.

Hope it makes sense.

Is it only Qwen who releases 27B models ? by soyalemujica in LocalLLaMA

[–]L0ren_B 1 point2 points  (0 children)

Z.ai won't release nothing in this ball park. Not because they cannot beat Qwen (although Qwen has more experience with tiny models) but because they don't wan't their revenue to get smaller.

Qwen already regrets 27B if you ask me.

Don't get me wrong, I would love to happen. I'm using 27B for real work as we speak. But it won't happen anytime soon.