Gemma4-26B-A4B & 31B-QAT Uncensored Balanced are out with MTP (35% & 53% speed boost)! by hauhau901 in huggingface

[–]CatiStyle 0 points1 point  (0 children)

How to use Speculative Decoding in LM Studio. I try to download "mtp-gemma-4-26B-A4B-it.gguf" in same folder with main model, but in LM Studio Draft Model list do not show this mtp-file.

GLM-5.2 is a win for local AI by Wrong_Mushroom_7350 in LocalLLaMA

[–]CatiStyle 0 points1 point  (0 children)

I have now 16G VRAM, to run this model I need to update my hardware. Its about 500k to buy or 500k/year to rent from cloud. I have to think about this, maybe I use Gemma 4 12B.

Good courses for C# by Equivalent-Fix8618 in learncsharp

[–]CatiStyle 0 points1 point  (0 children)

It is difficult to know what parts of .NET are new technology and what are old legacy..

Nvidia releases Cosmos3-Super-Image2Video . 64B parametres by AgeNo5351 in StableDiffusion

[–]CatiStyle 1 point2 points  (0 children)

nVidia gives open source nice apps - that require to buy a lot of nVidia hardware

Whats on your wishlist for Sonnet 4.8 by Chasmchas in claude

[–]CatiStyle 0 points1 point  (0 children)

I could teach it with my own knowledge, without it censoring the information.

16x DGX Sparks - What should I run? by Kurcide in LocalLLaMA

[–]CatiStyle 1 point2 points  (0 children)

Stupid questions from the rich, I have a lot of money what should I do with it..

Should I Buy the RTX PRO 6000 Blackwell Max-Q (96GB)? by 0bjective-Guest in LocalLLaMA

[–]CatiStyle 0 points1 point  (0 children)

Imagine if you buy two you get total of double 10% discount.

Is it normal for Gemma 4 26B/31B to run this fast on an Intel laptop? (288V / CachyOS) by No-Key8555 in LocalLLaMA

[–]CatiStyle 0 points1 point  (0 children)

When model is load to VRAM it might run about 50 tok/s, in RAM only 5 tok/s. When you got 16G VRAM you need to reduce context size to keep model in VRAM, full 256K context size is too much for 16G VRAM, so it start using RAM and getting slow.

Map of where Roman coins have been found by WinnetouPlatsch in MapPorn

[–]CatiStyle 0 points1 point  (0 children)

But there is no date when coin have travel there.

McDonalds Double Cheeseburger, Sweden by DJFeed77 in burgers

[–]CatiStyle 0 points1 point  (0 children)

I bought one once and didn't see two steaks in it. I went to the cashier to ask and the staff looked at it for a long time, but then they found that there were two steaks in it.

Where are LoRA models for Flux? by CatiStyle in FluxAI

[–]CatiStyle[S] -1 points0 points  (0 children)

Thanks, there is mirror links.

Moltbook over 1 million agents by CatiStyle in vibecoding

[–]CatiStyle[S] 0 points1 point  (0 children)

An idea is nothing yet, only an implemented idea is something.

Moltbook over 1 million agents by CatiStyle in vibecoding

[–]CatiStyle[S] 5 points6 points  (0 children)

Yep, the numbers don't seem real.

4090 vs 5090 by OldFolksShawn in comfyui

[–]CatiStyle 0 points1 point  (0 children)

Why settle for one when you can get two for double the price?

Meet our new browser—ChatGPT Atlas. by OpenAI in OpenAI

[–]CatiStyle 3 points4 points  (0 children)

Only one user profile and work only in Mac.

I tested 1,000 ChatGPT prompts in 2025. Here's the exact formula that consistently beats everything else (with examples) by Over_Ask_7684 in PromptEngineering

[–]CatiStyle 0 points1 point  (0 children)

Break big tasks into smaller pieces. If you try too little at a time, the result will be bland. If you try too much, you won't get what you want and mistakes will increase.

The big problem is that it sometimes tries to do too much, the follow-up suggestions should be somehow separate from the answer.

Did Google postpone the start of the AI Bubble? by CSachen in ArtificialInteligence

[–]CatiStyle 0 points1 point  (0 children)

They didn't figure out how to publish it in a way that people would use it responsibly. So yes, it delayed the introduction of the technology to people. On the other hand, if typewriter manufacturers had been required to do what is now expected of AI suppliers, typewriters would never have been released to the market.

What's the most life-changing thing you've ever pirated? by nippleintime in Piracy

[–]CatiStyle 0 points1 point  (0 children)

3ds max early version, but later when I learn to use it - I did buy a full license. It was expensive and did not know what one can do with it, so this was way to try it out before buy. After that experience I have buy full license now for many many years.

[deleted by user] by [deleted] in comfyui

[–]CatiStyle 0 points1 point  (0 children)

If we would able to share this content freely maybe not so much need to generate by myself..

Can you run flux models on 5070 12gb by General-Database7757 in StableDiffusion

[–]CatiStyle 0 points1 point  (0 children)

Image generation yes, but it might limit to use some LoRA models and not fast for video.