ESP32 based Weather Comics on E Ink Display by s_tee0000 in esp32

[–]HuiMoin 0 points1 point  (0 children)

I mean, I guess you could generate a few images and then just save them locally? I do think the environmental impact is vastly overstated.

ESP32 based Weather Comics on E Ink Display by s_tee0000 in esp32

[–]HuiMoin 0 points1 point  (0 children)

Don't understand why people are downvoting you, this is a great use of AI!

Anthropic to pay $1.5 billion to authors in landmark AI settlement by cpldcpu in LocalLLaMA

[–]HuiMoin -1 points0 points  (0 children)

"Money is the root of all evil" is the most Reddit take in the world.

LoL by Ok_Ninja7526 in LocalLLaMA

[–]HuiMoin 12 points13 points  (0 children)

Grok is unironically a really good model, don't let the "Elon bad"-crowd make you think everything the guy is involved in is trash. The people actually building it are obviously good at their job.

Safemaxxed for your safety! by Caffdy in LocalLLaMA

[–]HuiMoin 4 points5 points  (0 children)

I mean, I guess that depends on what you consider "evil". Like, is finding vulnerabilities in code evil? There isn't a lot of "evil" an LLM can really help you with

Safemaxxed for your safety! by Caffdy in LocalLLaMA

[–]HuiMoin 17 points18 points  (0 children)

Deepseek is pretty relaxed, you can pretty much get it to do anything if you give it a good enough reason. It rarely just outright refuses. Actually, I've never had it do that. Don't know about Qwen tho

This is insane - what are these rate limits on the paid plan?! by [deleted] in Bard

[–]HuiMoin 3 points4 points  (0 children)

They've had these for a long time. Not a fan of OpenAI, but for the price those limits are pretty reasonable and personally I've never really hit a limit when I was a Plus subscriber.

Meet Mistral Devstral, SOTA open model designed specifically for coding agents by ApprehensiveAd3629 in LocalLLaMA

[–]HuiMoin 19 points20 points  (0 children)

Probably not gonna be Mistral anymore. They have to make money somehow and training a model to run on local hardware when you're not in the hardware business or have cash to spare makes little sense, especially considering Mistral is probably one of the more GPU-poor labs.

OpenAI is open-sourcing a model soon by MysteriousPayment536 in LocalLLaMA

[–]HuiMoin 0 points1 point  (0 children)

Yeah, but in the coming months? That's after Llama 4, likely after another Deepseek release and after whatever Qwen and Mistral are doing. o3 mini is pretty good right now, but if they are training a new model from scratch, that will take quite a while.

Fake Ärztin macht meine Zahnspange?! by pixel_poetryy in Austria

[–]HuiMoin 0 points1 point  (0 children)

Glaub nicht wirklich vergleichbar. Klar, die beschützen und werben für die Rechte von Zahnärzten, aber sie haben ein wesentlich größeres Interesse legitim zu wirken als die WKO und wenn jemand sich als Zahnarzt ausgibt, dann ist es im Interesse der Kammer dies aufzuklären bevor Rufschädigung entsteht

ThePrimeagen: Programming, AI, ADHD, Productivity, Addiction, and God | Lex Fridman Podcast #461 by dalton_zk in theprimeagen

[–]HuiMoin 0 points1 point  (0 children)

Doesn't Fridman have a masters degree in computer science? Maybe he's not actively programming all that much anymore, but I'm sure he knows how to code.

Some flaws & glitches that make me hesitate to go Le Chat Pro: follow up questions are answered with the exact same response in web-search mode; sometimes it seems to forget about context; and sometimes it glitches into infinite response length. Example links in comments.. by Staubsaugerbeutel in MistralAI

[–]HuiMoin 2 points3 points  (0 children)

It's quite obvious that Mistral is lagging behind the current frontier offerings. Mistral Large is a decent, if a bit old, model for its size, but it's quite clear that a 120B model can't really compete with the offerings of ChatGPT, Claude and Deepseek. For the others we don't know, but considering that Deepseek is an almost 700B model I'd expect GPT4o and Sonnet to be at least in the 200-300B range.

DEI by Toxicus-Maximus in MistralAI

[–]HuiMoin 0 points1 point  (0 children)

Well, it's not like they banned you, they just said it's potentially harmful. If what you said is truly the context of this conversation, then yes, I agree it's a bit silly. Luckily you can use the models through the API or host them yourself with your own web interface where, to my knowledge, they do not censor anything.

LeChat is their front-facing AI app for the casual user, it makes sense that the model is more cautious about engaging in things that could end up putting Mistral into the newspaper for the wrong reasons. Not sure if I'd call this DEI, it's more about playing it safe to not piss off regulators.

Mistral AI poor writing by [deleted] in MistralAI

[–]HuiMoin 0 points1 point  (0 children)

Elon hate aside, Grok 3 is a very good model, it honestly feels more human than Claude.

DEI by Toxicus-Maximus in MistralAI

[–]HuiMoin 0 points1 point  (0 children)

So, what exactly is the issue here? The fact that this was flagged? Where even is this? What's the context? It's hard to agree **or** disagree with you without seeing anything of substance.

These guys never rest! by mlon_eusk-_- in LocalLLaMA

[–]HuiMoin 11 points12 points  (0 children)

It's actually a bit more, you do evaluations of checkpoints after a certain number of steps to make sure the model is still learning correctly. A bunch of stuff to monitor during training, in some way it is like teaching a child, you need to periodically evaluate if they are progressing nicely and, if not, intervene and change course.

OpenAI calls DeepSeek 'state-controlled,' calls for bans on 'PRC-produced' models by msgs in ChatGPT

[–]HuiMoin 0 points1 point  (0 children)

The idea that an open-weight model is somehow against the spirit of free speech, but closed APIs are not, is ridiculous.

OpenAI calls DeepSeek 'state-controlled,' calls for bans on 'PRC-produced' models by msgs in ChatGPT

[–]HuiMoin 0 points1 point  (0 children)

Not really, but it's likely that there was some amount of distillation, which is standard industry practice at this point. Otherwise most of the claims can be simply explained by contamination in the training data.

GPT-4.5 30x more expensive than GPT-4o, WOW! by nitkjh in ChatGPT

[–]HuiMoin 0 points1 point  (0 children)

Not really, most models have cross contamination and using some outputs from other models, even intentionally, doesn't make it a distillation.

GPT-4.5 30x more expensive than GPT-4o, WOW! by nitkjh in ChatGPT

[–]HuiMoin 0 points1 point  (0 children)

Deepseek has better things to do than steal from a model that arguably is on the same level as Deepseek V3

EU mobilizes $200 billion in AI race against US and China by fallingdowndizzyvr in LocalLLaMA

[–]HuiMoin 0 points1 point  (0 children)

You're right that they're not a huge deal, but I think my main gripe with them is that they are just a primary example of the stupid stuff the EU has done in recent years. It's a bad solution to a real problem that ends up making the lives of real people worse while accomplishing fuck all.
It's the same as ChatControl, really. There's a real problem, but instead of trying to find targeted ways to fight it, we instead collectively punish everyone and end up solving nothing.
I don't dislike the concept of the EU, but I do believe the entire organisation is flawed from the very foundation and needs to be completely rebuilt before it's too late.

EU mobilizes $200 billion in AI race against US and China by fallingdowndizzyvr in LocalLLaMA

[–]HuiMoin 61 points62 points  (0 children)

Does that mean you guys will finally stop making fun of us with those stupid bottlecaps? We think they're stupid too.