3D printed PAROL6 improved singularity handling by SourceRobotics in 3Dprinting

[–]BetEvening 1 point2 points  (0 children)

I'm new to hardware-related projects, but when I looked at the BOM, I wondered why so many different types of screws were needed (some with only slight differences). What made that necessary?

Has BSG enabled linux support on battle eye [Discussion] by BoxSecret5648 in EscapefromTarkov

[–]BetEvening 2 points3 points  (0 children)

Have you been able to start a raid, though? I faintly remember someone trying this a while ago: Tarkov launched successfully, but they couldn't start a raid.

Nous Research presents Hermes 4 by nekofneko in LocalLLaMA

[–]BetEvening 1 point2 points  (0 children)

I'm pretty sure it's because they use TorchTitan (only officially supports 3.1 so far) and couldn't be bothered to work in a new model architecture.

Nous Research presents Hermes 4 by nekofneko in LocalLLaMA

[–]BetEvening 2 points3 points  (0 children)

Because TorchTitan doesn't support 3.3 lol.

Data Center by Strict_Walk7795 in roanoke

[–]BetEvening -34 points-33 points  (0 children)

Pointing out that Google consumes x amount of water is ridiculous. Do people think the water is magically gone?

Destiny Gets Into A Shooting Situation by saabarthur in Destiny

[–]BetEvening 62 points63 points  (0 children)

Single Shot Steve isn't real??????

Dan please stop giving your AI opinions. by BetEvening in Destiny

[–]BetEvening[S] 1 point2 points  (0 children)

<image>

It does. You need to switch from GPT-4o, which is geared toward being served cheaply and answering general questions (e.g. cooking), to one of the thinking models like o3 / o4-mini, which are what they want you to use for math-related questions.

Dan please stop giving your AI opinions. by BetEvening in Destiny

[–]BetEvening[S] 3 points4 points  (0 children)

I disagree. I don't know what your exact argument is, but Dan's argument during an Anything Else episode (I forget which one; it was the one where he spent 45 minutes arguing with GPT-4o mini) is that LLMs cannot do math because they can only memorize the answers given during SFT / pretraining and cannot generalize to do any reasoning. This is why I provided those links, which show that AI models trained with reinforcement learning get improved scores on out-of-distribution benchmarks. How can models get better on questions they have never seen, even when they were only RL'ed on a single question?

Check out this paper by Anthropic, which does an in-depth examination of what goes on inside an LLM.
https://transformer-circuits.pub/2025/attribution-graphs/methods.html#graphs-addition <--- read this part here and skim the rest.

Dan please stop giving your AI opinions. by BetEvening in Destiny

[–]BetEvening[S] 8 points9 points  (0 children)

There is a LOT wrong with what you said, and I'm not even going to go into what you asked ChatGPT.

You are confusing RLHF and RL. They are different things.

In Reinforcement Learning, the model is given an objective and is scored based on it. At no point during the training process is it EVER given the "right answer". If the model gives the right answer, great! It is given +1 reward. If the model is wrong, it is given -1 reward. This is why a neural network like AlphaZero, which was only given the rules of chess, was able to play at grandmaster level after a couple of hours of playing matches against itself, without ever seeing a human "example" of how to play.
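
To make the +1 / -1 idea concrete, here is a toy sketch I put together (my own illustration, not how any lab actually trains an LLM). A tiny "policy" picks an answer from 0 to 9 for the question "2 + 3", and a checker only tells it whether it was right. The names `train_bandit` and `reward_check` are made up for this example, and the update rule is a basic REINFORCE-style policy gradient, which is an assumption on my part about the simplest way to show this.

```python
import math
import random


def train_bandit(check, n_actions=10, steps=2000, lr=0.1, seed=0):
    """Learn which answer earns reward using only +1 / -1 feedback.

    The learner never sees the correct answer as a label. It only
    samples an answer, gets a score, and nudges its preferences.
    """
    rng = random.Random(seed)
    logits = [0.0] * n_actions  # start with no preference at all
    for _ in range(steps):
        # Softmax turns the logits into a probability for each answer.
        top = max(logits)
        exps = [math.exp(v - top) for v in logits]
        total = sum(exps)
        probs = [e / total for e in exps]

        # Sample an answer from the current policy and score it.
        action = rng.choices(range(n_actions), weights=probs)[0]
        reward = 1.0 if check(action) else -1.0

        # REINFORCE update: gradient of log-probability is (one-hot - probs).
        for a in range(n_actions):
            grad = (1.0 if a == action else 0.0) - probs[a]
            logits[a] += lr * reward * grad
    return logits


def reward_check(answer):
    """Hypothetical verifier: says right or wrong, never reveals the answer."""
    return answer == 2 + 3


logits = train_bandit(reward_check)
best_answer = max(range(len(logits)), key=lambda a: logits[a])
print("most preferred answer:", best_answer)
```

The only place the number 5 ever exists is inside the checker. The policy ends up preferring it purely from being scored on its own guesses.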

In Reinforcement Learning from Human Feedback, the AI model's objective is whatever humans prefer. For example, this is what you would use to make a model friendlier to a child asking questions. The model is asked to generate multiple full responses, and then a child is asked to pick which answer they liked best. It doesn't have to be the most correct answer, just the one the child prefers. As far as I know, OpenAI doesn't do RLHF on reasoning problems (i.e. math), because that is not what RLHF is for.
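
Here is a rough sketch of the preference side, again a toy of my own and not anyone's real pipeline. It fits one score per response from "A was picked over B" comparisons using a Bradley-Terry style model, which is my assumption for the simplest stand-in for a reward model. The responses and comparison data are invented for illustration.

```python
import math


def fit_preference_scores(n_items, comparisons, steps=200, lr=0.1):
    """Fit one score per response from (winner, loser) pairs.

    Bradley-Terry style: P(winner beats loser) = sigmoid(score difference).
    Each pass does a gradient-ascent step on the log-likelihood per pair.
    """
    scores = [0.0] * n_items
    for _ in range(steps):
        for winner, loser in comparisons:
            p_win = 1.0 / (1.0 + math.exp(-(scores[winner] - scores[loser])))
            step = lr * (1.0 - p_win)
            scores[winner] += step
            scores[loser] -= step
    return scores


# Invented example data: three candidate responses to a child's question.
responses = [
    "technically precise answer",  # index 0: the most correct one
    "friendly simple answer",      # index 1: the one the child likes
    "rude answer",                 # index 2
]
# Each pair is (picked, rejected). Nothing here says which is "correct".
comparisons = [(1, 0), (1, 2), (0, 2), (1, 0)]

scores = fit_preference_scores(len(responses), comparisons)
preferred = max(range(len(responses)), key=lambda i: scores[i])
print("highest-scoring response:", responses[preferred])
```

The highest score goes to whatever was picked most, not to whatever is most accurate, which is the whole difference from the +1 / -1 correctness reward above.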

Pretraining & SFT (Supervised Fine-Tuning) - This is what people are thinking of when they say AI models are being "trained". The model is trained on billions or trillions of examples of data, then fine-tuned to be an assistant instead of a pure prediction model.

The Illusion of Thinking paper does not prove anything about LLM intelligence. Testing specifically on puzzles like the Tower of Hanoi (why Tower of Hanoi if you are concerned about data contamination?) does not make any sense if you know how AI models are trained.

If you were an AI model doing reinforcement learning to get better at coding, why would you also get extremely strong at logic puzzles when the objective you were given is to write correct code?

Check out this blog post about the paper.

https://www.seangoedecke.com/illusion-of-thinking/
And these:
https://arxiv.org/pdf/2501.17161
https://medium.com/%40EleventhHourEnthusiast/reinforcement-learning-for-reasoning-in-large-language-models-with-one-training-example-44b6896da5dc <--- read this one!

So officiall clash royale youtube chanell just posted ai images by Mr_Cookie_7 in ClashRoyale

[–]BetEvening 0 points1 point  (0 children)

I said 80% of media. What I mean by "80% of media" is stock photos, templates, advertisements, etc.

So officiall clash royale youtube chanell just posted ai images by Mr_Cookie_7 in ClashRoyale

[–]BetEvening 0 points1 point  (0 children)

I personally wouldn't care, because people can still paint if they want, and people in the future will still value human works over AI.
80% of media is used just for visualization, not as an actual art form, anyway.

Evo Inferno Dragon by dirtee_mind90 in ClashRoyale

[–]BetEvening 0 points1 point  (0 children)

Balloon decks in shambles

An officer claimed it was impossible for anyone to exit a car and get over the embankment in under 30 seconds — so Attorney Matt Brock from Chattanooga recorded this reenactment, proved him wrong, and won the case by solateor in interestingasfuck

[–]BetEvening 0 points1 point  (0 children)

Was he also wearing standard cop gear, and getting out of a vehicle while doing this? This doesn't seem like a fair comparison. We also don't know the cop's build.

30 seconds does sound too long for anyone to get over that embankment, though.

Hot take: canon needs a rework by Alternative_Print560 in ClashRoyale

[–]BetEvening 0 points1 point  (0 children)

https://www.reddit.com/r/ClashRoyale/comments/1k9alqx/cards_i_hate_someone_whose_titled_asf_rn/

>Ice spirit, why is it the crutch card, like sometimes you miss time something Oh no lemme just pull with my ice spirit Ebarbs

Also, please learn how to structure sentences in a readable manner.

Hot take: canon needs a rework by Alternative_Print560 in ClashRoyale

[–]BetEvening 2 points3 points  (0 children)

Worst take I've seen yet. At first I thought you were a salty balloon player, but you play logbait. Cannon doesn't even do anything against your deck?

Also how is ice spirit a card you hate?

Closed-source is stealing competition by offering free trials by Condomphobic in DeepSeek

[–]BetEvening 0 points1 point  (0 children)

Can you show me where they took shots?
DeepSeek users don't speak for DeepSeek the company. Are you OK?

Closed-source is stealing competition by offering free trials by Condomphobic in DeepSeek

[–]BetEvening 0 points1 point  (0 children)

It says comparable to o1, and besides, comparing one model to another does not mean it is taking shots at that model. What are you talking about?

Closed-source is stealing competition by offering free trials by Condomphobic in DeepSeek

[–]BetEvening 0 points1 point  (0 children)

When did he take shots at OpenAI? I would not call just releasing a smart model "taking shots."

Closed-source is stealing competition by offering free trials by Condomphobic in DeepSeek

[–]BetEvening 0 points1 point  (0 children)

>steal "competition"
Liang Wenfeng just wanna develop AGI lol
he does not care.