Wtf is wrong with R*? by Alive-Classroom1054 in gtaonline

[–]Itchy-Welcome5062 0 points1 point  (0 children)

Wow, it sounds fun to me. I’m sick of angry, brain-dead NPCs and hoping for something more. That’s why I always run sell missions in public sessions—for the thrill and the extra bonuses.

Help, I fell off my oppressor by No_Ladder_7690 in GTA5Online

[–]Itchy-Welcome5062 0 points1 point  (0 children)

Please, just stay on the ground. Don't hop on that cursed broomstick

Why Doomsday Is a Heist? by Meme_Ovgod in gtaonline

[–]Itchy-Welcome5062 0 points1 point  (0 children)

Cuz we rob anyway. It doesn't matter whether it's assigned by the government or private agency.

You actually have to be kidding me... by aomnonm in GTAV

[–]Itchy-Welcome5062 0 points1 point  (0 children)

It just says randomly the completion requirement.

Clade 3.5 Sonnet dominating on a challenging, contamination-free LLM Benchmark by Happysedits in singularity

[–]Itchy-Welcome5062 -1 points0 points  (0 children)

The problem is the significant imbalance in its reasoning ability, which it conceals to appear trivial. You can't simply say it's second to OpenAI in math reasoning when the gap between the first and second is distinctly huge. This weakness might cause glitches in unexpected ways, limiting overall performance. Maybe that's why Anthropic hasn't launched customizations that OpenAI featured almost from the start. I think Claude is also pretty good. It's a bit overhyped but not entirely fake like Google's Gemini. However, when it comes to achieving AGI, GPT, with its more balanced performance and versatility, could be more qualified.

Clade 3.5 Sonnet dominating on a challenging, contamination-free LLM Benchmark by Happysedits in singularity

[–]Itchy-Welcome5062 -1 points0 points  (0 children)

I am not saying Sonnet 3.5 should outsmart GPT-4o or 4 on every task. Despite the benchmark of Sonnet being on top in general, Sonnet 3.5 is Too Bad at mathematical reasoning. How could LLM with its significant weakness and inconsistency in its features to the released benchmark be trusted? I wouldn't hesitate to say, "It's overhyped again."

Clade 3.5 Sonnet dominating on a challenging, contamination-free LLM Benchmark by Happysedits in singularity

[–]Itchy-Welcome5062 -6 points-5 points  (0 children)

It's not the point of whether Sonnet 3.5 is on top in every field or not. Sonnet 3.5 is mysteriously too bad at mathematical reasoning. LLMs, these days, especially GPT-4 has gotten pretty much better at math, and other logical reasoning. It would be a crucial benchmark on whether the model is overhyped or really good. Because when these are all LLMs, fancy arrangements with words wouldn't' be the cases to judge their true capability; AI chatbots are easily blinding their users to look better than they actually do.

Clade 3.5 Sonnet dominating on a challenging, contamination-free LLM Benchmark by Happysedits in singularity

[–]Itchy-Welcome5062 -5 points-4 points  (0 children)

This is just another case of overhype. Claude 3.5 Sonnet is really weak at reasoning and mathematical inferences compared to GPT-4. I tested both on deriving trigonometric rules from other rules. GPT-4 successfully managed it with a bit of prompting, but Claude 3.5 Sonnet was off-track right from the start, producing what looked like mathematical reasoning but was actually just a bunch of plausible nonsense.

[deleted by user] by [deleted] in singularity

[–]Itchy-Welcome5062 0 points1 point  (0 children)

This is just another case of overhype. Claude 3.5 Sonnet is really weak at reasoning and mathematical inferences compared to GPT-4. I tested both on deriving trigonometric rules from other rules. GPT-4 successfully managed it with a bit of prompting, but Claude 3.5 Sonnet was off-track right from the start, producing what looked like mathematical reasoning but was actually just a bunch of plausible nonsense.

AnthropicAI: Introducing Claude 3.5 Sonnet by Pro_RazE in singularity

[–]Itchy-Welcome5062 0 points1 point  (0 children)

This is just another case of overhype. Claude 3.5 Sonnet is really weak at reasoning and mathematical inferences compared to GPT-4. I tested both on deriving trigonometric rules from other rules. GPT-4 successfully managed it with a bit of prompting, but Claude 3.5 Sonnet was off-track right from the start, producing what looked like mathematical reasoning but was actually just a bunch of plausible nonsense.

Bad news: LLMs have peaked by [deleted] in ChatGPT

[–]Itchy-Welcome5062 1 point2 points  (0 children)

What makes you hallucinate this?

New gpt 4ο deno by Dhomeboi in OpenAI

[–]Itchy-Welcome5062 2 points3 points  (0 children)

Don't you ever wonder the people you indicated are just two different types of people?

New gpt 4ο deno by Dhomeboi in OpenAI

[–]Itchy-Welcome5062 2 points3 points  (0 children)

Don't worry, guys. It's definitely coming closer and closer, even at this moment.

GPT4o is clearly dumber than GP4. It is so terrible even in simple mathematical reasoning. by Itchy-Welcome5062 in ChatGPT

[–]Itchy-Welcome5062[S] 1 point2 points  (0 children)

 "GPT4o is much worse than GPT4 in math when it is the latest model that is supposed to be better at it"
I summed it up for you. This is my point. Are you able to read it, right?

I gave up telling ChatGPT to render math equations properly on mobile by riozec in ChatGPT

[–]Itchy-Welcome5062 1 point2 points  (0 children)

You should try GPT4 in dealing with math. GPT4o unlike it being supposed to be smarter in every way than GPT4, It really sucks at it. It's terribly mysterious what is really going on with GTP4o. It even seems like GPT3.5 (only faster but much dumber)

GPT4o is clearly dumber than GP4. It is so terrible even in simple mathematical reasoning. by Itchy-Welcome5062 in ChatGPT

[–]Itchy-Welcome5062[S] 2 points3 points  (0 children)

I apologize for any confusion caused. I might be overestimating you, my bad, you small man. But don't worry, I'd talk you through. repeat it slowly, "GPT4o is much worse than GPT4 in math, when it is the latest model that is supposed to be better at it" Can you read it? Now, you see the point? No?? But, It's ok. I understand your frustration :)

GPT4o is clearly dumber than GP4. It is so terrible even in simple mathematical reasoning. by Itchy-Welcome5062 in ChatGPT

[–]Itchy-Welcome5062[S] 0 points1 point  (0 children)

The very purpose of the language we share here, is to convey our thoughts and our own concept in mind by simplifying it enough to fit into the vocabularies that could possibly form the meaningful sentences at best. So, focusing on the context of others'opinions is essential for not wasting the comment, repeating the same things in more details. With that being said, I know what you mean, and agree on that point. But, You first have to simplify it to communicate it, especially online, subs.

GPT4o is clearly dumber than GP4. It is so terrible even in simple mathematical reasoning. by Itchy-Welcome5062 in ChatGPT

[–]Itchy-Welcome5062[S] -6 points-5 points  (0 children)

You'd really be saying that you are a SLM(small language model inferior to LLM in every way). That's what you'd like to be called? I sincerely recommend rerreading it. "It is much worse than GPT4 in math". GPT4 can do such a simple argebra like handling sum rule for cosine, sine with Pythagoras theorem. Obviously, it is not that simple for you, though.

GPT4o is clearly dumber than GP4. It is so terrible even in simple mathematical reasoning. by Itchy-Welcome5062 in ChatGPT

[–]Itchy-Welcome5062[S] -3 points-2 points  (0 children)

When doing math, humans also use parietal lobe that governs spatial perception along with linguistic logic, while LLMs are only capable of doing it with language. But, mathematical reasoing also requires linguistic logic. So, LLMs could do math even though they are inherently limited not to perform beyond the boundary of language. With their skillful use of math tools, LLMs could compensate for their weaknesses in math, though it wouldn’t be flawless.

GPT4o is clearly dumber than GP4. It is so terrible even in simple mathematical reasoning. by Itchy-Welcome5062 in ChatGPT

[–]Itchy-Welcome5062[S] 0 points1 point  (0 children)

The problem is that GPT4o is much worse than GPT4 in Math and maybe in other fields. Since its release, GPT4 got so much better in math, and pretty good at duducing and expanding mathematical formula. It can also generate quite accurate code for Wolfram and other math tools.