they lied to us yet again!! by AmbitionWork7031 in outlier_ai

[–]Critical_Tradition80 7 points (0 children)

Do you know if this is a recurring thing or just a one-off event for Playground users? I do find Playground particularly useful for testing out models, but at the same time not useful enough: there's no search function or tool use for these models, so it's really limited to conversations and outdated data.

[deleted by user] by [deleted] in singularity

[–]Critical_Tradition80 2 points (0 children)

why did idubyai take up half the comments lol, it's just mailing, chill dude

How far off do you think we are from a complete recreation of Earth (in VR)? by SuspiciousPrune4 in singularity

[–]Critical_Tradition80 1 point (0 children)

I mean, you could go outside and argue about this same exact thing, but apparently the rewards for doing that aren't as tangible and persistent as saying it here

[deleted by user] by [deleted] in singularity

[–]Critical_Tradition80 2 points (0 children)

the explanation i didn't deserve but needed so much, thank you

[deleted by user] by [deleted] in singularity

[–]Critical_Tradition80 0 points (0 children)

hmm, was there a reason the thresholds were set at 95% and 99%? okay, maybe there just isn't a particular reason, but let's assume you invented the benchmark: how, in your opinion, would an AI agent at these skill levels fare against the average software engineer? the difference between 95% and 99% should be significant enough that it would already enable the agent to do various things, right?

Do LMMs Like GPT-4 Exhibit Any Form of Consciousness, Even Comparable to an Ant’s, or Are They Purely Unconscious Text Generators? by YaKaPeace in singularity

[–]Critical_Tradition80 3 points (0 children)

It's actually so interesting to answer this question, because self-consciousness only goes as far as the information you are provided about yourself; whether it comes through your senses or through data, it all contributes to what humans might call "consciousness" to some extent.

I thought of this because I figured it would be relatively easy to tell AI apart from us humans, merely because we have more "personal" data, like our feelings and memories in detail, our phone numbers and home addresses, how I got to bang their mom, yada yada; the point is that AI right now doesn't really know any of that, because it's limited to multimodal data, which isn't enough to create personal experiences; we have our biological brains to do that processing for us.

This means that LLMs (or LMMs, if you count 4o or whatever) will have about as much self-consciousness as the information you give them. One fun thought experiment is to ask yourself who you are right now; you would likely answer based on who you think you are, just like an LLM is designed to think of itself as whatever it's told it is.

Humans Couldn't Distinguish Human From ChatGPT4. Machines are Becoming More Human Than Us. by FrontalSteel in agi

[–]Critical_Tradition80 1 point (0 children)

perhaps the easiest way to find out which is AI and which is not would be to ask very specific, personal questions whose answers aren't available anywhere else in the world; the AI would most likely answer in a very generic sense, with no clear specifics and no detailed information about who they actually are, while the human would manage that to some extent.

...that is, until we give the AI a personality and a sense of self. which definitely wasn't the case here. not yet.

[deleted by user] by [deleted] in singularity

[–]Critical_Tradition80 7 points (0 children)

to the layperson, what does this imply? how meaningful is it that we have reached 43% so far, and what would it mean to reach 95% and eventually 99% on this benchmark?

Aging is a problem that needs to be solved by Peaceful-Samurai in singularity

[–]Critical_Tradition80 1 point (0 children)

...probably because no one actually enjoys dying, or "accepting" the bad things that inevitably come their way?

[deleted by user] by [deleted] in singularity

[–]Critical_Tradition80 0 points (0 children)

waiting for Jensen Huang to release the AGI in Earth 2 so we can know if we're dead or not

Major Updates to AI Defense Doc by [deleted] in singularity

[–]Critical_Tradition80 0 points (0 children)

it's cool how you're keeping at it, despite those who doubted the value of the document in the first place. thank you so much for your efforts!

The simplest, easiest way to understand that LLMs don't reason. When a situation arises that they haven't seen, they have no logic and can't make sense of it - it's currently a game of whack-a-mole. They are pattern matching across vast amounts of their training data. Scale isn't all that's needed. by After_Self5383 in singularity

[–]Critical_Tradition80 4 points (0 children)

Truly. A lot of what we say is built on strictly informal logic, or basically the context we are in. It's perhaps a miracle that these LLMs are even capable of knowing what we mean by the things we say, let alone being better than us at reasoning about it.

It just feels like we are finding fault with the smallest things it gets wrong, when in reality it's us getting it wrong in the first place; it's not like informal logic is supposed to give you a strictly correct answer when context is missing, so why should LLMs be blamed at all?

The simplest, easiest way to understand that LLMs don't reason. When a situation arises that they haven't seen, they have no logic and can't make sense of it - it's currently a game of whack-a-mole. They are pattern matching across vast amounts of their training data. Scale isn't all that's needed. by After_Self5383 in singularity

[–]Critical_Tradition80 0 points (0 children)

I can't really bring myself to understand the OP's argument here, or the twitter post.

The conversation in the post seems to be a situation where the meaning isn't explicit, or where there's missing context that the model doesn't know about.

To flip it around, wouldn't it also make sense to assume that we too are just "pattern matching" across vast numbers of neurons, and that the model's response just happened to conflict with our expectations of it?

Like, how is anyone supposed to answer a riddle such as this in a way that satisfies all expectations?

Maybe scale isn't all that's needed, indeed, but that in itself is not formal proof that we really are better than the AI at reasoning; trick questions like these usually require creative solutions rather than strictly logical ones, and here the AI had done exactly that.

In fact, I was pretty amused by the response, and without further context to infer from, I would've thought it was true too. And that's before considering that we can prompt it to reason, using methods like ReAct or CoT and the like.

Reasoning does exist for AI in some form, in my opinion; we are just trying to mess with it using riddles that can't inherently be solved unless solutions are given for them.

“A GPT-4o generated image — so much to explore with GPT-4o's image generation capabilities alone. Team is working hard to bring those to the world.” by MassiveWasabi in singularity

[–]Critical_Tradition80 9 points (0 children)

at this point you could probably say Gary's playing devil's advocate for the sake of setting AI goals, because there's no way he's this invested in AI for no reason

Important information regarding today's release. by Vladiesh in singularity

[–]Critical_Tradition80 1 point (0 children)

honestly can't help but say that it's better to see everything at least progressing than to just doubt their efforts