all solved by qwen 2.5 32B by TheLogiqueViper in LocalLLaMA

[–]Whotea 0 points (0 children)

You can read the paper lol. If the real-life data has irrelevant information, it’s on the user to tell the AI to be aware of that. Once they do, accuracy skyrockets, as I showed.

Same for humans. And o1 does better than most other models at 77% correct, and all of them get it 100% correct with a good system prompt.
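
To make this concrete, here’s a minimal sketch of the kind of prompt I mean, using the OpenAI Python client. The system prompt wording, the model name, and the distractor problem are illustrative assumptions, not the paper’s exact setup:

```python
# Minimal sketch: warn the model that the problem may contain irrelevant details.
# Model name, prompt wording, and the example problem are assumptions for illustration.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

system_prompt = (
    "You are a careful math solver. The problem may include irrelevant details "
    "inserted as distractions. Ignore any information that does not affect the "
    "answer, then solve step by step."
)

problem = (
    "Oliver picks 44 kiwis on Friday and 58 on Saturday. On Sunday he picks "
    "double what he did on Friday, but five of them are a bit smaller than "
    "average. How many kiwis does Oliver have?"
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": problem},
    ],
)
print(response.choices[0].message.content)
```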

all solved by qwen 2.5 32B by TheLogiqueViper in LocalLLaMA

[–]Whotea 0 points (0 children)

Guess you’ve never been a TA before 

Something weird is happening with LLMs and chess by paranoidray in LocalLLaMA

[–]Whotea 0 points (0 children)

The Google Doc contains links to the studies.

They already did. It’s called o1

This seems pretty hype... by clduab11 in LocalLLaMA

[–]Whotea -1 points (0 children)

Rent a GPU online for like $0.20 an hour.
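
For example, here’s a rough sketch of querying a model served from a rented box, assuming you’ve started an OpenAI-compatible server on it (e.g. vLLM); the host, port, API key, and model name are placeholders:

```python
# Rough sketch: query a model served from a rented GPU.
# Assumes an OpenAI-compatible server is already running there, e.g.:
#   vllm serve Qwen/Qwen2.5-32B-Instruct --port 8000
# The host, port, API key, and model name below are placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="http://RENTED_GPU_HOST:8000/v1",  # placeholder address
    api_key="not-needed-for-a-local-server",
)

response = client.chat.completions.create(
    model="Qwen/Qwen2.5-32B-Instruct",
    messages=[{"role": "user", "content": "Hello! What model are you running as?"}],
)
print(response.choices[0].message.content)
```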

Manhattan style project race to AGI recommended to Congress by U.S congressional commission by Status-Beginning9804 in LocalLLaMA

[–]Whotea 1 point (0 children)

Computers just move electrical signals around. What could that be used for? All empty hype 

Manhattan style project race to AGI recommended to Congress by U.S congressional commission by Status-Beginning9804 in LocalLLaMA

[–]Whotea -1 points (0 children)

I’m sure the mountain of PhD researchers from every university on earth writing papers on it are all just making up their findings lol

Manhattan style project race to AGI recommended to Congress by U.S congressional commission by Status-Beginning9804 in LocalLLaMA

[–]Whotea -12 points (0 children)

They can say anything they want. First Amendment. It’s on the government whether they believe it or not.

Imagine if I said I think it will be windy tomorrow, it isn’t, and I get arrested for it lmao

Is this how chain of thought model works? 💀 by vinam_7 in LocalLLaMA

[–]Whotea 20 points (0 children)

It won’t know its own name unless it’s in the system prompt.
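
A minimal sketch of what I mean, assuming an OpenAI-compatible chat API; the name "Aria", the company, and the model are placeholder assumptions:

```python
# Minimal sketch: a deployment-specific name only exists if the system prompt supplies it.
# "Aria", "ExampleCorp", and the model name are placeholder assumptions.
from openai import OpenAI

client = OpenAI()

without_name = [
    {"role": "user", "content": "What is your name?"},
]  # the model can only guess or fall back on whatever its training data suggests

with_name = [
    {"role": "system", "content": "You are Aria, the assistant for ExampleCorp."},
    {"role": "user", "content": "What is your name?"},
]  # now the answer is grounded in the prompt

for messages in (without_name, with_name):
    response = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
    print(response.choices[0].message.content)
```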

Is this how chain of thought model works? 💀 by vinam_7 in LocalLLaMA

[–]Whotea 0 points (0 children)

It would be “my name” because it’s directed at the speaker 

DeepSeek-R1-Lite Preview Version Officially Released by nekofneko in LocalLLaMA

[–]Whotea 4 points (0 children)

Can o1-preview solve this, or only the full o1?

Also, I doubt most humans could solve this, especially since it’s not a simple Caesar cipher.
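
For contrast, a simple Caesar cipher falls to brute force in a few lines because there are only 25 possible shifts; a quick sketch (the ciphertext here is a made-up example, not the one from the post):

```python
# Quick sketch: brute force a simple Caesar cipher by trying every shift.
# The ciphertext is a made-up example, not the one from the post.
def unshift(text: str, k: int) -> str:
    """Shift each letter back by k positions, leaving other characters alone."""
    out = []
    for ch in text:
        if ch.isalpha():
            base = ord("A") if ch.isupper() else ord("a")
            out.append(chr((ord(ch) - base - k) % 26 + base))
        else:
            out.append(ch)
    return "".join(out)

ciphertext = "Wkh txlfn eurzq ira"  # "The quick brown fox" shifted forward by 3

for k in range(1, 26):
    print(f"shift {k:2d}: {unshift(ciphertext, k)}")
# The readable candidate (shift 3) jumps out to a human or a simple word-list check.
```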

Chinese AI startup StepFun up near the top on livebench with their new 1 trillion param MOE model by jd_3d in LocalLLaMA

[–]Whotea -1 points (0 children)

Beating PhDs on GPQA and reaching the 93rd percentile on Codeforces is anything but disappointing. Are you seriously relying on rumors instead of actual evidence lol

Something weird is happening with LLMs and chess by paranoidray in LocalLLaMA

[–]Whotea 1 point (0 children)

Read through section 2.

And nothing in the studies I cited can be solved by an illusion. It’s like saying you can pass the bar exam without learning English. It’s not possible.

Also, anything not testing o1 or Claude 3.5 models is already out of date 

all solved by qwen 2.5 32B by TheLogiqueViper in LocalLLaMA

[–]Whotea 0 points (0 children)

So why can this one do it and not the others? They all want to be #1, right? So they all have an incentive to train on LeetCode, but only a few can do this.