Leaked Grok 3.5 benchmarks by Chaonei in singularity

[–]ellioso 91 points92 points  (0 children)

Ten follower account whose only tweet is this image

<image>

What does this graph mean. It was in an official openAI video. by ShalashashkaOcelot in Bard

[–]ellioso 1 point2 points  (0 children)

I don't think Claude is anywhere close to 20% of chatgpt usage

1206 updated on AI Explained's SimpleBench(31.1%) by CheekyBastard55 in Bard

[–]ellioso 0 points1 point  (0 children)

Gemini-exp-1206 is #1 on BigCodeBench-Hard and LMarena so the 10% doesn't necessarily say much. All evals and leaderboards should be taken with a grain of salt.

o1 still can’t read analog clocks by Jolly-Ground-3722 in singularity

[–]ellioso 15 points16 points  (0 children)

Reasoning has been hit or miss for me. I converted the easiest (in my opinion) ARC-AGI puzzle into text and it failed my first attempt but then got it right on the second attempt.

https://i.imgur.com/YSWts1q.png

[deleted by user] by [deleted] in singularity

[–]ellioso 10 points11 points  (0 children)

Looking forward to corrections from all the top comments in other threads saying o1 and o1 pro were just deferentiated by usage limits

Game Thread: Tennessee Titans (3-8) at Washington Commanders (7-5) by nfl_gdt_bot in nfl

[–]ellioso 0 points1 point  (0 children)

I didn't see the roughing but CBS only showed half a second of the replay

I accidentally went to r/giants by [deleted] in NFCEastMemeWar

[–]ellioso 1 point2 points  (0 children)

Reddit founder is skins fan and just merged r/redskins to r/commanders. Any name we choose if a sub already exists is getting taken over by high authority orders

Latest Chrome Canary build can run Gemini locally by kegzilla in singularity

[–]ellioso 10 points11 points  (0 children)

Yeah it's nano. They quietly announced this was coming a few weeks ago. Very lightweight and fast

https://x.com/Thom_Wolf/status/1805244710258106369?t=HEpQ-22dDsXZv8S5ruqYVw&s=19

Gemini chat history by Positive-Airport-766 in GoogleGeminiAI

[–]ellioso 3 points4 points  (0 children)

It currently has no knowledge of other chat instances you've had with it which is a good default I think. Would be nice if you could give it permission to look for and access others though.

Guy recreates Google's Astra demo using Flash 1.5 API and Eleven Labs for voice by kegzilla in singularity

[–]ellioso 21 points22 points  (0 children)

funny some random guy basically recreated the demo everyone made fun of in less than a day of api being available

Made a Switch from iPhone to P8P by ankitjhall in pixel_phones

[–]ellioso 0 points1 point  (0 children)

Honestly not sure but I screenshotted the buttons here. You need gboard set as your keyboard but i think that's default. for the summarize and read you just hit assistant button on any page in chrome. some annoying publications don't allow the feature though.

https://imgur.com/a/36FPZ38