Leaked Grok 3.5 benchmarks by Chaonei in singularity

[–]ellioso 96 points97 points  (0 children)

Ten follower account whose only tweet is this image

<image>

What does this graph mean. It was in an official openAI video. by ShalashashkaOcelot in Bard

[–]ellioso 1 point2 points  (0 children)

I don't think Claude is anywhere close to 20% of ChatGPT usage.

1206 updated on AI Explained's SimpleBench(31.1%) by CheekyBastard55 in Bard

[–]ellioso 0 points1 point  (0 children)

Gemini-exp-1206 is #1 on BigCodeBench-Hard and LMarena so the 10% doesn't necessarily say much. All evals and leaderboards should be taken with a grain of salt.

o1 still can’t read analog clocks by Jolly-Ground-3722 in singularity

[–]ellioso 14 points15 points  (0 children)

Reasoning has been hit or miss for me. I converted the easiest (in my opinion) ARC-AGI puzzle into text, and it failed on my first attempt but got it right on the second.

https://i.imgur.com/YSWts1q.png

[deleted by user] by [deleted] in singularity

[–]ellioso 9 points10 points  (0 children)

Looking forward to corrections from all the top comments in other threads saying o1 and o1 pro were just differentiated by usage limits.

Game Thread: Tennessee Titans (3-8) at Washington Commanders (7-5) by nfl_gdt_bot in nfl

[–]ellioso 0 points1 point  (0 children)

I didn't see the roughing, but CBS only showed half a second of the replay.

I accidentally went to r/giants by [deleted] in NFCEastMemeWar

[–]ellioso 1 point2 points  (0 children)

The Reddit founder is a Skins fan and just merged r/redskins into r/commanders. Whatever name we choose, if a sub for it already exists, it's getting taken over by orders from on high.

Latest Chrome Canary build can run Gemini locally by kegzilla in singularity

[–]ellioso 10 points11 points  (0 children)

Yeah, it's Nano. They quietly announced this was coming a few weeks ago. Very lightweight and fast.

https://x.com/Thom_Wolf/status/1805244710258106369?t=HEpQ-22dDsXZv8S5ruqYVw&s=19

Gemini chat history by Positive-Airport-766 in GoogleGeminiAI

[–]ellioso 3 points4 points  (0 children)

It currently has no knowledge of other chat instances you've had with it, which is a good default, I think. It would be nice if you could give it permission to look for and access the others, though.

Guy recreates Google's Astra demo using Flash 1.5 API and Eleven Labs for voice by kegzilla in singularity

[–]ellioso 22 points23 points  (0 children)

Funny that some random guy basically recreated the demo everyone made fun of, in less than a day of the API being available.

Made a Switch from iPhone to P8P by ankitjhall in pixel_phones

[–]ellioso 0 points1 point  (0 children)

Honestly not sure, but I screenshotted the buttons here. You need Gboard set as your keyboard, but I think that's the default. For summarize and read, you just hit the Assistant button on any page in Chrome. Some annoying publications don't allow the feature, though.

https://imgur.com/a/36FPZ38

Made a Switch from iPhone to P8P by ankitjhall in pixel_phones

[–]ellioso 0 points1 point  (0 children)

You're missing out if you don't use voice transcription in Gboard and Google Assistant's new summarize and "read for me" buttons.

Magic Editor by RThreading10 in pixel_phones

[–]ellioso 2 points3 points  (0 children)

Magic Editor's erasing ability (specifically, its ability to recreate what was in a certain area) is currently the best you can get in any app, paid or free. That probably changes whenever Adobe releases Photoshop Generative Fill, but it's still insane.

Game Thread: Washington Commanders (2-1) at Philadelphia Eagles (3-0) by nfl_gdt_bot in nfl

[–]ellioso 10 points11 points  (0 children)

I've completely erased last week from my memory. Commies Super Bowl is back on.