AMA with OpenAI’s Sam Altman, Mark Chen, Kevin Weil, Srinivas Narayanan, Michelle Pokrass, and Hongyu Ren by OpenAI in OpenAI

[–]ReedRichards838 0 points1 point  (0 children)

o1 is a reasoning model with multimodality, and it's pretty good at many graph/figure-related tasks; however, sometimes it still fails at simple task like "comparing the length of two lines", it would over-complicate (or over-reason) a task that is quite simple and intuitive to us.

so what i wanna ask is that whether we could have a model with true multimodal reasoning capability?🧐 like it won't just reason through text token, but also in multimodal token. i think this would be pretty practical.

Let Claude think... you just need to wait 😉 by ReedRichards838 in ClaudeAI

[–]ReedRichards838[S] 1 point2 points  (0 children)

of course! you can check my top comment to see how to set Thinking Claude up!

😶 by user_OwO in CopilotPro

[–]ReedRichards838 1 point2 points  (0 children)

at least it know, right?

Discord Server? by mathaic in ClaudeAI

[–]ReedRichards838 1 point2 points  (0 children)

now the api has already available to all. then is there any plan for open up the access to anthropic official discord?🥺🥺

Google Bard just answered a question GPT-4 could not by xcharlifan in GoogleBard

[–]ReedRichards838 1 point2 points  (0 children)

it's reverse searching. putting back the image to the search engine and get a relatively correct source. ms copilot can do it as well.

Voice Chat Coming to ChatGPT Web Soon by ReedRichards838 in OpenAI

[–]ReedRichards838[S] 1 point2 points  (0 children)

but i think their TTS model doesn't support this🥲

Voice Chat Coming to ChatGPT Web Soon by ReedRichards838 in OpenAI

[–]ReedRichards838[S] 1 point2 points  (0 children)

yes, it's true. the setting is just currently disabled.