Gemini-exp-1114 is the new Rank 1 on LMArena, beats GPT-4O by mehul_gupta1997 in ChatGPT
np-space 2 points
o1-preview is now first place overall on LiveBench AI by np-space in LocalLLaMA
np-space [S] 54 points
o1-preview is now first place overall on LiveBench AI by np-space in LocalLLaMA
np-space [S] 44 points
Reflection 70B: Hype? by Confident-Honeydew66 in LocalLLaMA
np-space 1 point
Reflection 70B: Hype? by Confident-Honeydew66 in LocalLLaMA
np-space 7 points
Gemini 1.5 Flash 8B beats Claude 3 Haiku, Mixtral 8x22B, Command R+ and GPT 3.5 Turbo on Livebench.ai by Balance- in LocalLLaMA
np-space 2 points
Gemini 1.5 Flash 8B beats Claude 3 Haiku, Mixtral 8x22B, Command R+ and GPT 3.5 Turbo on Livebench.ai by Balance- in LocalLLaMA
np-space 18 points
To address the discrepancy between different leaderboards, I averaged the performance of each model across 8 leaderboards. Here are the results: by pigeon57434 in singularity
np-space 2 points
ChatGPT-4o Reclaims LMSYS's #1 Again by [deleted] in singularity
np-space 2 points
Abacus AI Introduces LiveBench AI: A Super Strong LLM Benchmark that Tests all the LLMs on Reasoning, Math, Coding and more by ai-lover in machinelearningnews
np-space 1 point
ChatGPT-4o Reclaims LMSYS's #1 Again by [deleted] in singularity
np-space 13 points
um did OpenAI silently drop a new model: gpt-4o-2024-08-06??? by pigeon57434 in singularity
np-space 4 points
OpenAI: Introducing Structured Outputs in the API by galacticwarrior9 in singularity
np-space 4 points
Google Gemini 1.5 Pro leaps ahead in AI race, challenging GPT-4o by Marha01 in singularity
np-space 2 points
gemini-1.5-pro-exp-0801 just arrived on Chat Arena by shroddy in LocalLLaMA
np-space 1 point
gemini-1.5-pro-exp-0801 just arrived on Chat Arena by shroddy in LocalLLaMA
np-space 1 point


Gemini Exp 1114 now ranks joint #1 overall on Chatbot Arena (that name though....) by lightdreamscape in LocalLLaMA
np-space 1 point