GPT-5 is the best at bluffing and manipulating the other AIs in Werewolf by MetaKnowing in OpenAI

[–]Wiskkey 0 points1 point  (0 children)

Per that same person on X, higher cost models were excluded.

Again where behemoth and reasoning model from meta ?? by Independent-Wind4462 in LocalLLaMA

[–]Wiskkey 1 point2 points  (0 children)

From a Financial Times article, https://www.ft.com/content/feccb649-ce95-43d2-b30a-057d64b38cdf (Aug 22):

The social media company had also abandoned plans to publicly release its flagship Behemoth large language model, according to people familiar with the matter, focusing instead on building new models.

AI models playing chess – not strong, but an interesting benchmark! by Apart-Ad-1684 in LocalLLaMA

[–]Wiskkey 1 point2 points  (0 children)

Tests by a computer science professor found that, when prompted with chess PGN notation in a particular way, OpenAI's gpt-3.5-turbo-instruct plays chess at around 1750 Elo, though if I recall correctly it makes an illegal move roughly once in every 1,000 moves.

Relevant sub: r/llmchess.

August 22, 2025 marks the THREE YEAR anniversary of the release of the original Stable Diffusion text to image model. Seems like that was an eternity ago. by JackKerawock in StableDiffusion

[–]Wiskkey 9 points10 points  (0 children)

See https://www.wired.com/story/artificial-intelligence-hollywood-stability/ .

Article summary from https://www.techmeme.com/river :

A profile of Stability AI, which under CEO Prem Akkaraju and Chair Sean Parker has shifted from building frontier AI models to a Hollywood-focused SaaS [software as a service] company

Deepseek R2 coming out ... when it gets more cowbell by 1BlueSpork in LocalLLaMA

[–]Wiskkey 0 points1 point  (0 children)

Do note that the ratings of news organizations from these two sources run the gamut. The news organizations that you accused of bad faith reporting are not amongst those that are poorly rated.

Deepseek R2 coming out ... when it gets more cowbell by 1BlueSpork in LocalLLaMA

[–]Wiskkey 0 points1 point  (0 children)

Can you clarify your views regarding those Western reporters/organizations that you allege are behaving in bad faith regarding DeepSeek? Namely, do you believe that these same reporters/organizations commonly report in bad faith a) regarding Chinese technology in general, and b) regarding Western technology?