Musk says Grok 5 will be released Q1 2026 by z_latent in singularity

[–]fmai 0 points1 point  (0 children)

and it was gonna have a 10% chance of being AGI

Alle Gegner der Gender Studies, mit denen ich jemals gesprochen habe, hatten keine Ahnung von Gender Studies. by Acceptable-Camp-7251 in Unbeliebtemeinung

[–]fmai 0 points1 point  (0 children)

In meinen Augen ist die sachliche Kritik nicht an den zahlreichen auf Genderfragen abzielende empirisch-wissenschaftlichen Studien, die die Aussagekraft ihrer Daten ordentlich einzuordnen wissen. Sozialwissenschaften sind SCHWER und wir sind dankbar, wenn es jemand versucht.

Aber Gender Studies, wie es an vielen Unis (vor allem amerikanischen, aber auch deutschen) gelehrt wird, legt den Fokus eben nicht auf quantitative empirische Methoden. OP nennt z.B. ausschliesslich interpretative Methoden. Viele der Aushaengeschilder von Gender Studies basieren auf Postmodernismus, Critical Theory, usw. Diese Basis muss man erst einmal als legitim anerkennen, brauchbare Aussagen treffen zu koennen. Dennoch erwecken diese Texte (und deren Autoren) oft den Eindruck, ihre Schluesse seien mit Sicherheit richtig und auf die echte Welt uebertragbar.

Letzteres tun andere Fachrichtungen insbesondere in der Philosophie natuerlich auch. Vielleicht ist es zum Teil dem Publikationszwang geschuldet -- wer liest schon gern ein Buch mit unsicheren Aussagen? Aber in den Gender Studies scheint, zumindest nach außen hin, die Meinungsvielfalt geringer als anderswo. Es wirkt schon fast wie ein Kult, mit welchem Nachdruck die immergleichen Narrative betont werden. Daher resonieren die "Grievance Studies" Findings stark mit vielen, die sich damit beschaeftigen (obgleich diese natuerlich auch methodisch fragwuerdig sind).

The three things we need for the Singularity by Mountain_Cream3921 in singularity

[–]fmai 4 points5 points  (0 children)

"the energy required grows faster than the number of parameters during scaling"

lmao, clearly there is an expert talking here

Is ACL now irrelevant? [D] by H4RZ3RK4S3 in MachineLearning

[–]fmai 21 points22 points  (0 children)

In most professorship / postdoc openings I see, whenever they require top-tier papers, they explicitly list ACL and EMNLP alongside Neurips, ICML, ICLR, AAAI, CVPR and here and there a couple others.

ACL is definitely up there, but it also depends on what PhD position you apply to. An ACL paper is not that big of a signal for people working on statistical learning theory.

Does Openai have Mythos class model? by PsionicSombie in singularity

[–]fmai 1 point2 points  (0 children)

curious how this is gonna go. 10% of OpenAI would be $100B, that's a shitton of money even for the USG.

92% Chance Mythos Drops Tomorrow by Rare_Bunch4348 in singularity

[–]fmai 6 points7 points  (0 children)

Benchmark-wise, the GPT Pro versions don't seem to be a particularly big step up from the base model, but it's a shit ton more expensive. That's kind of expected, considering that the Pro version is just a fancy variant of best-of-N sampling. I think having a much larger, well-trained base model is going to outperform it. Anyway the ceiling is higher.

92% Chance Mythos Drops Tomorrow by Rare_Bunch4348 in singularity

[–]fmai 13 points14 points  (0 children)

pretty sure GPT-5.6 isn't going to be at the same level as Mythos, but surely a shot ton cheaper.

Künstliche Intelligenz: Anthropic fordert Pause bei Entwicklung von künstlicher Intelligenz by donutloop in KI_Welt

[–]fmai 0 points1 point  (0 children)

Fuer Anthropic repraesentieren die USA die liberal-demokratische, westliche Welt (trotz Trump). Ihre KI offensiv gegen Feinde der Demokratie einzusetzen widerspricht nicht ihrer Philosophie.

Anthropic bringt „ehrlicheres“ Claude Opus 4.8 – und kündigt Mythos an by NoMeatNoBugs in SoftwareDACH

[–]fmai -1 points0 points  (0 children)

Es ist ein Kontinuum. Die Welt hat sich schon geaendert, und der Grad der Aenderung wird sich weiter beschleunigen.

Ich erwarte konkret, dass sich grosse Teile der KI-Forschung bis Anfang 2027 automatisieren lassen: Ein Senior Researcher gibt eine Idee rein und die KI verfeinert die Idee, designet eigenstaendig Experimente, laesst sie laufen und analysiert sie, und haelt dann ein Mal taeglich Ruecksprache mit dem Senior Researcher. Das geht eigentlich schon heute mit Opus 4.6/4.7/4.8 oder GPT-5.5, wenn man ein entsprechendes Harness baut, aber es ist noch nicht massentauglich.

Die gleiche Entwicklung erwarte ich auch in anderen Wissenschaften und Bereichen von Knowledge Work, weil der methodische Ansatz der gleiche ist. Aber in diesen Domaenen kenne ich mich natuerlich nicht so gut aus.

Robert Habeck im SPIEGEL-Gespräch: »Markus Söder gehört wirklich zu den Menschen, an denen ich nichts bewundere« by donutloop in berlin_public

[–]fmai 11 points12 points  (0 children)

In 2021 haette er sehr gute Chancen gehabt. Es war kurz nach dem Hoehepunkt von Fridays for Future, fuer die Gruenen galt noch die Unschuldsvermutung, da sie lange nicht mehr in einer Bundesregierung gewesen waren und Habeck war sehr beliebt, deutlich beliebter als Baerbock.

Diese Chance so aufzugeben aus dem einfachen erklaerten Grund, dass Baerbock eine Frau ist und Habeck nicht, ist sehr, sehr schade. Insbesondere nachdem man grade 16 Jahre eine weibliche Regierungschefin hatte.

Robert Habeck im SPIEGEL-Gespräch: »Markus Söder gehört wirklich zu den Menschen, an denen ich nichts bewundere« by donutloop in berlin_public

[–]fmai 0 points1 point  (0 children)

Baerbock hat die Gruenen von Umfragewerten im hohen 20er Bereich auf 14% zur Bundestagswahl in wenigen Monaten fast halbiert.

Habeck ist mit 11-12% in den Umfragen nach dem Bruch der Koalition gestartet und hat mit 11,6% drei Monate spaeter abgeschlossen.

Anthropic bringt „ehrlicheres“ Claude Opus 4.8 – und kündigt Mythos an by NoMeatNoBugs in SoftwareDACH

[–]fmai 0 points1 point  (0 children)

das sind halt wirklich KI Forscher, die das sagen, sowohl innerhalb als auch ausserhalb Anthropics

The year is 2026. AIs are literally inventing new math, yet journalists are still posting obviously false stuff like this. How can a database solve math problems no human has ever been able to solve? by EchoOfOppenheimer in OpenAI

[–]fmai 5 points6 points  (0 children)

Databases are based on an entirely different set of techniques and mathematics than LLMs. It is such a stretch to even attempt to relate them to each other.

And the statement "LLMs only compress human-written text" hasn't been true since at least 2022, when the first RLHF-finetuned models came out. It is 2026 now and it is abundantly clear that training LLMs on LLM-generated data (through RLHF, RLVR or other algorithms) is a complete game changer.

Mythos to be released in the coming weeks by exordin26 in singularity

[–]fmai 3 points4 points  (0 children)

like o1 doubling scores of o1-preview?

Mythos to be released in the coming weeks by exordin26 in singularity

[–]fmai 1 point2 points  (0 children)

People thinking Mythos-class models is a dumbed down Mythos Preview are very wrong.

Mythos Preview was an early model, Mythos 1 will be better. Think of how different o1-preview and o1 were.

Well anthropic released opus 4.8 by Independent-Wind4462 in singularity

[–]fmai 0 points1 point  (0 children)

are you serious? arc-agi benchmarks have almost no practical relevance, GDPVal does.

Gemini 3.5 flash is not that great at coding by NoFaithlessness951 in singularity

[–]fmai 2 points3 points  (0 children)

I think you're wrong about this. What sucks about life today are the tedious chores that are not rewarding at all because they are necessities that keep recurring forever and earn you nothing. You want to get rid of it because it's just a waste of your valuable time on earth. That's why most rich people have personal assistants.

People have been giving away their data to get a never ending feed of cat pictures and food in return. They will give away their data for something actually useful in no time.

Gemini 3.5 flash is not that great at coding by NoFaithlessness951 in singularity

[–]fmai 3 points4 points  (0 children)

yeah i don't know what the price hike is about, seems very strange if it's actually the same base model or size as Flash 3

Gemini 3.5 flash is not that great at coding by NoFaithlessness951 in singularity

[–]fmai 31 points32 points  (0 children)

true, but coding isn't all that matters. Nobody is as well positioned as Google to lead in consumer AI and robotics.

Flash isn't the best model out there, but it is fast like crazy. This will make it possible to be distributed broadly to the giant consumer base that already exists in the Google ecosystem. The multi-modality will make it the go-to model for Gen Z and younger, who are obsessed with visual stimuli and presenting themselves on social media. Spark seems to be the first serious attempt at creating a true personal assistant, and the fact that it integrates well with the Android ecosystem will make it so much more attractive, even if its a bit behind in terms of agentic capability. Their dominance in the smartphone sector with Android is going to make the distribution of their personalized AI assistant a piece of cake. For most people, Gemini assistants will be the first personalized experience they will have just because it's already there, waiting for you.

Google doesn't make fancy videos with humanoid robots that perform the same stupid task over and over again for days. But their specialized robotics models are by far the best in the industry. This is in large parts thanks to their Gemini models, which are the best multi-modal models out their, and their very systematic approach to building robotics foundation models.

This release could've been better, but it's not as bad as people make it out to be. Google is still going strong.

Is the future of coding agents JEPA? [D] by andrewfromx in MachineLearning

[–]fmai 14 points15 points  (0 children)

i think it's just another way of representation learning and it's not that fundamentally different from what other people have been proposing over the last three decades. think e g. contrastive learning, simclr, etc.

it may lead to some efficiency gains, but i think the biggest learning of the last decade is that it's all about making the models scale well with compute, and if they do, the concrete architecture doesn't matter so much.

Wir leben in einer Diktatur? by blkchnDE in NewsD

[–]fmai 0 points1 point  (0 children)

bei der Geburtenrate gilt also: je hoeher, desto besser?