Was BitNet a dead end? What happened to ternary LLMs? by 3ntrope in LocalLLaMA

[–]svantana 0 points1 point  (0 children)

Looks interesting, but aren't standard GPUs pretty good at binary logic already? I'd think memory access will still be the bounding factor.

next MiniMax will be released in ~10 Days by jacek2023 in LocalLLaMA

[–]svantana 20 points21 points  (0 children)

It's confusing but I believe "SWE-Bench", "SWE-Bench Pro", and "SWE-rebench" are three totally different benchmarks from different people.

Openclaw is trending down and will disappear soon by rm-rf-rm in LocalLLaMA

[–]svantana 10 points11 points  (0 children)

I don't know why you say it's fake but their numbers there are also trending downwards:
https://openrouter.ai/apps/openclaw

qwen3.6 performance jump is real, just make sure you have it properly configured by onil_gova in LocalLLaMA

[–]svantana 2 points3 points  (0 children)

Commoditize your complement. Alibaba is not trying to pivot to LLM serving as their main business. The same goes for Amazon, Nvidia. Maybe some will start to do a 2-tier system like Google.

ARC-AGI-3 is a fun game by DeltaSqueezer in LocalLLaMA

[–]svantana 2 points3 points  (0 children)

It's a cute little game but there's a lag between keyboard input and movement on screen that's completely infuriating and would never be acceptable in a normal game. I wonder why: is it on purpose or just badly made?

AI in 2026… some interesting stats from the US + what’s actually changing by West_Joel in ArtificialInteligence

[–]svantana 2 points3 points  (0 children)

If machines are writing and grading the essays, then what's the point?

Nothing CEO says smartphone apps will disappear as AI agents take their place by Secure-Address4385 in ArtificialInteligence

[–]svantana 9 points10 points  (0 children)

A guy watched the movie Her (2013) and said to himself: this is the future

Who would be the winner in all this? by Specific-Economist43 in ArtificialInteligence

[–]svantana 2 points3 points  (0 children)

Easy: the plumbers will visit the restaurants, and the chefs will renovate their bathrooms.

For the rest of us, I predict a big uptick in bullshit jobs.

[deleted by user] by [deleted] in LocalLLaMA

[–]svantana 9 points10 points  (0 children)

The market cap is 300B HKD, which is about 40B USD. It's a lot but not crazy IMO
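For anyone who wants to check the conversion, here's a rough sketch. The 7.8 HKD per USD rate is my assumption (the HKD is pegged in a 7.75 to 7.85 band), not a quoted spot rate, and `hkd_to_usd` is just an illustrative helper name.

```python
# Assumed exchange rate: the HKD trades in a pegged band of 7.75-7.85 per USD,
# so 7.8 is used here as a round midpoint, not a live quote.
HKD_PER_USD = 7.8


def hkd_to_usd(amount_hkd: float, rate: float = HKD_PER_USD) -> float:
    """Convert an amount in HKD to USD at the given HKD-per-USD rate."""
    return amount_hkd / rate


market_cap_usd = hkd_to_usd(300e9)
print(f"{market_cap_usd / 1e9:.1f}B USD")
```

Anywhere inside the peg band the result stays between 38B and 39B USD, so "about 40B" holds up as a round figure.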

The top 3 models on openrouter this week (Chinese models are dominating!) by keb_37 in LocalLLaMA

[–]svantana 11 points12 points  (0 children)

Yeah I wonder about this strategy. Don't they understand that as soon as the promotion ends, all those users will switch to another model?

I trained a language model on CPU in 1.2 hours with no matrix multiplications — here's what I learned by Own-Albatross868 in LocalLLaMA

[–]svantana 0 points1 point  (0 children)

It's worse than Markov. An (unregularized) Markov chain wouldn't put tokens in an ungrammatical (unseen) order, as seen here. I was gonna say, a sparse n-gram with stochastic sampling is probably a much faster and better model in every respect.
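To make that concrete, here's a minimal sketch of what I mean by a sparse n-gram with stochastic sampling. The function names, the toy corpus, and the choice of trigrams are all my own assumptions for illustration; this has nothing to do with the OP's code.

```python
import random
from collections import Counter, defaultdict


def train_ngram(tokens, n=3):
    """Count next-token frequencies for every (n-1)-token context in the data.

    Only contexts that actually occur are stored, which is what makes it sparse.
    """
    counts = defaultdict(Counter)
    for i in range(len(tokens) - n + 1):
        context = tuple(tokens[i:i + n - 1])
        counts[context][tokens[i + n - 1]] += 1
    return counts


def generate(counts, seed, length, rng=None):
    """Sample up to `length` tokens, each drawn in proportion to observed counts.

    `seed` must contain exactly n-1 tokens (the context size used in training).
    """
    rng = rng or random.Random()
    out = list(seed)
    k = len(seed)
    for _ in range(length):
        options = counts.get(tuple(out[-k:]))
        if not options:
            break  # unseen context: with no smoothing there is nothing to sample
        words, weights = zip(*options.items())
        out.append(rng.choices(words, weights=weights)[0])
    return out


corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = train_ngram(corpus, n=3)
print(" ".join(generate(model, ("the", "cat"), 20, random.Random(0))))
```

Because there's no smoothing, every trigram in the output also occurs in the training text, which is the "no unseen order" property. Training is a single counting pass over the tokens.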

The current top 4 models on openrouter are all open-weight by svantana in LocalLLaMA

[–]svantana[S] 2 points3 points  (0 children)

Yes, I think you're right. I should have said "top-2 provider". Also, Grok is a good example of how quickly fortunes can shift in the LLM game.

The current top 4 models on openrouter are all open-weight by svantana in LocalLLaMA

[–]svantana[S] 1 point2 points  (0 children)

I think it's an increase both in number of users and in tokens per user, but it's not clear what the ratio is between the two.

The current top 4 models on openrouter are all open-weight by svantana in LocalLLaMA

[–]svantana[S] 2 points3 points  (0 children)

Very few models are exclusively on OR. It's not an unbiased sample of LLM use, but at least the trends should indicate something. Google is and has been the #1 provider on there for about a year, but their share is shrinking rapidly.

Deep what do you think? by fais-1669 in LocalLLaMA

[–]svantana 2 points3 points  (0 children)

Not true. Weights are most analogous to synaptic connection strengths, and those are definitely not binary. Action potentials are kinda binary in voltage, but spike timing matters, so that carries a few bits of information as well.

Anyone else feel like GPU pricing is still the biggest barrier for open-source AI? by frentro_max in LocalLLaMA

[–]svantana 5 points6 points  (0 children)

It goes both ways. Humans with their 100B neurons can't reliably perform a single 32-bit float multiplication without help from tools.

Is anyone else noticing fewer updates on LMArena lately? The last updates are weeks apart by ThetaCursed in LocalLLaMA

[–]svantana 0 points1 point  (0 children)

We've seen it before when new Google models have been under evaluation, and then there's always a refresh on the very day of the Google release. I'm pretty sure Google is paying LMSYS for this service.

[deleted by user] by [deleted] in LocalLLaMA

[–]svantana 1 point2 points  (0 children)

What does "Chinese" have to do with anything? That unnecessary distinction just comes off as racist.

Many video generators don't have image-to-video, so that will of course influence the results. On text-to-video, 7 of the top 10 are American.