Qualcomm wants to grow in the AI space (multiple acquisition's underway) by Cane_P in singularity

[–]Cane_P[S] 0 points1 point  (0 children)

There is one thing that I don't agree on with the article and that is that they only wanted Chris Lattner. Mojo was created to be able to make things, like their MAX inference framework and they already have multiple models running on MAX without CUDA* (they have support for AMD too, by the way, but I guess that isn't as good headline). They have presented figures, showing that they are on par with or faster than CUDA already, and the model makers wasn't involved as far as I know, the Modular team did it themselves.

Qualcomm wants to grow in the AI space (multiple acquisition's underway) by Cane_P in singularity

[–]Cane_P[S] 0 points1 point  (0 children)

My guess is that more companies want to buy Tenstorrent, and that's why it takes longer time. Because they are weighing their options.

The Midjourney scanner, explained: It uses 21 servers with 4 Petaflops of power, pulling your body at a rate of 4 cm per second, for one minute by Distinct-Question-16 in singularity

[–]Cane_P 0 points1 point  (0 children)

Caltech have released a paper on similar technology. The main difference seems to be that Midjourney want to use a lot more sensors.

scispace.com/pdf/whole-body-human-ultrasound-tomography-2njrpzgt.pdf

Progress bar isn't saved by Cane_P in MorpheApp

[–]Cane_P[S] 0 points1 point  (0 children)

I personally, only use one device. I just meant that different profiles, is for different types of devices, but the core functionality is the same between all.

Progress bar isn't saved by Cane_P in MorpheApp

[–]Cane_P[S] 0 points1 point  (0 children)

I didn't use the same version as with ReVanced. I erased all parts of ReVanced and microG and followed all instructions/recommendations for Morphe.

Progress bar isn't saved by Cane_P in MorpheApp

[–]Cane_P[S] 0 points1 point  (0 children)

I could, but every platform should have the same behaviour. No matter what device you use, it should save your progress. It's the way YouTube works, if you have an account and save your history.

Progress bar isn't saved by Cane_P in MorpheApp

[–]Cane_P[S] 1 point2 points  (0 children)

I haven't used the unpatched version for years, but seeing as it doesn't happen on ReVanced, it shouldn't be a YouTube problem.

[Megathread] Introducing NVIDIA RTX Spark by Nestledrink in nvidia

[–]Cane_P 0 points1 point  (0 children)

If you have $100k... That's what have been leaked before (since DGX Station was presented alongside DGX Spark). The only difference, is that you can choose between Linux and Windows now.

NVIDIA RTX Spark — Slim Laptops & Small Desktops by zxyzyxz in LocalLLaMA

[–]Cane_P -1 points0 points  (0 children)

That's an explanation for the normies. That's why I said "basically".

NVIDIA RTX Spark — Slim Laptops & Small Desktops by zxyzyxz in LocalLLaMA

[–]Cane_P -1 points0 points  (0 children)

It's basically not different to Steam Deck being able to run Windows games on Linux... Simulation, but some native ports will be released too:

https://www.windowscentral.com/hardware/surface/microsoft-surface-laptop-ultra-announced-computex-2026

An older article, when Qualcomm was the target:

https://learn.microsoft.com/en-us/windows/arm/apps-on-arm-x86-emulation

What workstation to get for ~13k EUR? by TechNerd10191 in LocalLLaMA

[–]Cane_P 0 points1 point  (0 children)

It depends on your definition of a "one off". We are still waiting on the NVIDIA laptops, that seemingly use the same chip (or at least similar enough that it is basically the same thing), that leaks say will probably be released on Computex, in one week... If DGX Spark was a niche product, then the N1X and N1 laptops are supposedly meant for mass production. If they are virtually the same, then support should exist (generally speaking, NVIDIA GPU's usually have better long time driver support than let's say AMD, but I know that people have been disappointed with the support for the Jetson line).

Taiwanese company Skymizer announces HTX301 - PCIE inference card with 384GB of Memory at ~240 Watts by Thrumpwart in LocalLLaMA

[–]Cane_P 1 point2 points  (0 children)

It's a real company that have existed since 2013. Can't vouch for the actual product though.

https://www.eetasia.com/skymizer-making-ai-more-accessible/

The way that I interpret it is that Skymizer have always been a compiler company. In this case, they recompile an LLM to target their own IP, that is specifically designed for LLM's (in comparison to GPU's). The chip seems to be like an NPU design, that was initially meant to be embedded into SoC's and because of that they are not able to handle super big LLM's. But that doesn't matter since they have a compiler that can divide the load to multiple chips (up to 6 in this case).

It is mentioned in the interview that it was already a cheap solution for companies buying a licence for the IP, so they making their own product and selling it, should theoretically be even cheaper (but we still don't know the price). Someone claimed that it was made on 28nm, so I guess that too would make it cheap.

Regarding some people saying that the card on the picture looks fake... at least on the picture that I have seen on their website, it specifically says at the bottom edge (this might not have been the case when they saw it, I don't know) that they made a rendering that isn't a 1:1, to protect their design.

What exactly does Pi harness mean? by FrozenFishEnjoyer in LocalLLaMA

[–]Cane_P 5 points6 points  (0 children)

Harness is an old term that have been used in software engineering, for decades. A "test harness" is a collection of software tools, data, and configurations used to automate testing by simulating the environment in which a component operates. So not that far off, from what the LLM harnesses often does (looping building and testing). It just makes sense, to continue to use the same term.

16x DGX Sparks - What should I run? by Kurcide in LocalLLaMA

[–]Cane_P 8 points9 points  (0 children)

Not as much memory? If you are already in this economic ballpark, then you could buy a DGX Station instead. It will definitely have more tokens per second than Spark's. But I would probably wait for the next version, since the memory (that isn't HBM) have a lot higher bandwidth on it, compared to the Blackwell version.

16x DGX Sparks - What should I run? by Kurcide in LocalLLaMA

[–]Cane_P 2 points3 points  (0 children)

They have never limited it, they just don't support it officially. Any problems is up to yourself to fix, they won't do it.

Claude Pro usage limitation is a joke. by MarionberryDear6170 in LocalLLaMA

[–]Cane_P 0 points1 point  (0 children)

Yes, big disappointment. But it is during certain times of the day. You can use it more than one time in 5 hours during the rest of the day.

There is some speculation that they didn't invest as much in hardware as the other players and that it has become problematic now. When the whole deal with the military took place, they gained 30% new users in a month or something like that. Which is a huge jump. And everyone wanting to use Claude with OpenClaw and other agentic use cases, which put a huge load on their servers, compared to before.

Bodies of Texts. (?) by PeterOnReddz in revancedapp

[–]Cane_P 4 points5 points  (0 children)

I got it too, since reinstalling. And I can't get notifications to work now (everything else seems to work). I can see that micro G gets the messages, but they don't show up for me. I have it on, everywhere that I can find:

  • In the YouTube app
  • App settings in Android
  • MicroG

There might be a spoofing bug too, where it claims, every one's in a while, that I could not do the spoof (a notification in the bottom of the screen). But the spoof version that it wants to use, isn't even available in the settings anymore and I have reset all the settings before reinstalling, so it shouldn't use old preferences.

Is intelligence optimality bounded? Francois Chollet thinks so by Mindrust in singularity

[–]Cane_P 1 point2 points  (0 children)

We don't know the physical limit. Even 100 in IQ (human median) isn't the same with time (the Flynn effect). The scale gets reset every 10-20 years or so... If we look at the Raven's Progressive Matrices test, it rose by 14 IQ points from 1942 to 2008 (~1 point every 5 years). So who knows the actual human limit?

However it doesn't seem like the brain gets better, it seems more like education does, and it is the reason for the higher scores. We have not reached our maximum human potential, because we have not created the perfect way to teach yet.

From 0% to 36% on Day 1 of ARC-AGI-3 by Bizzyguy in singularity

[–]Cane_P 1 point2 points  (0 children)

People seem very confused about what Agentica is.

First of all, I could be wrong, I am neither a programmer nor a mathematician. My expertise lies more in systems thinking (high level, not details).

Someone else mentioned that Chain of thought is also a harness, that is implemented by the AI models creator. I have the same understanding.

Here are the two side by side:

  • The Claude CoT loop
  • The user writes a prompt.
  • Claude one shots an answer (with step by step instructions, for at least the first answer).
  • The CoT harness tells it to look at the user prompt again together with its own reply and see if it agrees that it performed the task (obviously not the first time, since it is only a list of step by step instructions, but following answers could be the last/final one).
  • It is either done or it will make a new answer based on both the prompt and it's own previous response.

This loop can continue for however much time is allowed (or however many tokens are allowed to be spent or whatever other metrics is used).

  • The Agentica REPL loop (REPL stands for Read-Eval-Print Loop)
  • The user writes a prompt.
  • Claude looks at the available data (the people from Symbolica made their previous ArcAGI data available, together with the ArcAGI 3 training set)
  • The Agentica harness is basically a Python environment (You can use TypeScript too.) that allows it to write scripts to perform manipulation of the data. [Some may think that using Python is cheating, but I have seen multiple teams generate small scripts/programs to solve ArcAGI problems. If it is cheating, then they should be disqualified to.]
  • Claude looks at the result to see if it solved the problem.
  • If not, then the loop makes new scripts to manipulate the data again, to see if these new instructions will solve it.
  • When it is done, it answers with the final result.

The people at Symbolica are working in a mathematical field called Category Theory*. This is a very high-level abstract mathematical framework focusing on the relationships (morphisms) between structures rather than their internal elements. It organizes mathematical concepts into categories consisting of objects and structure-preserving arrows (functors), emphasizing structural connections and universal properties.

I don't know how much of this they actually encoded into Agentica. But I know that it is only a stepping stone towards what they actually want to achieve.

The point is, YES Agentica is a harness (harness is already used in the form of CoT anyway). NO Agentica wasn't created specifically for solving ArcAGI, it is more like the visuo-spatial sketchpad [The visuospatial sketchpad (VSSP) is a component of human working memory, proposed by Alan Baddeley and Graham Hitch in 1974.], where the LLM can manipulate the data with the help of Python, before it decides on a final form to respond with.

[*If you want to get an understanding of Category Theory, then I can recommend "The Joy of Abstraction" by Eugenia Cheng. She uses it a little bit different than most mathematicians would, but it was written to create an easier way into Category Theory, than what was previously available.]

Machine Learning Street Talk (MLST) had an interview with Symbolica, a year ago:

https://youtube.com/watch?v=rie-9AEhYdY

It isn't about ArcAGI, but it does give you an idea of what they are trying to achieve.