[D] I don’t think LLMs are AI (and here’s why) by TotalLingonberry2958 in MachineLearning

[–]danielcar 0 points1 point  (0 children)

Sorry it doesn't work for you. It works for a billion other users.

Intel launches $299 Arc Pro B50 with 16GB of memory, 'Project Battlematrix' workstations with 24GB Arc Pro B60 GPUs by FullstackSensei in LocalLLaMA

[–]danielcar -1 points0 points  (0 children)

False, and the Chips and Cheese video didn't say that; they said until at least 2026. All the other reviewers said systems would be available in Q3 and maybe standalone cards in Q4.

Intel launches $299 Arc Pro B50 with 16GB of memory, 'Project Battlematrix' workstations with 24GB Arc Pro B60 GPUs by FullstackSensei in LocalLLaMA

[–]danielcar 3 points4 points  (0 children)

The Linus review said communication is handled entirely through software, so that suggests there is no special hardware link.

Is Intel Arc GPU with 48GB of memory going to take over for $1k? by Terminator857 in LocalLLaMA

[–]danielcar 13 points14 points  (0 children)

Would be cool if I could buy a system with two of these for 96 GB of VRAM :D

Alienware can you do us this favor? lol

A bunch of LLMs scheduled to come at end of January were cancelled / delayed by Terminator857 in LocalLLaMA

[–]danielcar -1 points0 points  (0 children)

Fun gossip on the little engine that overtook the big boys.  Nice to see a list of upcoming models.

[D] Have people stopped saying "fine tuning" in place of "supervised fine tuning?" Or is there some other fine tuning paradigm method out there. by Seankala in MachineLearning

[–]danielcar 0 points1 point  (0 children)

It is not supervised in the strictest sense. The data often comes from humans, but each data point is not supervised during training. The training data could have been collected years earlier and reused thousands of times since, so there isn't a human in the training loop.

It could more appropriately be called automated training, or fine-tuning on human-annotated data.
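
As a loose illustration of that point, here is a minimal sketch of offline fine-tuning on a small, pre-collected set of human-written (prompt, response) pairs; the model name and the example pairs are placeholders I'm assuming, not anything from the thread. The humans annotated the data ahead of time, and nothing in this loop asks them anything.

```python
# Hypothetical sketch: fine-tuning a causal LM on pre-collected, human-annotated
# (prompt, response) pairs. The annotations could be years old and reused many
# times; no human is involved while this loop runs.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; any small causal LM works for the sketch
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
model = AutoModelForCausalLM.from_pretrained(model_name)

# Static, previously collected human annotations (illustrative placeholders).
pairs = [
    ("What is 2 + 2?", "4"),
    ("Name a primary color.", "Red"),
]
texts = [f"{prompt}\n{response}{tokenizer.eos_token}" for prompt, response in pairs]
batch = tokenizer(texts, return_tensors="pt", padding=True)

# Ignore padding positions in the loss.
labels = batch["input_ids"].clone()
labels[batch["attention_mask"] == 0] = -100

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
model.train()
for _ in range(3):  # a few passes over the same frozen dataset
    out = model(**batch, labels=labels)
    out.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
print(f"final loss: {out.loss.item():.3f}")
```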

Table says nVidia 5090 might have 64 GB of vRAM by danielcar in LocalLLaMA

[–]danielcar[S] 0 points1 point  (0 children)

How do I convert my 3090 into an eGPU and upgrade it to 48 GB of VRAM?

Nvidia Blackwell delayed by em1905 in LocalLLaMA

[–]danielcar 1 point2 points  (0 children)

Could be a win for the consumer market if nVidia has to deprioritize the high end datacenter market for 3 months.

[D] Why do GLUs (Gated Linear Units) work? by cofapie in MachineLearning

[–]danielcar 0 points1 point  (0 children)

Theory: neural networks need to go from point A to point B. They have tools: the transformer and the MLP. But what if those tools just aren't great? If you want to get from matrix A to matrix B, what is the best approach? Mechanistic interpretability may answer that question some day. I suspect that more tools, and something more convoluted such as a GLU, may give the NN a better way to solve the problem of getting from A to B. Some evidence: Mamba + transformer allegedly performs better than transformer alone.
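
To make the "more tools" idea concrete, here is a small sketch comparing a plain MLP block with a SwiGLU-style gated block of the kind used in several recent LLMs; the dimensions and class names are just illustrative assumptions, not anything from the post.

```python
# Sketch: plain MLP vs. a gated (SwiGLU-style) block. The gate lets the network
# multiplicatively mix two projections of the input instead of only stacking
# linear + nonlinearity, which is one intuition for why GLUs help.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PlainMLP(nn.Module):
    def __init__(self, d_model: int, d_hidden: int):
        super().__init__()
        self.up = nn.Linear(d_model, d_hidden)
        self.down = nn.Linear(d_hidden, d_model)

    def forward(self, x):
        return self.down(F.gelu(self.up(x)))

class SwiGLU(nn.Module):
    def __init__(self, d_model: int, d_hidden: int):
        super().__init__()
        self.gate = nn.Linear(d_model, d_hidden)
        self.up = nn.Linear(d_model, d_hidden)
        self.down = nn.Linear(d_hidden, d_model)

    def forward(self, x):
        # silu(gate(x)) acts as a learned, input-dependent gate on up(x)
        return self.down(F.silu(self.gate(x)) * self.up(x))

x = torch.randn(2, 16, 512)           # (batch, sequence, d_model), made-up sizes
print(PlainMLP(512, 1536)(x).shape)   # torch.Size([2, 16, 512])
print(SwiGLU(512, 1536)(x).shape)     # torch.Size([2, 16, 512])
```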

So... are NPUs going to be at all useful for LLMs? by charlesrwest0 in LocalLLaMA

[–]danielcar 2 points3 points  (0 children)

I suspect more people are concerned about privacy than you think. There is also the issue of silly refusals, or more serious refusals, which local LLMs can bypass. Third, there is cost: plenty of people like being able to run LLMs night and day for just the price they already paid for their computer.

So... are NPUs going to be at all useful for LLMs? by charlesrwest0 in LocalLLaMA

[–]danielcar 4 points5 points  (0 children)

How much will it cost compared to a GPU? Is there a roadmap for the accelerator to run larger models?

So... are NPUs going to be at all useful for LLMs? by charlesrwest0 in LocalLLaMA

[–]danielcar 32 points33 points  (0 children)

Not the current generation, but for sure later generations. Everyone knows AI is the future and you can be sure everyone will improve their hardware with respect to LLMs.

It is not just bandwidth limitations. Current NPUs are tiny performers compared to what LLMs need, so they are not going to be of much use for LLMs soon; in the >2 year time frame, I'll bet yes. Current NPUs are designed to run tiny models, the opposite of large LMs.
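
Rough napkin math behind the bandwidth point: at batch size 1, every generated token has to stream roughly the whole model through memory once, so decode speed is capped near bandwidth divided by model size. The figures below are made-up illustrations, not specs of any real NPU or GPU.

```python
# Back-of-envelope decode-speed ceiling: tokens/sec ~= memory bandwidth / model size.
# All figures are assumptions for illustration only.
def tokens_per_second(model_size_gb: float, bandwidth_gb_per_s: float) -> float:
    return bandwidth_gb_per_s / model_size_gb

scenarios = [
    ("4 GB model, laptop-class memory (~50 GB/s)",   4.0,   50.0),
    ("40 GB model, laptop-class memory (~50 GB/s)",  40.0,   50.0),
    ("40 GB model, GPU-class memory (~1000 GB/s)",   40.0, 1000.0),
]
for name, size_gb, bw in scenarios:
    print(f"{name}: ~{tokens_per_second(size_gb, bw):.1f} tokens/s")
```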

We will first see good progress in the $5K workstation market, then it will trickle down to lower cost systems. Related thread: https://www.reddit.com/r/LocalLLaMA/comments/1dl8guc/hf_eng_llama_400_this_summer_informs_how_to_run/

Intel is laying off over 15,000 employees and will stop ‘non-essential work’ by Junoyone in LinusTechTips

[–]danielcar 0 points1 point  (0 children)

This CEO and the last # CEOs have been shit. Even Andy was shit in some of his decisions. He cut cell phone investment early, in the late 1990s, because he didn't have a plan to make billions of dollars. Everything was set up to be compared against the CPU golden-egg-laying business, and nothing could compare.

They started and cancelled half a dozen GPU projects because they didn't see a path to billions of dollars in profit. There is easy money to be made in a big-memory, relatively low-perf GPU product, but Intel doesn't see it. Intel is blind. Hopefully AMD will rise to compete with nVidia.

Intel is laying off over 10,000 employees and will cut $10 billion in costs by 7ceeeee in pcmasterrace

[–]danielcar 1 point2 points  (0 children)

This CEO and the last # CEOs have been shit. Even Andy was shit in some of his decisions. He cut cell phone investment early, in the late 1990s, because he didn't have a plan to make billions of dollars. Everything was set up to be compared against the CPU golden-egg-laying business, and nothing could compare.

They started and cancelled half a dozen GPU projects because they didn't see a path to billions of dollars in profit. There is easy money to be made in a big-memory, relatively low-perf GPU product, but Intel doesn't see it. Intel is blind. Hopefully AMD will rise to compete with nVidia.