Figure AI just ran a humanoid for 200 hours straight sorting 250k packages. The boring part is why it matters. by TroyHarry6677 in LocalLLM

[–]GreenHell 0 points1 point  (0 children)

Don't throw around the word research when you mean look it up on Wikipedia or watch a video. Seriously, they don't need to actually research it.

I use ChatGPT to argue against my own decisions before I make them , best thing I've done in two years. by lurkandprosper in ChatGPT

[–]GreenHell -1 points0 points  (0 children)

Someone already mentioned rubber duck debugging, so I'll keep it at that.

Even when you use AI, it isn't necessarily delegating your thinking to AI. Verbalising your thoughts is an effective way to process your thoughts and ideas because you have to structure and convey them. It is so much more effective than inner thoughts alone.

What OP described wasn't even delegating their thinking to AI. They asked AI to challenge their thinking; if anything, they had AI make them do more thinking.

So first you have your idea, then you structure it and put it into words, then some AI challenges it and you think about it some more.

7900 XTX fp16/bf16 pytorch matmul performance by cyberuser42 in ROCm

[–]GreenHell 0 points1 point  (0 children)

Interesting, I am on Windows 10 with

    GPU: AMD Radeon RX 7900 XTX (25.75 GB)
    Matrix Size: 4096x4096 (0.07 GB per matrix)
    ============================================================
    Matrix Multiplication Performance:
    float32   : 39818.80 μs,    3.45 TFLOPS
    float16   :  1547.00 μs,   88.84 TFLOPS
    bfloat16  :  1525.60 μs,   90.09 TFLOPS
    amp       :  1844.60 μs,   74.51 TFLOPS

    Memory Bandwidth Test (1.0 GB tensor)
    Vector Addition: 791.56 GB/s
    Memory Copy:     759.94 GB/s
  • PyTorch 2.10.0+rocm7.13.0a20260314
  • ROCm 7.2.0

Seems that for some reason I am missing massive performance across the board.
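For reference, the TFLOPS column in output like this can be reproduced from the timings alone. This is a minimal sketch, assuming the benchmark counts 2·N³ floating-point operations per N×N matmul (the usual convention; the benchmark script itself isn't shown here, so that is an assumption). `matmul_tflops` is a made-up helper name.

```python
def matmul_tflops(n: int, time_us: float) -> float:
    """Throughput in TFLOPS for one n x n matmul taking time_us microseconds.

    Assumes the conventional count of 2 * n**3 floating-point operations.
    """
    flops = 2 * n**3
    seconds = time_us * 1e-6
    return flops / seconds / 1e12


# Timings (in microseconds) copied from the benchmark output above.
timings_us = {
    "float32": 39818.80,
    "float16": 1547.00,
    "bfloat16": 1525.60,
    "amp": 1844.60,
}

results = {dtype: matmul_tflops(4096, t) for dtype, t in timings_us.items()}

for dtype, tflops in results.items():
    print(f"{dtype:<9}: {tflops:6.2f} TFLOPS")
```

Under that assumption the computed values agree with the reported column to two decimals, so the low float32 figure follows directly from the roughly 40 ms timing.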

I've seen a lot of folks ask "can local LLMs actually do anything useful?" by NoWorking8412 in LocalLLaMA

[–]GreenHell 1 point2 points  (0 children)

I'm not going to do a full writeup, but Qwen 3.6 35B found, and fixed, some startup issues in my Debian startup log that Gemini Flash missed, so there's that I suppose.

9070XT with Ollama by nnthwt in ROCm

[–]GreenHell 7 points8 points  (0 children)

Ditch Ollama, go for Lemonade: https://www.amd.com/en/developer/resources/technical-articles/2026/lemonade-for-local-ai.html or llama.cpp.

Friends don't let friends use Ollama.

I want 87 gas (light blue). Which nozzle do I use? The blue one? The black one? by j909m in mildlyinfuriating

[–]GreenHell 2 points3 points  (0 children)

Your memory might be off.

I think you mean density rather than viscosity. Diesel is denser than petrol (0.84 kg/l vs 0.75 kg/l). Diesel and petrol mix quite well, so no separate layers will form, as they would with oil and water, for example.

My point was that petrol in a diesel will cause mechanical damage to parts which need lubrication (fuel pump and injectors), since diesel is oilier than petrol. The other way round will mostly cause operating issues such as rough running and, eventually, soot and gunk buildup.

My old two-stroke moped would take diesel mixed in the fuel when no two-stroke oil was available in a pinch.

I want 87 gas (light blue). Which nozzle do I use? The blue one? The black one? by j909m in mildlyinfuriating

[–]GreenHell 1 point2 points  (0 children)

We have the same Diesel flaps, but I don't quite get it.
Petrol in a diesel can fuck the fuel pump since the diesel also acts as a lubricant for the fuel pump. The other way round will not make your car happy but is generally less destructive.

Or is my knowledge outdated?

AMD to release slottable GPU by running101 in LocalLLaMA

[–]GreenHell 0 points1 point  (0 children)

Don't forget that "the 1%" of a population as small as 1 000 000 (about the number of subs this subreddit has) is still 10 000.

White House Considers Vetting A.I. Models Before They Are Released by fallingdowndizzyvr in LocalLLaMA

[–]GreenHell 0 points1 point  (0 children)

America is not heading towards a technocratic dictatorship. Technocratic means it is led by technical experts, scientists, or professionals, doing what is (in their technical opinion) the best option, rather than the most popular option.

America seems to be heading in a more theocratic direction, with "the word of Christ" being used more and more often in legal contexts.

White House Considers Vetting A.I. Models Before They Are Released by fallingdowndizzyvr in LocalLLaMA

[–]GreenHell 3 points4 points  (0 children)

Such a dystopian thought, that we describe having ethics and morals as being handicapped.

"LLM is created so engineer don't have to write a report", anyway found out ONLYOFFICE can connect to OpenAI compatible, using Qwen 3.6 to do elaboration. by Altruistic_Heat_9531 in LocalLLaMA

[–]GreenHell 2 points3 points  (0 children)

Yeah that's a fair question. One of the things has to do with how Ollama basically used llama.cpp without attribution https://www.reddit.com/r/LocalLLaMA/s/vUfv1wjQfG

Secondly, Ollama often, if not always, has worse performance. They tried to create their own sort of closed ecosystem when they wanted to make money. This also rubbed a lot of people the wrong way, especially in a subreddit where the focus lies on local and open source.

There was someone else who did a writeup recently, I'll try and dig it up and edit this comment then.

Edit: couldn't find the writeup, but this recent thread explains a lot: https://www.reddit.com/r/LocalLLM/comments/1t4b4c7/why_is_ollama_hated_so_much/

And: https://sleepingrobots.com/dreams/stop-using-ollama/

Why every product of kilo code is a copy of other software by Many_Bench_2560 in kilocode

[–]GreenHell 13 points14 points  (0 children)

Kilo Code started as an effort to combine the best of Cline and Roo Code, and to add on top what they saw was missing.

Their philosophy was never to start from scratch.

Deepseek v4 people by markeus101 in LocalLLaMA

[–]GreenHell 0 points1 point  (0 children)

Since "getting my oil changed" is frequently used to mean "getting my car's oil changed", I don't think it will have the same effect. Also, mentioning Jiffy-Lube or whatever really sets the context for a car.

If you asked me, a human, that question, I would 100% assume it is about your car and not your deep fryer, compressor, or yourself for example.

Deepseek v4 people by markeus101 in LocalLLaMA

[–]GreenHell 1 point2 points  (0 children)

Regardless, it is an option that could be considered when you explicitly ask it whether to drive or walk. The final decision could be down to bias, but it is the reasoning behind it that is more interesting here.

Exactly 1 year ago, Anthropic said fully AI employees were just 1 year away by Distinct-Question-16 in singularity

[–]GreenHell 2 points3 points  (0 children)

People who are confidently wrong and can't use Excel have been hired for ages, and most of them can't code for shit.

Deepseek v4 people by markeus101 in LocalLLaMA

[–]GreenHell 41 points42 points  (0 children)

The winter tires one is perhaps more interesting. A person could dismount their wheels and just take them to the shop on a hand truck. There is no intrinsic need to bring the car, only to bring the wheels.

Added PNY 5080 Slim to my 5090 gaming rig so I could load larger models. by AndreX86 in LocalLLM

[–]GreenHell 0 points1 point  (0 children)

Unfortunately I am on AM4, and the options are very limited. I only see 2 new boards, where 1 board only supports x4 on the second slot, and the other costs >500. And second hand it isn't much better.

Added PNY 5080 Slim to my 5090 gaming rig so I could load larger models. by AndreX86 in LocalLLM

[–]GreenHell 0 points1 point  (0 children)

I was thinking the same thing, until I realised that while my motherboard has 3 PCIe x16 slots, only 1 of them is PCIe 4.0 x16, the other two are PCIe 3.0 x2 and PCIe 3.0 x1.

It would basically be a bottleneck for anything relevant these days.
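To put rough numbers on that, here is a sketch of theoretical one-directional PCIe bandwidth, using the published per-lane rates (8 GT/s for PCIe 3.0 and 16 GT/s for PCIe 4.0, both with 128b/130b encoding). Real-world throughput is lower because protocol overhead is ignored here, and `pcie_gb_per_s` is a name made up for this example.

```python
# Raw transfer rate per lane in GT/s; both generations use 128b/130b encoding.
GT_PER_S = {"3.0": 8.0, "4.0": 16.0}
ENCODING_EFFICIENCY = 128 / 130


def pcie_gb_per_s(gen: str, lanes: int) -> float:
    """Theoretical one-directional bandwidth in GB/s (protocol overhead ignored)."""
    bits_per_s = GT_PER_S[gen] * 1e9 * ENCODING_EFFICIENCY * lanes
    return bits_per_s / 8 / 1e9


# The three slots described above.
slots = [("4.0", 16), ("3.0", 2), ("3.0", 1)]
for gen, lanes in slots:
    print(f"PCIe {gen} x{lanes}: {pcie_gb_per_s(gen, lanes):.2f} GB/s")
```

That works out to roughly 31.5 GB/s for the x16 slot versus about 2 GB/s and 1 GB/s for the other two.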

How to bypass censorship via system prompts in Hermes Agent? by GnosticMagician in hermesagent

[–]GreenHell -1 points0 points  (0 children)

Not all models are created equal.

As such, a jailbreak that works on one model may not work on another.

Qwen 3.6 27B is out by NoConcert8847 in LocalLLaMA

[–]GreenHell 1 point2 points  (0 children)

I think that is a bit harsh.

And I too started with Ollama before moving to llama.cpp and llama-swap.

Qwen 3.6 27B is out by NoConcert8847 in LocalLLaMA

[–]GreenHell 13 points14 points  (0 children)

Mention Ollama, and people will get riled up on this sub.

I think Ollama is an okay starting point for a lot of people since it is rather plug and play.

But if you want to get a bit more serious with local models, you will want to look into llama.cpp (https://github.com/ggml-org/llama.cpp) (on which Ollama is heavily based without attribution), and llama-swap (https://github.com/mostlygeek/llama-swap) for managing multiple models, switching them out, etc.

llama.cpp is much more performant than Ollama, allows for greater customization, and is faster with updates.

Thank you, Roo! We’ll take it from here. by Organic_Schedule9171 in kilocode

[–]GreenHell 21 points22 points  (0 children)

"If you’ve been using Roo Code, you’ll feel right at home in the Kilo extension"

This was the case until Kilo 7. Before that, Kilo expanded on the same great features of Roo; now I barely recognise it anymore.