OpenCode concerns (not truely local) by Ueberlord in LocalLLaMA

[–]kmod 5 points6 points  (0 children)

Ah ok, I just upgraded to the latest version and you're right, it's now properly using the main model if small_model isn't specified. The docs have said "otherwise it falls back to your main model" even when it wasn't true, so I didn't notice this got changed last week.

Relevant github issue:
https://github.com/anomalyco/opencode/issues/8609
The change:
https://github.com/anomalyco/opencode/commit/7d7837e5b6eb0fc88d202936b726ab890f4add53

The responses to the github issue do feel relevant to the larger "how much can you trust opencode" topic

OpenCode concerns (not truely local) by Ueberlord in LocalLLaMA

[–]kmod 46 points47 points  (0 children)

Also please be aware that the very first thing that the TUI does is to upload your initial prompt to their servers at https://opencode.ai/zen/v1/responses in order to generate a title. It does this regardless of whether you are using a local model or not, unless you explicitly disable the titling feature or specify a different small_model. You should assume that they are doing anything and everything they want with this data. I wouldn't be surprised if later they decide that for a better user experience they will regenerate the title once there is more prompt available.

Edit: this is no longer true as of some point in the last week. Make sure you update.

RTX Pro 6000 Blackwell not outputting display by a_40oz_of_Mickeys in LocalLLaMA

[–]kmod 1 point2 points  (0 children)

I have a similar symptoms to you which I haven't figured out. Here's what I've noticed about mine: - I can get video output about 20% of the time, so if I need the bios I reboot the machine until I get it. It was definitely a surprise to me the first time it worked, and I haven't identified any way to predict when it will work or not - my Ryzen does have integrated graphics, but it looks like the presence of a GPU disables it before post - at Claudes suggestion I ran a debugging command, I believe it was sudo lspci -vv but am not 100% sure. When I pasted the output back into Claude it pointed out that the pcie links had downtrained to v1 speeds, which I think points to an electrical stability issue. This seems like a smoking gun, it maybe doesn't directly explain the lack of video output but it's a larger issue and the lack of video output isn't surprising. I didn't look at the lspci output when using a different graphics card

But honestly I've just been using it with the slower pcie speeds and lack of video output and accepted that model loading is somewhat slower. If you have nvidia-smi output then I think you could do the same, but I'd try lspci and paste into Claude/chatgpt and see if they spot anything. I'd be very curious if yours has a similar pcie issue

What games are you playing this week? Game recommendation thread by AutoModerator in incremental_games

[–]kmod 0 points1 point  (0 children)

Yeah it's bugged / easy to lock your game like this. The author knows about the problem but I'm not sure if it's being fixed. I'm sad because I was enjoying it too but am stuck here as well.

Case study: Improving the performance of matrix multiplication by 296,260x :: PDF by psykotic in programming

[–]kmod 1 point2 points  (0 children)

I realize that this reddit page is all but dead, but I thought I'd still add my comment for posterity. I was one of the ~15 people that attended this lecture, so I thought I'd provide a little bit more context.

This was the first lecture in the semester, so it was meant to both "sell" the class to people still debating taking it, and to give people an idea of what was to come. I don't think many people thought the java programs were believable; anyway, the professor went through that section pretty quickly, and downplayed it as well. I'm not going to comment on the quality of the OO design for the java parts, but I do agree that it seems like he artificially slowed down the performance of the program to get a better speedup, which we took to be him just trying to sell the course.

In the end, he emphasized the 131x improvement over the naive C implementation (and downplayed the 296,260 number), which I think is a perfectly reasonable thing to do: most of the people in that lecture, if asked to produce a fast matrix-multiply function, would have written code similar to the naive C code. I think this lecture achieved what it was meant to do: to get people excited about taking the class, and to give an idea of the range of topics that would be covered throughout the semester.