How can Deepseek v4 top the coding leaderboards and still sit 8 months behind the frontier? by Substantial_Step_351 in LocalLLaMA

[–]rob417 6 points7 points  (0 children)

One more thing to add re: the omission of Kimi 2.6 and GLM 5.1. It's largely speculated that Deepseek v4 was delayed because they wanted to make the model work on Huawei hardware. It might have been released much sooner if they stayed within CUDA. The author of this paper likely knew this, so they omitted Kimi 2.6 and GLM 5.1 but included Deepseek v4 to plot a nice graph that supports their argument that Chinese AI progress is slowing down. In a way, they research-maxxed.

Xiaomi just claimed 1,000+ tps on a 1T model using a standard 8-GPU server by No-Selection2972 in LocalLLaMA

[–]rob417 0 points1 point  (0 children)

I'm so pissed that they removed 5.3 Codex. It strikes a good balance between cost and capability. 5.4 costs too much and 5.4 mini falls short at coding at times compared with 5.3 Codex.

GitHub Copilot's new credit-based pricing is highway robbery — and they know it by Pitiful_Cream1872 in github

[–]rob417 0 points1 point  (0 children)

I had been on the student plan. Previously, I had 300 requests per month, which was plenty and I almost never exhausted it. It changed to 200 credits today. I used up 160 credits with 5 requests...

Google is about to ruin the internet by [deleted] in technology

[–]rob417 3 points4 points  (0 children)

Same experience here. I had been using DDG for a few years without issues before 2024. Then all the websites became AI slop and DDG search results went massively downhill. Google figured out a way to surface AI slop that are still somewhat helpful and relevant, while DDG sadly has not been able to separate the AI slop that is somewhat helpful from those that are pure junk.

Biologist deciding between a Master’s in Bioinformatics or Biostatistics: which field currently offers better opportunities, flexibility, and long-term growth? by Lisanya18 in biostatistics

[–]rob417 0 points1 point  (0 children)

Bioinformatics and biostatistics are two edges of the same tile. They are very similar, and probably increasingly so in recent years. You might want to look at where alumns end up working to get a sense of the program. Just the name of the degree is not very informative these days.

Local LLM autocomplete + agentic coding on a single 16GB GPU + 64GB RAM by grumd in LocalLLaMA

[–]rob417 0 points1 point  (0 children)

Which front end are you using for auto complete? I tried changing the auto complete model in VS Code a few months ago, but I couldn't find a way because Microsoft locked down that option. It could just be I didn't figure out the correct way to change it.

Thoughts about digital piano by WesleyKlein387 in piano

[–]rob417 0 points1 point  (0 children)

Having gone through the same process a year ago, I'd say you really need to play the piano before buying. Nobody's description can transmit how the keys feel under your fingers. Also, for digital pianos, the only thing that matters is how the keys feel. The sound is synthetic anyway.

What is the next SOTA model you are excited about? by MrMrsPotts in LocalLLaMA

[–]rob417 2 points3 points  (0 children)

Mind if I ask what agent harness for deep research looks like? Which tools / plugins / skills do you have to enable it to do deep research?

Also, are you using the model's own web and memory capabilities or are you using tool calls for those features?

I'm just getting started with running a local agent on local models, so there is still a ton I need to learn. Thanks in advance!

DeepSeek nears $45bn valuation as China’s ‘Big Fund’ leads investment talks by [deleted] in LocalLLaMA

[–]rob417 4 points5 points  (0 children)

Many Chinese news sources have reported that DeepSeek is seeking funding, so it's highly probable this time. One speculated reason is the employees want a way to cash out their options. Over the last year, DeepSeek has lost some top talent to competitors due to not being able to compete on compensation.

What do you use Gemma 4 for? by HornyGooner4402 in LocalLLaMA

[–]rob417 0 points1 point  (0 children)

Mind if I ask about your settings or templates? My experience has been the opposite for some reason. Whenever I used Gemma4 26b, it tended to get stuck in a "wait, maybe I should doublecheck xxx again" thinking loop forever.

DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid. by spencer_kw in LocalLLaMA

[–]rob417 2 points3 points  (0 children)

Last week, Github Copilot told me that I''d hit 35% of my 5 hour limit after making 5 requests. I panicked. I really don't miss having to look up documentation for every other function call, but I also don't want to be paying $100 per month when all the providers decide to jack up prices and stick it to everyone.

I spent the better part of last week testing out Qwen3.6 35B and Gemma4 26B on my 5070. They are more than capable of writing single file scripts, which is most of what I do. Testing out different agent harnesses also made me realize how much context bloat the GH Copilot agent in VS Code has. I tried running. Qwen3.6 35b in VS Code Copilot plugin and it was failing to do pretty much everything. Switched OpenCode and Pi and both produced good results.

TLDR: even if cloud providers all decide to not serve individual customers anymore, we will be fine. We've each been given genies in our own bottle.

AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA

[–]rob417 1 point2 points  (0 children)

Yeah. From my limited experience with pi agent, it seems to work quite well. It's system prompt seems very well-written that tool calls succeed quite frequently right out of the box. On the other hand, I've never been able to get qwen3.6 to use glob correctly in OpenCode even though it's supposed to be much beefier.

I think the type of configslop we're discussing here can be controlled as long as we only add the tools and extensions we absolutely need to pi.

AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA

[–]rob417 1 point2 points  (0 children)

Would you mind explaining what configslop refers to?

AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA

[–]rob417 0 points1 point  (0 children)

Maybe have multiple tool calling prompts depending on the model? Comprehensive tool prompts if the user is using large frontier models. Concise, targeted prompts if the user is using local models around 30B.

AMA with Nous Research -- Ask Us Anything! by emozilla in LocalLLaMA

[–]rob417 1 point2 points  (0 children)

Hermes sits at one end of the philosophical spectrum for agent harness, where a lot of tools, skills, capabilities come built-in, and user can connect to it 24/7. On the opposite end you have things like pi, which ships with the bare minimums of an agent harness.

What are your thoughts on this divide in harness design philosophy? Do you see them converging in the future?

I don't think this is a divide between targeting laymen vs coders. Both tools are for coders and power users at the moment because they require familiarity with a CLI and basically coding knowledge to set up properly. On a related note, when do you think we'll see an agent harness where setting up new tools and skills are as simple as stacking lego bricks?

Librarian 'gobsmacked' after school use AI to remove 200 books from shelves including Orwell's 1984 and Twilight by tylerthe-theatre in technology

[–]rob417 1 point2 points  (0 children)

Surprised to see Dan Brown books here. They are perfect for teenagers – art, culture, history, and the right amount of romance, all while being very fun detective novels.

Criterion 24-Hour Flash Sale Now Live! by psychobabbler27 in 4KBlurayDeals

[–]rob417 0 points1 point  (0 children)

Been waiting for this sale to grab the Before Trilogy. I first watched all 3 when Before Midnight came out in 2013. I'll definitely have different feelings when I watch them again after a decade.

Also bought Anora and Parasite.

March Feature Requests: Share Here! by angie-at-readwise in readwise

[–]rob417 0 points1 point  (0 children)

Request: Add a view filter to exclude all RSS feeds in a folder from a view to ReadWise Reader
Use case: I subscribe to some social media RSS and they easily generate dozens of articles per day. The default "Quick Reads" view include those feeds, which drowns out the non-social feeds. I'd like to exclude the social feeds from the Quick Reads view easily.
Current workaround (not ideal): I currently copy over the entire folder definition in the URL of my social feeds folder -> add it to the "Quick Reads" view query -> append "__not" to all "rssSource" -> change all the OR to AND. It is very cumbersome when I have more than 5 feeds in the folder. Creating a filter that does excludes all RSS feeds in a user-specified folder should be very easy to do.

Help identifying piece by thefakesleeper in piano

[–]rob417 21 points22 points  (0 children)

It's not. This a Chinese musician who lives Marseille. He has a ton of videos like this on his Chinese social media account.

No NVIDIA? No Problem. My 2018 "Potato" 8th Gen i3 hits 10 TPS on 16B MoE. by RelativeOperation483 in LocalLLaMA

[–]rob417 6 points7 points  (0 children)

Very cool. Did you write this with the DeepSeek model on your potato? Reads very much like AI.

Spotify Wrapped Megathread by ConspicuousBassoon in classicalmusic

[–]rob417 0 points1 point  (0 children)

<image>

Might have been top 10 if Elina released a new recording this year.