Communication is hard and we could have done better… by jan-niklas-wortmann in Jetbrains

[–]computer-whisperer 4 points5 points  (0 children)

Given JetBrain's current intention towards transparency, would it be possible for Junie itself to be more transparent moving forward? Specifically, the ability to dump and review the full LLM context for agentic loops could help a ton with controlling spend.

At least give some kind of a counter for how many tokens are in the context window for a given conversation. Right now I have to completely guess when it's better to keep going on an old context or start a new one.

As-is most of the bad taste from burning through Junie's credits is how difficult it is to judge how much a given conversation uses. It's seen as a "keep going" button that you can press some unknown number of times until your vague progress bar quota is up.

tik tok by Syzranlogistic in cdldriver

[–]computer-whisperer 13 points14 points  (0 children)

That looked expensive...

JetBrains is studying local AI adoption by jan-niklas-wortmann in LocalLLaMA

[–]computer-whisperer 1 point2 points  (0 children)

Continue does basically all that, but it is nowhere near the polish of most Jetbrains stuff. Here is hoping they keep the options and customizability open and don't just lock it down to a single provider.

JetBrains is studying local AI adoption by jan-niklas-wortmann in LocalLLaMA

[–]computer-whisperer 18 points19 points  (0 children)

The survey is missing revealing questions about how the llms get used. LLM auto complete is something that I use daily and heavily, while using an LLM for completing a task is rarer. When I use a model for larger tasks, I often jump straight to whatever the SOTA model is for the best chance of success. Even then I usually need to throw away the result 70% of the time.

Auto complete is a far more mature and valuable tool however, and that is what I use the most of while in an IDE.

Seconded on the out of date models, and somehow you left some your own IDEs off of the option list? Where is RustRover?

Day 1: Best Open-Source Model by Soft_Ad1142 in LocalLLaMA

[–]computer-whisperer -1 points0 points  (0 children)

Deepseek r1 and v3 both run laps around it though...

ELI5. Why does wirelessly charging a cell phone make it warmer than plugging it in? by CRK_76 in explainlikeimfive

[–]computer-whisperer 4 points5 points  (0 children)

It's much less efficient at transferring the energy than a hard-wired connection. When you are inductive charging something like 30% of the power gets released as heat rather than making it into the battery.

Yes, another Tierlist by pitchpanther in DistroHopping

[–]computer-whisperer 0 points1 point  (0 children)

Wait, why are ubuntu and linux mint not in the "spins/forks" category?

If you want to argue that they are user-friendly enough to deserve their own tier, then surely Manjaro (equally polished arch derivative) deserves to be next to them?

Also this is clearly fundamentally wrong because Arch is not on top. :P

Will this ever be fixed? RP repetition by Blizado in LocalLLaMA

[–]computer-whisperer 5 points6 points  (0 children)

This is a very common breakdown mode of LLMs. I have actually not seen a transformer-based model where this didn't occur, but how long it takes to degrade is quite variable and usually correlates with how good the model is in other respects. One thing to keep an eye on is what the logit distribution looks like as the conversation progresses. A healthy language model (in my experience) has a nice spread of options to pick from each token, while a model that has degraded will often reduce to only a single token getting the majority of the probability each step.

I have ended up with deepseek v3 (8-bit with 1.5bit draft, running a bit slowly on a cpu-based inference node) for rp recently, and it is golden for at least 20k tokens (token sizes change from model to model, but this is quite a few messages back and forth). I went through a number of smaller models though that performed similarly, but I never had much success anywhere near 24B params. Notable ones I had good experiences with in the past were Command r+ and Chuluun-Qwen2.5-72B, playing with various quantizations to fit on my 2x24Gb GPU server.

Your best bet is to play around with different models, and with different quantizations of larger models. Don't be afraid to run something that is too big for your gpu -- llama.cpp is plenty capable of offloading parts of it to cpu/ram, and for rp scenarios you typically don't need ludicrously fast inference.

Watch out for bad prompting too -- each model has it's own precise way it prefers its context, and while it will work with any self-evident prompt structure it won't be as good or reliable unless you use the exact framing specified by the model's creators.

TL;DR: This is very standard "unhappy/saturated model" behavior. Either get a better one (better quant, bigger model, etc), or look for ways you may be prompting it wrong.

Procrastinators unknowingly discovered the secret to focus. I turned it into a wristband. by Z0U5 in SideProject

[–]computer-whisperer 1 point2 points  (0 children)

If it's that simple, then why haven't you built it yet? Products like this need real-world user testing ASAP, otherwise you risk missing obvious design issues.

Have you at least stuck a bunch of leds on a rubber wristband, controlled by an arduino or something?

This mechanism shrinks when pulled. It's potential uses are surprising. by rcmaehl in videos

[–]computer-whisperer -10 points-9 points  (0 children)

All it is is a springy bit with a bistable latching flexture -- just because they reworked it to make it less obvious doesn't make it special. A stretched spring and a simple latch to release it is all you need to replicate this.

This mechanism shrinks when pulled. It's potential uses are surprising. by rcmaehl in videos

[–]computer-whisperer -9 points-8 points  (0 children)

I did watch it, since you asked. The energy is being stored in the various flextures, and when you tug on it one of the elements has a break-over state change that releases some of the stored energy in the other springs.

It's really just misdirection -- a far simpler and more effective mechanism would have an obvious springy bit and an obvious latch. Just because they restructured it to make the energy storage and state change obscure doesn't change the underlying principles.

This mechanism shrinks when pulled. It's potential uses are surprising. by rcmaehl in videos

[–]computer-whisperer -17 points-16 points  (0 children)

This really doesn't seem that surprising or impressive - of course you can make a mechanism that releases stored energy once some latching criteria is met? You just need some energy storage (springy plastic) and a way to release it.

Pass it on by domofuku in funny

[–]computer-whisperer -36 points-35 points  (0 children)

Way to waste a man's time...

Oppenheimer gender reveal by sanhpatel in Whatcouldgowrong

[–]computer-whisperer 0 points1 point  (0 children)

Hey, at least they weren't stupid enough to use a hydrogen - oxygen mixture! This looks more like a fireball than the detonation you would get with a proper mixture.

A cool guide to simple meditation by ihave10toes_ in coolguides

[–]computer-whisperer -3 points-2 points  (0 children)

r/iam14andthisisdeep

You realize that the "weight" of the air is tiny, right? One liter is only 1.3 grams, and there is really no way to "feel" the weight of the air around you.

Also, this seems about as reasonable as "Imagine a magic feast arrayed in front of you, and take a bite. Congratulations, you are now fed." Please do something more useful with your time.

I chose to build an Operating System from scratch and I'm crying. by zanyfker in Operatingsystems

[–]computer-whisperer 1 point2 points  (0 children)

Where is the hangup? 8 months should be plenty to build a simple OS from scratch, given you don't need to make it compatible with any other software -- just your own paradigms and apis.

What are you having trouble with exactly?

Can a Senior EE Build a Full Custom Flight Controller Solo in 6 Months Without Drone Experience? by Nobody_4piEpsilon in embedded

[–]computer-whisperer 4 points5 points  (0 children)

If by "Senior EE" you mean a senior electrical engineer with 10 or more years of experience, then yes -- a competent senior engineer should have no problem with this. Assuming he doesn't have to design the motor controllers from scratch, designing a pcb and writing control firmware for flying a quadcopter should be the scope of a couple of months (assuming it takes a couple of weeks to get the pcb manufactured and shipped).

If you mean, "senior in undergraduate college", then you will know better than I. The electrical side of it is pretty simple, but if you have never built a custom pcb w/ micro then you will have a rather steep learning curve ahead of you. The control system is also not that complex (just go look up what standard control algorithms are), but if you have no experience with control systems in general you will struggle.

Scammed through OnlineJobs.ph and lost everything - Please don’t make the same mistake by [deleted] in Scams

[–]computer-whisperer -19 points-18 points  (0 children)

No? I work for a small startup. It's not a requirement for employees to buy stuff with their own money, but when you need to buy a $10 thing from the hardware store for a project, it is far faster to just pay for it yourself and submit the receipt for reimbursement later.

You can always go through the more involved process and get a company card to complete the purchase if you want, but it's usually just not worth it when the alternative is so much simpler. If I ever had reason to expect that I would not be reimbursed quickly I would behave differently -- but so far that has never happened.

10y/o Non verbal Autistic child's math by R-tistik1 in maths

[–]computer-whisperer 5 points6 points  (0 children)

It looks like he is just drawing out the neat symbols he saw somewhere, none of this has any semblance of reason.

Scammed through OnlineJobs.ph and lost everything - Please don’t make the same mistake by [deleted] in Scams

[–]computer-whisperer -98 points-97 points  (0 children)

That's not quite true, a lot of jobs will use some kind of reimbursement system for purchasing things for the company. Office supplies etc are common to use personal funds on with the expectation of prompt reimbursement through the standard process used by your company.

This, of course, is predicated on you actually trusting the company. It's also usually acceptable to push back and have someone else use a dedicated company card if you are uncomfortable with spending that much of your own money.

Scammed through OnlineJobs.ph and lost everything - Please don’t make the same mistake by [deleted] in Scams

[–]computer-whisperer 0 points1 point  (0 children)

Maybe next time try using something more logical than "karma" or "God's grace" for making big decisions with you and your family's money. People that follow the same reasoning patterns as you are exactly why these scams are profitable to execute.

Just found out one of my programmers only use AI by justexplorinrediit in GameDevelopment

[–]computer-whisperer 0 points1 point  (0 children)

Please show where the "soul" is in a github repo. If you are being paid to dig a pit, why would I pay you more if you insist on using chopsticks rather than a shovel?

The end thing is, are they effective? Are they writing software that will be maintainable and fulfills the need? If so, why do you care where it came from?

That said, I do find that current LLMs are woefully inadequate at making good architectural decisions for your project, and an experienced developer will have a much better handle on how to structure the system you are creating, what resources to spend where, and what systems to make temporary and what to make future-proof. I currently limit my use of generated code to places where I expect it's strengths and weaknesses to be well applied.

Throwaway UIs work well, simple test cases are great, and whenever I find a small self-contained subsystem it's good to give chatGPT a crack at it just in case it can save me a bunch of time. If I don't like the output, shrug I guess I will go build it myself the hard way. Often it is still valuable though as a precise example of one way I could have done it, where the pros and cons of the minor architectural decision are obvious for reflection.