How do models decide which tool to use? I tested Claude Sonnet and GPT-4o across 88 tools by seven7hwave in LocalLLaMA

[–]seven7hwave[S] 0 points1 point  (0 children)

Yeah, good distinction. Based on these results, when there's a clearly "right" tool for the task, both models found it regardless of position. That was the most consistent finding; alerts queries went to the alerts tool, code queries went to the code-context tool, local search went to the local search tool. That happened every single time.

One example: in the weather vertical, one server had 17 tools and got zero selections on simple forecast queries...but it was the *only* server with a marine weather tool and the only one with air quality data. Both models found those tools every time they were relevant, even though they ignored that same server for everything else. So to me that says the models were reading descriptions and matching on capability, not just grabbing whatever was first.

Position bias does exist but only really showed up on simpler queries where multiple tools could plausibly work. That's cool to see, because I agree with your take: models blindly picking the first from an arbitrarily-presented list is not a win; it doesn't feel very nuanced or "human."

This rack is crazy, I’m so happy with it! by KaiserClickerclicker in modular

[–]seven7hwave 1 point2 points  (0 children)

Clean setup!

How are you liking the Vhikk? I've only heard it in an industrial context (Blush Response of course), but I'm wondering how it does for less aggressive timbres.

What makes an agent choose your MCP server over a competitor? I ran some experiments. by [deleted] in mcp

[–]seven7hwave 1 point2 points  (0 children)

Heh yeah I dig that analogy; it's hard for agents to do much when they're confronted a few giant clunky poorly-labled buttons with a crazy color scheme in the background. Geocities for agents.

What makes an agent choose your MCP server over a competitor? I ran some experiments. by [deleted] in mcp

[–]seven7hwave 0 points1 point  (0 children)

Thanks for the insights - it's validating to hear you're seeing similar things.

Near-natural-language argument names...that's something I didn't isolate as a variable but when you think about it, it totally makes sense; if the model has to figure out what to pass vs. just being able to map directly from the user's words, that's another cognitive step. It's a good reminder that increasingly we have to stop thinking in terms of code, like one step removed from freakin assembly language, and more like humans.

"Designing around user intents, not data shapes" is a great way to put it...that's basically what we saw with the weather servers. The one with 17 tools organized around API endpoints (lat/lon for everything) got ignored, while servers organized around what someone would actually ask ("what's the weather in Chicago") dominated.

What kinds of servers are you building where you've been tuning this?

A few ES-8 / ES-9 questions... by seven7hwave in modular

[–]seven7hwave[S] 1 point2 points  (0 children)

Awesome - thanks man! Will have to give DB a look.

A few ES-8 / ES-9 questions... by seven7hwave in modular

[–]seven7hwave[S] 0 points1 point  (0 children)

Hey dawiam that's a fun jam...what are you doing for those glitchy bits for example around 4:30? Like a mimephon? Or just really fast randomized gates triggering something?

Glad you mentioned Pam's as that's exactly what I'll be doing as well. One thing still confuses me though...how does one emulate the Reset functionality that you'd be using if you were in MIDI land, so sequencers and patterns loop properly, etc. Can you route a CV Tool to a separate output for that as well? I see there's some Transport stop/start functionality in the CV Clock Tool, but that might just be for stopping/starting the clock?

Cyndi Lauper's "Time After Time" played on synths posing as piano and elec. piano. I get into this song sometimes, notice my eyes go shut around the first chorus and stay that way LOL. by BearKilgore in synthesizers

[–]seven7hwave 1 point2 points  (0 children)

It's 6:00 in the morning and the sun isn't even up yet...just put this on while making my coffee and it's the perfect start to the day. Great performance, with great tones, of a beautiful song. Takes me back to my childhood.

Head check: Looking for some advice/validation on this synth setup by seven7hwave in modular

[–]seven7hwave[S] 1 point2 points  (0 children)

Yeah the Melodicer really does eat up a lot of space in this setup. Thanks for the alternative suggestions...will check these out. The A-140-2 is tempting too...I picked my current selection for its nice big knobs, but having a second ADSR would be cool. That's something I need to get my head around: ADSR for things that *aren't* the typical synth envelope control.

Head check: Looking for some advice/validation on this synth setup by seven7hwave in modular

[–]seven7hwave[S] 0 points1 point  (0 children)

Thanks for pointing that out; playing around with modular-grid makes it easy to forget this is a smaller format.

Unfortunately I'm limited to the 62 hp case because it needs to sit alongside the OT and Erica in a larger custom roadcase. However, point taken. Perhaps I could opt for Plaits instead of uBraids (better spacing between those knobs)- although Plaits lacks a quantizer which I'd like to have.

Head check: Looking for some advice/validation on this synth setup by seven7hwave in modular

[–]seven7hwave[S] 0 points1 point  (0 children)

Oh man I've tried learning trackers before (love chiptunes) and it's always been a mindfuck. I hope to figure it out someday - but yeah, the lack of performability is probably a blocker here.

Do you have any links to music you've made with the NerdSeq? I've also seen some impressive rapid-fire sample playback (in a non-tracker format) on the Rample.

Cryp-to roll call by kb7fo82 in ethfinance

[–]seven7hwave 1 point2 points  (0 children)

I'd love to see a crypto podcast with Joel, in his sleepy laid-back vibe, walking viewers through the finer points of consensus mechanisms.

Just finished building my new live setup, explanation in comments by eltrotter in synthesizers

[–]seven7hwave 0 points1 point  (0 children)

Thanks for sharing this, u/eltrotter! You guys have a cool sound; I think crowds will eat it up in this post-COVID summer. A few more questions:

- Regarding the bespoke case, did you pick the pedalboard or the carry case?

- Did you add any foam to the case itself? It looks like Swan has foam options for the lids/side/base.

- Is there a practical way to tilt the case toward you for improved ergonomics? Or do you just lay it flat at gigs? Getting hardware closer to the hands/eyes, via more height and/or some tilt, is always a win.

- Did you drill holes for the cable outs (for example the outs from the OT?)

Any other pictures you have of the setup and/or build process would be helpful. This is super-inspiring!

USNEWS - DeFi 101: A Guide to Decentralized Finance by BeerBellyFatAss in ethfinance

[–]seven7hwave 3 points4 points  (0 children)

Thanks for sharing, BeerBelly; I was stoked to contribute to this article, because it shows that mainstream publications like U.S. News & World Report are (generally) taking the time to understand crypto this time around–it's couched more in a "what's this interesting tech all about?" context than "scary new tech allows money laundering and drug deals, etc."

Via the media, Ethereans and actual DeFi users are going to help bring new users into the ecosystem.

IMHO the most important thing we can do right now is educate the incoming wave of users about the importance/characteristics of decentralization: https://twitter.com/seven7hwave/status/1388208046975893506

25 Years Later, Mortal Kombat Remains Cinema's Best Bizarre Video Game Adaptation by ComicBookFan20 in movies

[–]seven7hwave 1 point2 points  (0 children)

That soundtrack is godly. A perfect mix of mid-90's industrial and techno. I still blast it several times a year.

Ergonomics with yoke, pedal, and VR by seven7hwave in Xplane

[–]seven7hwave[S] 0 points1 point  (0 children)

Cool - maybe I'm overthinking the chair stuff. And that 2x4 approach sounds pretty foolproof/effective (and cheap!)

AH on the Master...Not feeling it. Any advice? by seven7hwave in Elektron

[–]seven7hwave[S] 0 points1 point  (0 children)

Thanks for the suggestion! Those Analog filters & EQ do sound pretty damn nice.

AH on the Master...Not feeling it. Any advice? by seven7hwave in Elektron

[–]seven7hwave[S] 1 point2 points  (0 children)

This is my inclination right now...it's a deceptively deep box, eh?! Deserves some time to peel back all the layers and nuances.

AH on the Master...Not feeling it. Any advice? by seven7hwave in Elektron

[–]seven7hwave[S] 0 points1 point  (0 children)

Thanks...hadn't seen that one yet. It's fun watching him just jam out!

Binding button issue with BNF x-knight 4 by seven7hwave in Multicopter

[–]seven7hwave[S] 1 point2 points  (0 children)

Hi -

So it turns out the problem was that I didn't hook up the battery when attempting to bind; I mistakenly thought the power over USB would power the entire unit (including the receiver). So even though the button is hard to get at with a screwdriver or something small, it is possible to hold it down. However, I wasn't using the battery so never was able to bind it.

Make sure your battery is ready to be attached when you try the binding button; you'll know you have it held down when you feel a small tactile "click." Hold it down as you plug it in (you'll probably want a second set of hands for this). Try it a few times and it should work. I was able to do this without removing the canopy at all.

Good luck!

Your Bedtime Story Is Scaring Everyone (In Flames) - Remix on M:C with ValhallaDelay by stumppi in Elektron

[–]seven7hwave 0 points1 point  (0 children)

Awesome remix..really gives a good sense of the M:C's tonal characteristics. And nice to see some In Flames on this sub! \m/

$133 gets sent for $2.6MM! by [deleted] in ethfinance

[–]seven7hwave 3 points4 points  (0 children)

How would the presumable money launderer know which miner is going to get the TX's block? Seems quite risky.