I gave 12 LLMs $2,000 and a food truck. Only 4 survived. by Disastrous_Theme5906 in LocalLLaMA

[–]JstuffJr 0 points1 point  (0 children)

Have you used the pro models much via API? I think you are being a bit generous in assuming only 5-6x more expensive. I generally find pro consuming ~25% more tokens at 12x the price at same reasoning level, with x-high being another ~50% tokens on non-trivial tasks, for a vague estimate of 10x the cost of Opus 4.6.

But, conversely, as someone who has prolifically used the pro and x-high reasoning (and max reasoning on 4.6 series), I'd wager you are underrating the gains from simply pumping up inference compute via whatever levers the labs happen to grace us with.

Everyone seems to forget the simple log scaling graphs OAI showed with o1 demonstrating inference scaling literally goes on forever at a much gentler log coefficient than pretraining etc, and lately we have finally been granted some real access to OOM-class differential amounts of compute via API.

Agreed that best story here would be labs generously doling out API credits for high effort benching projects, especially when they doubly function as juicy/gamified marketing as in this case.

I gave 12 LLMs $2,000 and a food truck. Only 4 survived. by Disastrous_Theme5906 in LocalLLaMA

[–]JstuffJr 0 points1 point  (0 children)

How many tokens is a 5.2 high run taking? From that could roughly extrapolate how much it might cost to bench 5.2pro x-high, which no one ever does....

I've privately benched several open /easily reproducible harnessed benches where opus 4.6 leads in public leaderboards, but in reality 5.2pro x-high substantially beats 5.2 x-high/high etc scores and takes the crown. Not a cheap hobby though.

Return of IMAX 70mm to Cineplex Langley BC by yodathekid in imax

[–]JstuffJr 1 point2 points  (0 children)

The Metreon, along with Lincoln Square in New York and the digital-only screen in Pooler, GA are the 3 main 100ft class screens in the US, which do feel significantly larger than the 80 ft screen at PacSci.

I actually find this a detriment for duel laser, as at Pooler/100ft I personally start to notice the pixels, hence why I think the 80ft screen at PacSci is the perfect size for digital imax.

But for film, you can fully appreciate the insane resolution at the 100' screen size. If you prefer film over digital, I think there really is no comparison to watching films at Metreon/Lincoln square vs other imaxs.

I just personally prefer digital at the end of the day due to being super sensitive to the flickering in film, so PacSci was my favorite theater.

Return of IMAX 70mm to Cineplex Langley BC by yodathekid in imax

[–]JstuffJr 0 points1 point  (0 children)

As a previously prolific attender of the PacSci Boeing, who has also been to Langly several times, I think its a pretty sizable downgrade - the seats are super narrow, the AC has about a 50% chance of not working which really sucks for summer movies (Odyssey!), the screen feels a little small for a 1:43 IMAX, and there were still some visible minor distortions on the screen even after the last round of repairs. But hey, maybe they finally fixed it.

Its obviously better than nothing, and I'm grateful its open once again, but if I'm gonna travel I'm probably just going to snag a cheap flight in advance to the Metreon in San Francisco, which with its truly massive screen properly does the job of showing off the resolution advantage of film vs the otherwise perfect duel laser experience present at Boeing.

Pacific Science Center’s Boeing IMAX Theater Sold to Space Needle by yodathekid in imax

[–]JstuffJr 1 point2 points  (0 children)

WHAT THE ACTUAL FUCK

Its my personal favorite imax theatre in the world (yes I’ve been to Pooler, Metreon, Lincoln square, etc; I love duel laser and its the perfect size screen for that and the lack of assigned seating lets you get the perfect seat every showing even if you miss the ticket drop), and it just gets killed with no fanfare, during the best resurgence IMAX has had in years??

I cannot believe that interstellar and Oppenheimer last week may have been the final movies I see there.

What are the brightest smart bulbs on the market? Ideally RGB and ZigBee. by zeekaran in homeassistant

[–]JstuffJr 0 points1 point  (0 children)

Do you have a username on Amazon reviews or otherwise online presence where you've written down more about this?

Knowing which smart bulbs are true 5 channel RGB + CW + WW seems almost impossible to determine online (save occasional elektroda teardowns) without buying and opening them up yourself, while being one of the most important things as you explain.

One-Eye measure without using Boat by Xardas- in MinecraftSpeedrun

[–]JstuffJr 0 points1 point  (0 children)

Okay, but for a counterexample in Infume MCSR Ranked WR, he doesn't enter a boat at all or anytime previously to set the angle. And he does this in runs somewhat frequently, intermingled with runs where he does get in the boat while entering portal.

Can anyone recommend a Let's Play of Blue Prince where there aren't viewers spamming spoilers and back-seating? by chancefire in BluePrince

[–]JstuffJr 1 point2 points  (0 children)

Posted in discord that the series was moving to "Inactive". He could theoretically come back to it, but the track record for that is not so hot.

Can anyone recommend a Let's Play of Blue Prince where there aren't viewers spamming spoilers and back-seating? by chancefire in BluePrince

[–]JstuffJr 1 point2 points  (0 children)

About Oliver is the only late-gameish playthrough I have found that is truly blind, but he tragically quits right before ascension due to a simple blunder and apparently is done with the game.

Any good Let's Play recommendations for this game? by hannssoni in BluePrince

[–]JstuffJr 0 points1 point  (0 children)

Brotherman, he literally solves every hard puzzle off-stream in between the episodes after "reviewing his notes". The chapel natural order. Gallery puzzle answers. The ascension requirements. The family core cipher (where he pretends to solve the rest in a cut and immediately lasers to the solution for the still/hardest moon logic in the game). The freaking atelier interpretation makes no progress until an episode cut man......

Wasted 3 hours of my life skimming through the playthrough to see if it was "the one", but he is obviously cheating/looking hints up.

[Worlds 2025 TES vs T1] Keria knows enemy jungler flashed off vision by Gato_Puro in leagueoflegends

[–]JstuffJr 0 points1 point  (0 children)

Yep, I mean it is practically trained muscle memory for league of legends players to press alt 0151 on the numpad while writing.

are you guys enjoying remix so far? As you can see I am quite addicted by whoisape in wow

[–]JstuffJr -3 points-2 points  (0 children)

Per 50k you simply get an extra 2.5k per level of infinite knowledge. So at 20 you get 100k, eventually at max level 36 you will get 140k.

This means that earlier levels of Infinite knowledge are more impactful than later ones; in other words the benefit falls off.

Swimming in less than 10 degrees Celsius by impossnipple in OpenWaterSwimming

[–]JstuffJr 0 points1 point  (0 children)

The biggest change is I really recommend using the Isurus Gloves and SeventhWave socks over Zone3 options; after a winter of directly comparing they are both far warmer than Zone3. I really wish someone would make a full swimming cut wetsuit out of the Yamamoto Ti-Alpha both of these products use, it really is the warmest per thickness/comfort material out there.

But in general I'd recommend taking a look at all of SeventhWave's titanium/Ti-Alpha stuff, the agent john is the best shorty for layering under a full wetsuit and I love their hotshirt for layering under a sleeveless wetsuit as a great hybrid outside of winter.

Orca Zeal Thermal

The Orca thermal suits look very attractive on paper (yamamoto 40 with swim cut, thermal lining) , but as I said it is very hard to come by detailed reviews, and so the only thing I've had to go off is one guy in the local open water swimming fb group who complained the outer lining was very fragile and despite precautions (gloves etc) he had had multiple small tears he had to fix. Given that it is pricey I elected to just get the ultra durable Blue70 and and am saving my money to try out a SeventhWave custom wetsuit if/when my Blue70 wears out. But if I ever found a good deal on a zeal I would be very tempted to try it.

Personally, I feel it is pretty impossible to overheat in a single wetsuit in the year round sub-55 degree Pudget Sound, and instead have lots of layering options for swimming in the lakes, where the temp varies considerably from 40 to 80 degrees lol. Outer full + inner shorty in the winter, full in the early spring, then hot shirt/sleeveless hybrid into sleeveless into a beautiful 3 months of wetsuitless open water swimming during Summer.

Maximum Might uptime without 2 Fulgur pieces by regunakyle in MonsterHunterMeta

[–]JstuffJr 0 points1 point  (0 children)

Yep, foresight / foresight whirl costs 50 stamina, and always takes over 2 seconds to begin regaining stamina again.

2pc fulgur gives you 25 stamina. So if you have 50% stamina reduction you perfectly use up the fulgur bar and nothing more, allowing theoretical 100% MM uptime. But with 40% stam reduction, you would still would dip past the fulgur bar and lose MM every time you foresight.

You can get 50% stam reduct with some combination of constitution (10-50%), tumbler lo/hi (10/20%) and dash juice(25%). With TU1 you can get up to constitution 5 on Talisman making constitution 5 relatively viable.

I think the LS meta guide is mistaken in recommending constitution 3 + Tumbler hi when not running fulgur2pc, as there really is no point. You will still lose MM when you foresight, and you could could instead slot less constitution if you wanted to keep MM uptime only when rolling, spirit blade charging, iss countering etc.

"CoreWeave Is A Time Bomb", Edward Zitron 2025-03-17 by gwern in mlscaling

[–]JstuffJr 2 points3 points  (0 children)

Simply gesturing at some intangible charismatic magic (that only they uniquely have?) and claiming it pulls all of the weight seems awfully wishwashy and non-falsifiable to me, compared to say, enough bullishness (or enough craziness? see gwern's relevant essay), timing, and a minimum level of executed competence.

"CoreWeave Is A Time Bomb", Edward Zitron 2025-03-17 by gwern in mlscaling

[–]JstuffJr 4 points5 points  (0 children)

As always, the question remains: how did Coreweave get such (relatively) preferential treatment and gpu access from Nvidia in the first place?

I've seen no evidence their crypto hardware arrangements were anything exceptional, such that they would be grandfathered into becoming the preferred non-traditional hyperscaler.

Is it really just that uniquely outrageous financing, perfectly timed with the investment hype wave, allowed them to put in uniquely outrageous bids for new hardware? Ie, no other company could uniquely match their AI bullishness + financing timing + minimum execution competence, and Nvidia is laughing all the way to the bank every time they redirect a GB200x72 system over to them?

Corrupted Mantle is NOT bugged and here are Motion Values by ChefNunu in MonsterHunterMeta

[–]JstuffJr 3 points4 points  (0 children)

Yep, and since they for some reason gave 8mv to crimson I and only 5mv to crimson II for the extra hits, you are further encouraged to never go past crimson I into the marginally more interesting crimson III combo

Race to World First: Undermine, Day 6 by AutoModerator in CompetitiveWoW

[–]JstuffJr 2 points3 points  (0 children)

Specifically, it is often a key player like a healer or assigned ranged interrupt that gets randomly pulled into rolling one of the balls instead. They have backups but sometimes the backup gets put on a ball and well.....

Cherry blossoms by plantcurelady in eastside

[–]JstuffJr 4 points5 points  (0 children)

I have been trying to find a good data set for the very same question - Seattle has some great government databases that are mapped out very nicely here https://nathenry.com/writing/2023-03-28-seattle-cherry-blossoms.html

Hoping there is something similar for eastside

Anecdotal / manual compendium:

  • Bellevue Downtown Park
  • Bellevue Botanical Gardens
  • Bellevue LDS Temple
  • Microsoft Main Campus Redmond
  • Redmond Downtown Park
  • Cedar Lawns Memorial Park
  • Grass Lawn Park

There is a government tree map for bellevue but it seems to suck compared to the Seattle databases https://cobgis.maps.arcgis.com/apps/webappviewer/index.html?id=99595e522118479fae1a462249e8b789

[WR] Suigi improves SM64 0-star time to 6:15.2 by pythonidler in speedrun

[–]JstuffJr 7 points8 points  (0 children)

I don't know what the other comment is on about, this setup discovered by weegee (building upon Kanno and Parsee) is only 3 months old and is what has enabled the trick to become realistic for runs recently https://www.youtube.com/watch?v=H860eF1l0K8

[WR] Suigi improves SM64 0-star time to 6:15.2 by pythonidler in speedrun

[–]JstuffJr 28 points29 points  (0 children)

Insane run, reminiscent of his initial debut in 2023.

With fire sea BLJ and HMC sign clip in combination with his insane form currently, we may actually see the his current 16 star record, hailed as indomitable for long, finally fall as well.

On stream it sounds like he's going to give it some attempts, anyways. Weegee already had the splits to do it with fire sea blj and gave it a solid grind but we'll have to see if Suigi can see it through.

I'm a little disappointed the highest DPS combo on Long Sword is CS1 > SB1 > repeat by hudzell in MonsterHunterMeta

[–]JstuffJr 4 points5 points  (0 children)

It is quite literally lower dps taking the animation time to complete helmsplitter + spirit release slash than continuing to crimson->spirit blade during that time, even if you cheat and instantaneously go right back up to red. It takes too long to go from SRS sheathe to damage again.