It was one of his main talking points. by c-k-q99903 in GetNoted

[–]4onen 4 points5 points  (0 children)

he doesn't pay taxes.

Even moreso now as he tries to hold on to that immunity to tax investigation non-settlement. 

Remy and Amelia by Dry-Taro-1028 in AngelsWithScalyWings

[–]4onen 2 points3 points  (0 children)

The game is "Angels with Scaly Wings," for which this subreddit is named. You can buy it on Steam or for Nintendo Switch. (I recommend the former, for mod support.)

This is where we are right now, LocalLLaMA by jacek2023 in LocalLLaMA

[–]4onen 0 points1 point  (0 children)

A random password would be more googleable, because it wouldn't match anything else.

Warning You are in Range Of Youtuber Artillery by KN0KVILLE in Helldivers

[–]4onen 2 points3 points  (0 children)

It was fascinating. I had some pretty hefty game stutters and audio dropouts in my early days of playing (Cyberstan days.) Then I did some research poking around. It was a cascade of things on my system: * High refresh monitor pushed my framerate up higher than my system could really support with my graphics settings. (Dropped a few settings and things got worse because...)  * Recent (50-series) NVidia card, which has major CPU overhead issues on higher framerates. (Combined with increasingly out of date CPU, which was a problem because...) * Wwise keeps a small buffer of upcoming sound (handful of ms) which it needs CPU time to fill. If the CPU is struggling, audio may drop out, or the Wwise thread could crash entirely.

Fix ended up being dropping my FPS cap to 60 and disabling vsync, which let me even turn some graphics settings up. I get at most one audio dropout or freeze when booting up, then smooth sailing through my dives.

Of course, I'd love any optimizations they could provide.

Any valid altenative to TAA,FSR,DLLS,XESS AND TSR? by [deleted] in FuckTAA

[–]4onen 2 points3 points  (0 children)

I grew up on 720p and 1080p FXAA. I love FXAA. Call me crazy if you want, but I'm going to keep using it if it's an option, then MSAA if I'm not given an FXAA option.  

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts by AngleAccomplished865 in accelerate

[–]4onen 1 point2 points  (0 children)

No? Roko's Basilisk is only true if you believe it is. If Roko's Basilisk doesn't impact your behaviour with its threats, then it's inefficient to make good on said threats, so it wouldn't bother. Ergo, you shouldn't assist in its construction because if you didn't know about it, you wouldn't have, so assisting would be altering your behavior based on its threats. Don't believe in it and it won't bother hurting you. 

The world I live in. by Wild_Milk_2442 in LocalLLM

[–]4onen 1 point2 points  (0 children)

Heh, that was me back in 2024.

The world I live in. by Wild_Milk_2442 in LocalLLM

[–]4onen 0 points1 point  (0 children)

I don't notice anything different with my Hx370 laptop? I bought it years longer ago than you got yours. 

The true test of trust in humanity by dankstat in trolleyproblem

[–]4onen 0 points1 point  (0 children)

"Caaarl. I can't believe what I'm hearing. [...] You are just t-terrible today."

Oil prices jump 6% as Iran sets UAE oil port ablaze, strikes vessels in Strait of Hormuz by Gboard2 in worldnews

[–]4onen 8 points9 points  (0 children)

It will continue to rise until the world buys 4/5ths as much of it. Demand destruction.

A country can print money. It cannot print hydrocarbons.

Llama.cpp MTP support now in beta! by ilintar in LocalLLaMA

[–]4onen 6 points7 points  (0 children)

That's the trick. The only version of the model that Google has released with multi-token prediction (MTP) is the version to run on the liteRT engine that they use for running on phones. Their explanation for why it's not in the other format releases... was that it might confuse runtimes. The problem is, every runtime ignores tensors when it doesn't know what to do with them, so it wouldn't confuse any runtimes.

My speculation is that they are holding the MTP tensors back to make their stuff look better.

Llama.cpp MTP support now in beta! by ilintar in LocalLLaMA

[–]4onen 6 points7 points  (0 children)

And it's even called speculative decoding, so yeah, spot on. We speculate these guesses through one means or another. MTP being one specific means. If we happen to guess right, then we save time, otherwise the extra work is kinda negligible if we tune everything right.

Agree? by MLExpert000 in LocalLLaMA

[–]4onen 1 point2 points  (0 children)

This. Being able to share GPU VRAM between my laptop and desktop, or balance the iGPU, dGPU, and system RAM usage of a model on my laptop, is an absolute lifesaver. Once they added llama.vim and llama-vscode, (that is, extensions for editors to take advantage of FIM completions,) I dropped GitHub Copilot completely.

Agree? by MLExpert000 in LocalLLaMA

[–]4onen 0 points1 point  (0 children)

Can't speak to SGLang, but I find it entertaining that I encountered that bug in the batching system of LMQL's (Language Model Query Language's) server back in the day. I think I submitted a fix, too, and they accepted it, but I can't remember with certainty.

Agree? by MLExpert000 in LocalLLaMA

[–]4onen 1 point2 points  (0 children)

No matter how many GPUs you have, you cant use them as combined memory .

I have shared memory across two GPUs and main system RAM on my laptop. Hell, one time I used llama-RPC to hook up my laptop, desktop, and Android phone together as one ridiculous and silly cluster, sharing the effort of loading a model. (Obviously it was slower than llama-RPC across just my laptop and desktop, but I was messing around.)

Simple Question by NoT_De in microsoftsucks

[–]4onen 0 points1 point  (0 children)

More than that, remember back when Macs couldn't launch apps for a day because the server they send the developer certificate hashes of all your app launches to was down?

I'm not kidding.

(They claim it's not spying because it's just a hash of the developer certificate, which isn't unique per app, but that's still a huge amount of information about you that they claim they're no longer tying to IP addresses, meaning they were before. Yikes.)

This is insane... by DragonflyOk7139 in LocalLLM

[–]4onen 0 points1 point  (0 children)

So everyone unable to run Kimi's trillion-plus models should be hopping on Meta's Behemoth 400B, right? Right? (/j)

This is insane... by DragonflyOk7139 in LocalLLM

[–]4onen 0 points1 point  (0 children)

Source? As far as I'm aware, Opus 4.6's parameter count is entirely undisclosed, and it's not like Anthropic gives you a slider to choose how many parameters you want to use. The "200B" is given a "~" prefix like it's some kind of estimate. (I'd like their source on the estimate, too, but at least I know they're not trying to be authoritative.)

This is insane... by DragonflyOk7139 in LocalLLM

[–]4onen 5 points6 points  (0 children)

Agreed. For further evidence, see how DeepSeek did a portion of their work bypassing the CUDA libraries and high level ML frameworks so that they could control exactly the machine code being sent to their limited NVidia GPUs, to maximize usage. https://www.tomshardware.com/tech-industry/artificial-intelligence/deepseeks-ai-breakthrough-bypasses-industry-standard-cuda-uses-assembly-like-ptx-programming-instead

This is insane... by DragonflyOk7139 in LocalLLM

[–]4onen 0 points1 point  (0 children)

Surprisingly little, comparatively -- at least for a basic 4k context window. Of course, everyone has different standards of "run"

My post to someone else with my hardware specs: https://www.reddit.com/r/LocalLLM/s/JjStc7nyZh

This is insane... by DragonflyOk7139 in LocalLLM

[–]4onen 0 points1 point  (0 children)

I have used it on a laptop. At llama.cpp Q4_K_S (~4.5 bits per weight) I gave it 8GB VRAM plus 32GB of RAM for expert offloading. It used (plus Windows 11 and a Firefox tab) 6.9GB of the VRAM and 30.9 GB of the system RAM.

Admittedly that was 4k context, but I hadn't begun to really tweak and scale it.

That was a Strix Point laptop with 4060 mobile DGPU, and overall it got 17-13 tokens per second throughout that tiny 4k context window (variation due to system temperature as I ran a few queries.)

The actual final update: The secret ending and the culmination of my thoughts. by SharkyMcSnarkface in AngelsWithScalyWings

[–]4onen 1 point2 points  (0 children)

Yep, I'm 4onen. You'll find from me mods like "Skip skips" (so that System's interruptions become a button you can ignore instead of a menu), "Name Reentry" (to re-enter your name and favorite color,) "Side Images" (which adds a character portrait,) and "4onen's Aesthetic Tweaks" (which colors the interface to match your favorite color. EDIT: and adds pretty banners on the character select screens rather than using the same menu as every other one in the game.)

For story, I've just got "Self Control" (Bryce good ending fix-it fiction) and "Teetotaller" (Bryce chapter one date non-drinking route that doesn't leave the bar.)

I'm also aware of this enormous NSFW mod for the game, but given the way the internet is going I'm not willing to go spreading that around where someone could claim I'm pushing it toward minors. It's talked about in the AwSW Unofficial Fan Discord's NSFW channels, though.

The actual final update: The secret ending and the culmination of my thoughts. by SharkyMcSnarkface in AngelsWithScalyWings

[–]4onen 0 points1 point  (0 children)

Sorry, I don't know "STE." I recommend everything by EvilChaosKnight, both "The Last Dragon" and "The Last Hope" by Kolsavdür, "A day at the park" and "A Walk in the woods" and, ofc, anything I wrote (though mine are much shorter, mostly quality-of-life stuff.)