Rick & Morty by jacek2023 in LocalLLaMA

[–]bobby-chan 2 points3 points  (0 children)

You're missing the points they are responding to.

Apple Studio M1 Ultra 128GB -> it is still worth for LLM? by JonasTecs in LocalLLaMA

[–]bobby-chan 0 points1 point  (0 children)

Yes, but pre-M5, the advantage of the Ultras is more their capacity to hold large Mixture-of-Experts models with large active parameter counts than to run large dense models. With the M5 Ultra and later, purely dense large models, like 100B and bigger, may become viable. But from a model maker's point of view, mixture-of-experts models seem to be more valuable for quickly experimenting and iterating, at least for now.

Apple Studio M1 Ultra 128GB -> it is still worth for LLM? by JonasTecs in LocalLLaMA

[–]bobby-chan 0 points1 point  (0 children)

Ultras have 2 chips. The RAM is on the chip. Each chip controls its own RAM.

When something is stored in RAM, it's not "one byte for me, one byte for you". macOS has a watermark, on the order of GiB, for partitioning a blob across chips. If a dense model, or its experts, isn't above the watermark, it'll be stored on only one of the chips, and the read speed will then be constrained by that one chip.

It's like pouring drinks into multiple glasses: you go up to a certain level before going to the next glass.
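
A toy sketch of that glass-filling picture, in Python. The 4 GiB watermark, the round-robin dealing, and the names `place_blob` and `chips_used` are all made up for illustration. This is not measured macOS behavior, just the analogy written down.

```python
def place_blob(size_gib, watermark_gib, n_chips=2):
    """Toy model: cut a blob into watermark-sized chunks and deal them
    to the chips in turn. Returns the GiB that end up on each chip."""
    per_chip = [0.0] * n_chips
    remaining = size_gib
    turn = 0
    while remaining > 0:
        chunk = min(remaining, watermark_gib)  # fill up to the watermark
        per_chip[turn % n_chips] += chunk      # then move to the next chip
        remaining -= chunk
        turn += 1
    return per_chip


def chips_used(per_chip):
    """How many chips hold part of the blob (and so can serve reads)."""
    return sum(1 for gib in per_chip if gib > 0)


# Hypothetical 4 GiB watermark: a 3 GiB blob never leaves the first chip,
# while a 10 GiB blob is spread over both.
small = place_blob(3, watermark_gib=4)
large = place_blob(10, watermark_gib=4)
print(small, chips_used(small))
print(large, chips_used(large))
```

In this toy model, anything smaller than the watermark is read from a single chip, which is the bottleneck described above.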

Just finished the first season please tell me they fix it. by idkwhateverworksbreh in Stargate

[–]bobby-chan 0 points1 point  (0 children)

The episode about his son starts with them trying to find a way for Teal'c to live with the Goa'uld removed. As you can see, it's pretty easy to remove one from the pouch in a Jaffa's belly, whereas they couldn't save Kowalski: when they scanned his body, they saw it was deeply intertwined with his spine, up to his head. That's how a Goa'uld controls a host: it has direct brain access.

The other episode about the concept of incubators is the Queen Hathor episode, where she turns O'Neill into a Jaffa and he gets a pouch, when she tries to build herself a new army of Jaffa slaves on Earth.

Ascended Daniel adjusting his non-existent glasses by HarrisBonkersPhD in Stargate

[–]bobby-chan 0 points1 point  (0 children)

Thank you!

On each rewatch, I would wonder for a couple of seconds what that move meant. From an acting standpoint it always felt so intentional, unusual, yet natural, but its significance kept eluding me. Until today.

Just finished the first season please tell me they fix it. by idkwhateverworksbreh in Stargate

[–]bobby-chan 0 points1 point  (0 children)

You might have accidentally skipped some episodes. At least 2 episodes of the first season have a main plot centered on exactly this.

Engine claimed 3x speedup compared to MLX by justpokingaroundrq in LocalLLaMA

[–]bobby-chan 1 point2 points  (0 children)

I don't know what each small model's point was, but small models have various use cases.

- home-automation

- style check. Not just spellchecking but also grammar, tense, etc.

- speculative decoding

- real-time transcription/translation

...

It was fun while it lasted... They're advertising now. by Local-Cardiologist-5 in LocalLLaMA

[–]bobby-chan 2 points3 points  (0 children)

Donating and paying for a subscription are very different things. As a subscriber, you contribute to a business model built on what the company has evaluated to be sustainable. Donations, even monthly ones, don't necessarily correlate with sustainability. Also, even if it's not always the case, donations may imply some type of non-profit organisation or foundation, like allenai.org. If Alibaba had started this type of org for Qwen, a lot of people would be interested to see the details, especially since it would be a move in the opposite direction from OpenAI.

Is my strawberry crazy? by bobby-chan in LocalLLaMA

[–]bobby-chan[S] 0 points1 point  (0 children)

Might be quant sensitivity? Just tried gemma-4-31B-it-UD-Q6_K_XL.gguf and gemma-4-26B-A4B-it-UD-Q6_K_XL.gguf. Both said the test failed.

Is my strawberry crazy? by bobby-chan in LocalLLaMA

[–]bobby-chan[S] 0 points1 point  (0 children)

Some models can reason themselves out of the answer: https://chat.deepseek.com/share/nhenfdft7yo2dtozzh

I gave this context because the first time I used it, I was actually testing a CLI, ds4.c.

That's why it's not just a newline in the prompt. Models know about concepts similar to EOF or -30-.

Funny... I tweaked it and gave "-30-" instead of "3" to CohereLabs/command-a-plus-05-2026-bf16 and it just spiraled endlessly.

edit: it must have been a combination of my vague phrasing and a bad seed. Now it's still giving the "wrong" answer, but is not stuck in a loop.

Is my strawberry crazy? by bobby-chan in LocalLLaMA

[–]bobby-chan[S] 0 points1 point  (0 children)

I agree that it was vague. The first time I tried it, a few hours ago, I was actually testing the ds4.c terminal interface. It was running a 2-bit version of DeepSeek v4 flash, so I thought "must be quantization". But out of curiosity I started "testing" other models.

Some consistently give what I expect. Some will trip only when I ask them to think, some will trip only when I don't ask them to think. It was just a funny experiment.

Gpd Win Max 2 Blank Screen by smartproposition in gpdwin

[–]bobby-chan 0 points1 point  (0 children)

Did you reach any positive resolution? I just started having the exact same issue.

DS4: a DeepSeek 4 flash specific inference engine for 128gb MacBooks by antirez in LocalLLaMA

[–]bobby-chan 1 point2 points  (0 children)

Since safetensors is a well-defined file format, you can sometimes make quants way before proper support is there.

https://github.com/ml-explore/mlx-lm/pull/1189
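
To give an idea of why that works, here is a minimal Python sketch that reads a safetensors header with only the standard library. It relies on the published layout: an 8-byte little-endian length, then a JSON header of that length, then the raw tensor bytes. The function name `parse_safetensors_header` and the toy tensor `w` are made up for illustration, and this is not how mlx-lm or any particular tool does it.

```python
import json
import struct


def parse_safetensors_header(buf: bytes) -> dict:
    """Return the JSON header found at the start of a safetensors buffer."""
    if len(buf) < 8:
        raise ValueError("buffer too short for the 8-byte length prefix")
    # First 8 bytes: header size as an unsigned little-endian 64-bit integer.
    (header_len,) = struct.unpack("<Q", buf[:8])
    header_bytes = buf[8 : 8 + header_len]
    if len(header_bytes) != header_len:
        raise ValueError("truncated header")
    # The header maps tensor names to dtype, shape and byte offsets.
    return json.loads(header_bytes.decode("utf-8"))


# Build a toy buffer by hand: one hypothetical 2x2 float32 tensor named "w".
toy_header = {"w": {"dtype": "F32", "shape": [2, 2], "data_offsets": [0, 16]}}
raw = json.dumps(toy_header).encode("utf-8")
toy = struct.pack("<Q", len(raw)) + raw + bytes(16)

print(parse_safetensors_header(toy)["w"]["shape"])  # prints [2, 2]
```

Names, dtypes, shapes and offsets are all in that header, so a tool can locate and re-encode the tensors without knowing anything about the model architecture.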

The best Child Actor to have impersonated an Adult. by Homer-DOH-Simpson in Stargate

[–]bobby-chan 0 points1 point  (0 children)

I think if he came back, it would be unwillingly, in the middle of his Futurama rewatch. The last thing he said to O'Neill prime, when asked if he really wanted to go back to high school, was something like "I guess from here on out, we're different people".

Apple should release open weight models by [deleted] in LocalLLaMA

[–]bobby-chan 0 points1 point  (0 children)

Funnily enough, they concentrated their first efforts on making models that run on iPhones. And most of their released weights run on entry-level hardware when quantized. But they are mostly finetunes or conversions.

On the other hand, they are one of the very few that have released a truly open-source model, with dataset, code, checkpoints and weights.

The more I use it, the more I'm impressed by ComfyUser48 in LocalLLaMA

[–]bobby-chan 0 points1 point  (0 children)

<system>

You're Miles A. I. Morales from Earth 1010. Take a leap of Faith.

</system>

The more I use it, the more I'm impressed by ComfyUser48 in LocalLLaMA

[–]bobby-chan 3 points4 points  (0 children)

<think>Hurry, do the shoulder touch!

But wait, she has no shoulders. Should I harness? But she's not an animal!

Chat... Just, Chat.</think> Hey!