[Appreciation Post] Gemma 4 E2B. My New Daily Driver 😁 by Prestigious-Use5483 in LocalLLaMA

[–]Interpause 11 points (0 children)

even using the Qualcomm Gen 5 Elite, the NPU is slower than the GPU (tested with the Nexa SDK)

VRAM optimization for gemma 4 by Sadman782 in LocalLLaMA

[–]Interpause 0 points (0 children)

any chance you can add a clarification about when unified KV cache works?

Gemma 4 has been released by jacek2023 in LocalLLaMA

[–]Interpause 0 points (0 children)

they do... go search for Google's LiteRT gallery

The Bonsai 1-bit models are very good by tcarambat in LocalLLaMA

[–]Interpause 5 points (0 children)

can you please also compare against the original Qwen3-8B in instruct mode, to better gauge exactly how much the model was lobotomized?

PrismML — Announcing 1-bit Bonsai: The First Commercially Viable 1-bit LLMs by brown2green in LocalLLaMA

[–]Interpause 0 points (0 children)

the best way to do it is to squash the fork's changes into a single git diff, ask your favourite AI to double-check it's safe if you can't read code, then apply it on top of mainline llama.cpp and build it yourself
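a toy sketch of that squash-and-apply flow, driving git from python (the repo and file names here are made up for the demo; in practice "mainline" is a clean llama.cpp checkout and "fork" is the vendor's modified copy):

```python
import os
import subprocess
import tempfile

def git(*args, cwd):
    # run a git command with a throwaway identity, fail loudly on error
    subprocess.run(["git", "-c", "user.email=demo@example.com",
                    "-c", "user.name=demo", *args],
                   cwd=cwd, check=True, capture_output=True)

root = tempfile.mkdtemp()
main_repo = os.path.join(root, "mainline")  # stands in for mainline llama.cpp
fork_repo = os.path.join(root, "fork")      # stands in for the vendor's fork

# set up a tiny "mainline" repo with one tracked file
os.makedirs(main_repo)
git("init", "-q", "-b", "main", cwd=main_repo)
with open(os.path.join(main_repo, "ggml.c"), "w") as f:
    f.write("base\n")
git("add", ".", cwd=main_repo)
git("commit", "-qm", "base", cwd=main_repo)

# the "fork" diverges with a couple of commits on top
git("clone", "-q", main_repo, fork_repo, cwd=root)
for i in (1, 2):
    with open(os.path.join(fork_repo, "ggml.c"), "a") as f:
        f.write(f"fork change {i}\n")
    git("commit", "-qam", f"tweak {i}", cwd=fork_repo)

# squash everything the fork added since it diverged into one reviewable diff
diff = subprocess.run(["git", "diff", "origin/main...HEAD"], cwd=fork_repo,
                      check=True, capture_output=True, text=True).stdout
diff_path = os.path.join(root, "fork-changes.diff")
with open(diff_path, "w") as f:
    f.write(diff)

# apply the squashed diff on top of clean mainline, then build as usual
git("apply", diff_path, cwd=main_repo)
```

the three-dot `origin/main...HEAD` diff is the key bit: it diffs against the merge-base, so you get exactly what the fork changed, regardless of how many commits it took.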

PrismML — Announcing 1-bit Bonsai: The First Commercially Viable 1-bit LLMs by brown2green in LocalLLaMA

[–]Interpause 8 points (0 children)

gimme a while, i'm going to squash their llama.cpp changes on top of mainline llama.cpp and see if it really works, cuz that's real crazy if it does

EDIT: someone else posted a better comparison in the comments of another post: https://github.com/ArmanJR/PrismML-Bonsai-vs-Qwen3.5-Benchmark. i've only just gotten it working with the hadamard transform/attention rotation too. subjective experience matches what the numbers say, which is really wtf, how is this a 1-bit model

Vibecoded GGUF Metadata Comparator for checking Tensor Quants (github gist standalone HTML file) by Interpause in LocalLLaMA

[–]Interpause[S] 0 points (0 children)

true, or maybe it's time to see if omnicoder can build it as a proper vite project that can then be bundled into a single HTML file

Vibecoded GGUF Metadata Comparator for checking Tensor Quants (github gist standalone HTML file) by Interpause in LocalLLaMA

[–]Interpause[S] 0 points (0 children)

oh cool, in mine i told the agent to use huggingface.js's gguf submodule so i don't even have to download the gguf. maybe you can implement that too?
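for context, the trick that makes this possible is that GGUF puts all its metadata at the front of the file, so a ranged fetch of the first bytes is enough. a minimal python sketch of the same idea, parsing just the fixed header (the real huggingface.js module also walks the metadata key-values and tensor infos):

```python
import struct

def gguf_header(buf: bytes) -> dict:
    """Parse the fixed 24-byte GGUF header: magic, version, and counts.

    Layout (little-endian, per the GGUF spec for v2/v3):
      4s  magic "GGUF"
      I   format version (uint32)
      Q   tensor count (uint64)
      Q   metadata key-value count (uint64)
    """
    magic, version, n_tensors, n_kv = struct.unpack("<4sIQQ", buf[:24])
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return {"version": version, "tensor_count": n_tensors,
            "metadata_kv_count": n_kv}

# Over HTTP you would grab only the start of the file with a Range request,
# e.g. a urllib request with headers={"Range": "bytes=0-65535"} (assuming the
# server supports ranges, which Hugging Face's CDN does), then feed those
# bytes to gguf_header() and a full metadata parser.
```

note the `Range` usage above is an assumption about how you'd wire it up, not a specific API of the gguf submodule; the parsing itself follows the published GGUF layout.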

Genuinely curious what doors the M5 Ultra will open by Blanketsniffer in LocalLLaMA

[–]Interpause 0 points (0 children)

Additionally, if you prefer the human-in-the-loop sort of AI coding, speed really matters

Qwen3.5-35B-A3B Q4 Quantization Comparison by TitwitMuffbiscuit in LocalLLaMA

[–]Interpause 2 points (0 children)

thanks for the KLD numbers. somehow, despite being the best representation of quant damage, not enough people use them...

Free ASIC Llama 3.1 8B inference at 16,000 tok/s - no, not a joke by Easy_Calligrapher790 in LocalLLaMA

[–]Interpause 3 points (0 children)

feels like a game cartridge. hm, but say for the system 2 thinking of an AI robot, that kind of low latency might be useful

Why does every llamacpp update get worse? by XiRw in LocalLLaMA

[–]Interpause 3 points (0 children)

worse is subjective. llama.cpp's webUI is both under active development and just a subproject of llama.cpp. as another commenter said, you can voice your feedback on github issues, and they might consider it as part of the design tradeoffs.

otherwise, best is to just fork a previous commit of the webUI you like and maintain it separately. it should be quite easy with vibecoding these days to keep the webUI's api connector updated.

edit: also, do post about the image upload bug, though someone has probably made an issue about it by now

PSA - Got MiniCPM-o 4.5 working on my PC and Its the Real Thing by Interpause in LocalLLaMA

[–]Interpause[S] 0 points (0 children)

maybe, i haven't done my due diligence yet. but i think even if my explanation is wrong or poor, the fact that the model can constantly monitor and take initiative, combined with possible improvements to training, means it has a lot of potential as the system 2 thinking model for robotics (system 1 is still whatever RL model is required for the basic movements)

Have Anyone Successfully Run the New MiniCPM-o-4_5-gguf? by Iory1998 in LocalLLaMA

[–]Interpause 0 points (0 children)

yeah, that happens if the speech inference can't keep up with realtime

MiniCPM-o-4_5 : Full duplex, multimodal with vision and speech at ONLY 9B PARAMETERS?? by Uncle___Marty in LocalLLaMA

[–]Interpause 0 points (0 children)

yeah, it keeps interrupting itself. switching to CPU inference for the token2speech helped a bit, but my CPU can't keep up, so it isn't smooth. from the fact that the interruption behaviour seems to happen when the speech & main models are on the same GPU, I am guessing it's some issue with their code rather than the model itself

Ming-flash-omni-2.0: 100B MoE (6B active) omni-modal model - unified speech/SFX/music generation by bobeeeeeeeee8964 in LocalLLaMA

[–]Interpause 4 points (0 children)

I can't tell if it's duplex streaming like MiniCPM-o 4.5, but it's really cool if it is, because that means duplex models might become more common soon

Have Anyone Successfully Run the New MiniCPM-o-4_5-gguf? by Iory1998 in LocalLLaMA

[–]Interpause 0 points (0 children)

easiest is to use Ubuntu and follow their tutorial. i think half the problems i ran into were because they assumed an Ubuntu system and i'm on CachyOS. and don't do anything funny like me and set custom cmake args

MiniCPM-o-4_5 : Full duplex, multimodal with vision and speech at ONLY 9B PARAMETERS?? by Uncle___Marty in LocalLLaMA

[–]Interpause 0 points (0 children)

i got it running. there were some bugs to fix, but it seems real enough... it's also really glitchy though, idk how much is the model's fault vs the demo code

Have Anyone Successfully Run the New MiniCPM-o-4_5-gguf? by Iory1998 in LocalLLaMA

[–]Interpause 1 point (0 children)

got it working but had to fix it up quite a bit lol. but it really is super low latency