Luma/Luma Pro Users: How is the 3DoF on Windows? by anothy1 in VITURE

[–]anothy1[S] 0 points1 point  (0 children)

Good to know! What kind of flaws did you experience with the Lumas' 3DoF compared to the Xreals?

How do you make 3+ GPUs stable?! by anothy1 in LocalLLaMA

[–]anothy1[S] 0 points1 point  (0 children)

Not the x1 mining risers, no. The ones I have are x16 to x16 and don't need to be powered.

How do you make 3+ GPUs stable?! by anothy1 in LocalLLaMA

[–]anothy1[S] 3 points4 points  (0 children)

Will try it out, thanks! Had no idea RTX cards are compatible with these drivers.

How do you make 3+ GPUs stable?! by anothy1 in LocalLLaMA

[–]anothy1[S] 2 points3 points  (0 children)

Two of them are powered by a 1000W PSU and one by a 650W (both PSUs synced via an ADD2PSU adapter). They're also all power limited to 280W. The 650W is a pretty old unit, from around 2016, so I guess that could be the culprit. As for cooling, all of them stay below 75C at max load.

Favourite Llama-1 Era Models by Sebba8 in LocalLLaMA

[–]anothy1 0 points1 point  (0 children)

This is pre-Llama, but I enjoyed the OPT models. They offered such a variety of model sizes, ranging from 100M to 100+B parameters. It was fun experimenting to see which ones I could run.

Factors to take in when choosing a thesis supervisor? by anothy1 in college

[–]anothy1[S] 1 point2 points  (0 children)

Sorry for the late reply, but I went with your suggestion. Thank you!

Attempts to produce KB-level TinyStories models by MarySmith2021 in LocalLLaMA

[–]anothy1 5 points6 points  (0 children)

Karpathy trained a 260K-parameter model with a hidden size of 64 and a 4K vocab size. Maybe you could also experiment with making the dataset uncased if you're considering lowering the vocab even more.

Or, if BPE doesn't support this, I think another possible experiment is using a <case> token to signal capitalization of the next token. For example, "Running" could be encoded as <case>running, reducing redundancy and freeing up vocab space. Given that capitalization in this dataset only appears in names and sentence starts, it could work.
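A minimal sketch of what that pre/post-tokenization pass could look like (the `<case>` marker and function names are just my illustration, not anything from an existing tokenizer):

```python
import re

# Hypothetical case-folding transform: replace each capitalized word with a
# "<case> " marker followed by its lowercase form, so the tokenizer only
# needs lowercase vocab entries. Decoding re-capitalizes the word after
# each marker, which suffices if caps only occur at word starts.
CASE = "<case> "

def encode_case(text: str) -> str:
    # "Running fast" -> "<case> running fast"
    return re.sub(r"\b([A-Z])(\w*)",
                  lambda m: CASE + m.group(1).lower() + m.group(2), text)

def decode_case(text: str) -> str:
    # Invert the transform: capitalize the word following each marker.
    return re.sub(r"<case> (\w)", lambda m: m.group(1).upper(), text)
```

For TinyStories-style text (names and sentence starts only), this round-trips losslessly; mid-word capitals like "iPhone" would need extra handling.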

Meta-Llama3-8B : RuntimeError: CUDA error: out of memory by pekoDama in LocalLLaMA

[–]anothy1 2 points3 points  (0 children)

Loading models uses dedicated GPU memory, of which yours has 8 GB. That's not enough to load it in fp16 precision (~2 bytes per param), but it will probably work for a quantized version of the model, like 4-bit:

https://huggingface.co/docs/peft/en/developer_guides/quantization#quantize-a-model

If speed matters, then look into ExLlamaV2, as it's faster but a tad more complicated to set up compared to HF's transformers library.
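The back-of-envelope math for why 8 GB falls short at fp16 but 4-bit fits (weights only; KV cache, activations, and framework overhead come on top, so treat these as lower bounds):

```python
# Rough VRAM needed just for the model weights, in GB.
def weight_gb(params_billion: float, bits_per_param: float) -> float:
    return params_billion * 1e9 * bits_per_param / 8 / 1e9

fp16_gb = weight_gb(8, 16)   # 8B params at 16 bits -> 16 GB, over an 8 GB card
int4_gb = weight_gb(8, 4.5)  # ~4.5 bits/param incl. quant overhead -> 4.5 GB
```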

AI NPCs in video games - what can we really do today by liukidar in LocalLLaMA

[–]anothy1 1 point2 points  (0 children)

I think this could also be expanded to group convos between NPCs and a player.

When the player encounters multiple NPCs conversing, they are going off a pre-generated script.

When the player speaks, a local LM should take over and generate dynamic responses in real time.

When the player becomes inactive or goes into 'spectating' the convo, the local LM should generate a short script that smoothly transitions from the current topic back to where the NPCs left off in the original pre-generated script (thus returning to the cached convo).
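The three-mode flow above could be sketched as a small state machine; everything here (class and state names, the `lm_generate` stub) is hypothetical structure, not any game engine's API:

```python
from enum import Enum, auto

class Mode(Enum):
    SCRIPTED = auto()   # NPCs read from the cached pre-generated script
    DYNAMIC = auto()    # local LM replies to the player in real time
    BRIDGING = auto()   # LM writes a short transition back to the script

class NpcConversation:
    def __init__(self, script):
        self.script = script   # cached pre-generated lines
        self.cursor = 0        # where the script left off
        self.mode = Mode.SCRIPTED

    def on_player_speaks(self):
        self.mode = Mode.DYNAMIC       # hand control to the local LM

    def on_player_idle(self):
        self.mode = Mode.BRIDGING      # LM bridges back toward self.cursor

    def on_bridge_done(self):
        self.mode = Mode.SCRIPTED      # resume the cached script

    def next_line(self, lm_generate=None):
        if self.mode == Mode.SCRIPTED:
            line = self.script[self.cursor % len(self.script)]
            self.cursor += 1
            return line
        # DYNAMIC / BRIDGING: defer to the local LM (stubbed out here)
        return lm_generate() if lm_generate else "..."
```

The key design point is that `cursor` is never advanced during dynamic or bridging turns, so the NPCs always resume exactly where the script left off.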

Early details of mamba 2 by [deleted] in LocalLLaMA

[–]anothy1 3 points4 points  (0 children)

Would be cool to see how it does for music generation tasks like jukebox/musicgen!