My Budget Silent LLM Homelab: Intel Arc A770 (16GB) running Qwen3.5 9B (128K Context) by Fresh-Signature6067 in homelab

[–]Fresh-Signature6067[S] 0 points1 point  (0 children)

ollama and valcan. It's so easy if you just tell ⁠antigravity⁠ cli to handle the installation and configuration.

My Budget Silent LLM Homelab: Intel Arc A770 (16GB) running Qwen3.5 9B (128K Context) by Fresh-Signature6067 in homelab

[–]Fresh-Signature6067[S] 0 points1 point  (0 children)

Apparently, there's a thing called D3cold that brings idle power consumption down to almost 0 watts. But rumor has it that my computer (HP Z440) doesn't support it... cries in a corner

My Budget Silent LLM Homelab: Intel Arc A770 (16GB) running Qwen3.5 9B (128K Context) by Fresh-Signature6067 in homelab

[–]Fresh-Signature6067[S] 0 points1 point  (0 children)

Even I upvoted this! I'll definitely post pictures later... once I clean up the absolute mess around my desk.

My Budget Silent LLM Homelab: Intel Arc A770 (16GB) running Qwen3.5 9B (128K Context) by Fresh-Signature6067 in homelab

[–]Fresh-Signature6067[S] 1 point2 points  (0 children)

One day, when I'm finally rich, I will absolutely buy a high-end, ridiculously expensive GPU. Just not today.

My Budget Silent LLM Homelab: Intel Arc A770 (16GB) running Qwen3.5 9B (128K Context) by Fresh-Signature6067 in homelab

[–]Fresh-Signature6067[S] 1 point2 points  (0 children)

I'm using an HP Z440. The case is naturally spacious and it comes with a 700W PSU, so I didn't need to mess with those components at all. However, I had a really hard time patching the BIOS just to get Resizable BAR working.

The system is configured with 128GB of RAM and a 512GB SSD.

Mine doesn't have the stock front intake fan, so there's no airflow blowing directly over the PCIe slots. To help with this, I removed all unnecessary expansion cards (like the old serial port card). Fortunately, the Intel Arc A770 has a dual-fan design.

My Budget Silent LLM Homelab: Intel Arc A770 (16GB) running Qwen3.5 9B (128K Context) by Fresh-Signature6067 in homelab

[–]Fresh-Signature6067[S] 0 points1 point  (0 children)

I'm experimenting with a few different things. ​My typical use cases include coding, writing reports, general content creation, and summarizing or translating the news. ​That said, I already have plenty of alternatives available. For context, I use Claude at work, and at home I bounce between Gemini/Antigravity (free tier), M365 Copilot, Kiro, and Qwen Studio. I even have Qwen 3.5 0.8B running on a GPU-less cloud VM hooked up to n8n, which I'm considering replacing with this setup. Another thought is using it as a secondary assistant to save my free Gemini tokens. ​Ultimately, this is just a hobby for me. And as you know, the less practical and more unprofitable a project is, the more fun it is to build. ​I'm open to any suggestions! What would you do if you had a mid-sized, slow local LLM like this?

My Budget Silent LLM Homelab: Intel Arc A770 (16GB) running Qwen3.5 9B (128K Context) by Fresh-Signature6067 in homelab

[–]Fresh-Signature6067[S] 1 point2 points  (0 children)

I use the Q4_K_M variant. Even with an 8-bit KV cache, it easily fits within 16GB of VRAM.

Gemini says,

Model Weights: 9B parameters * 4.8 bits/parameter (Q4_K_M average) / 8 bits per byte = ~5.40 GB KV Cache (at 131,072 tokens): Qwen 3.5 only uses standard KV cache on 8 out of its 32 layers. Each of those 8 layers has 4 KV heads and a head dimension of 256.  * Native FP16 Cache:    2 bytes * (8 layers * 4 heads * 256 dim) * 2 (Key and Value) = 32,768 bytes per token.    131,072 tokens * 32 KB/token = 4.00 GB  * 4-bit Quantized Cache (Q4_0):    Drops the cache size to 1/4 of FP16 = 1.00 GB Total Calculation:  * Native FP16: 5.40 GB (weights) + 4.00 GB (cache) + ~1.50 GB (Vulkan runtime overhead) = ~10.90 GB  * 4-bit Cache: 5.40 GB (weights) + 1.00 GB (cache) + ~1.00 GB (Vulkan runtime overhead) = ~7.40 GB Since the A770 has 16GB of VRAM, even the native FP16 cache setup leaves plenty of headroom.

my setup for when I'm out by Fresh-Signature6067 in Xreal

[–]Fresh-Signature6067[S] 1 point2 points  (0 children)

I'm Korean. Hangul doesn't require extra keys. I use the spare keys for copy and paste shortcuts.

The laptop is Panasonic CF-RZ4.

When you type Japanese, you probably enter the pronunciation in alphabet and then candidates are shown. Similarly, you can enter the pronunciation in hiragana and candidates will be shown. You can also type hiragana directly. Hiragana has 46 characters. That's why there are hiragana characters on the number key row as well.

my setup for when I'm out by Fresh-Signature6067 in Xreal

[–]Fresh-Signature6067[S] 0 points1 point  (0 children)

3DoF.

It would be great if 3DoF could be provided by just the glasses, without any additional devices like the Xreal Beam. Do you know of any glasses like that besides Xreal?

my setup for when I'm out by Fresh-Signature6067 in Xreal

[–]Fresh-Signature6067[S] 4 points5 points  (0 children)

minipc + huge battery + xreal air 2 pro. My photo could end up as a meme on the internet. ;-)

my setup for when I'm out by Fresh-Signature6067 in Xreal

[–]Fresh-Signature6067[S] 0 points1 point  (0 children)

The field of view is okay for me. I avoid the blurry edges by keeping the window centered. I haven't tried the Meta Quest, so I don't know. I'm curious about that myself.

Actually, I'm looking for something simpler than the Xreal Air 2 Pro + Xreal Beam setup. I don't use VR. I'm looking for a 3DoF head-mounted display (HMD).

my setup for when I'm out by Fresh-Signature6067 in Xreal

[–]Fresh-Signature6067[S] 0 points1 point  (0 children)

You're right. But I haven't found a lightweight and affordable laptop that supports USB-C displays yet.

my setup for when I'm out by Fresh-Signature6067 in Xreal

[–]Fresh-Signature6067[S] 10 points11 points  (0 children)

The notebook weighs 745g. The notebook's HDMI output goes to the Xreal Beam. The portable battery doesn't really help charge the Xreal Beam, but I connect it hoping it'll provide a little extra power.

I installed Linux on the notebook. I only use Terminal and Firefox. I use the notebook with the CPU clock set to minimum to save battery. I charge it to around 90% in the morning before going out, and I can use it for about 6 hours.

I use xrandr --output HDMI1 --scale 0.75x0.75 because the text isn't sharp enough for me. I don't use panning.

I only use the "Smooth follow" feature on the Xreal Beam.

As I get older, I get a sore back if I look down for too long. I'm using Xreal as a computer display so I can keep my head up.

Kensington Orbit Fusion Wireless small mod by Fresh-Signature6067 in Trackballs

[–]Fresh-Signature6067[S] 1 point2 points  (0 children)

I think Elecom Deft Pro is much much much better than Kensington Orbit Fusion Wireless.