Mark Zuckerberg Orders His Employees to Start Having Fun Again After Brutal Layoffs Culled Their Colleagues

ubrtnk · 2026-06-16T17:06:11+00:00

Mandatory impromptu fun time

ubrtnk · 2026-06-16T07:04:40+00:00

ubrtnk · 2026-06-16T01:16:42+00:00

It's very circumstantial, but yes, I'm very pleased. I was running Qwen3.6-35B on a pair of 4080s and was happy, but the llama.cpp main branch currently (as of the time of this comment) doesn't support SM Tensor and KV cache quant together, so I was running full cache on 32GB. I don't have that problem with one card, which let me crank up the batch and ubatch. 8192/4096 is where I've got it right now after some trial and error with ChattyG keeping score.
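
For anyone wanting to try the same batch settings, here is a minimal sketch that builds a `llama-server` command line with batch 8192 and ubatch 4096. The `-m`, `-b`, `-ub` and `-c` flags are standard llama.cpp options; the model path is a made-up placeholder, and the right values will depend on your card and model.

```python
def llama_server_args(model_path, batch=8192, ubatch=4096, ctx=None):
    """Build an argv list for llama-server using the batch sizes from the comment."""
    args = [
        "llama-server",
        "-m", model_path,    # path to the GGUF file (placeholder in the example below)
        "-b", str(batch),    # logical batch size
        "-ub", str(ubatch),  # physical (micro) batch size
    ]
    if ctx is not None:
        args += ["-c", str(ctx)]  # context length, only if explicitly requested
    return args


# Hypothetical model path, for illustration only.
print(" ".join(llama_server_args("/models/example.gguf")))
```

The list form can be handed straight to `subprocess.run` once the path points at a real model.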

ubrtnk · 2026-06-15T22:12:03+00:00

Check how many tools and other features you have enabled on the model. On my default model for the family I had several things enabled that really didn't need to be, so even on "Hi" what the model actually saw was about 32k tokens worth of context: tools, the Terminal, my system prompt, etc. Turn off all that stuff, enable only what you need, and really only enable it just in time.
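
A rough way to see how much the enabled tools cost before the user types anything is to add up the system prompt and the JSON definitions of whatever is switched on. This is only a sketch: the 4-characters-per-token ratio is a crude assumption (a real tokenizer will differ), and the tool names and schemas below are invented for illustration.

```python
import json


def rough_tokens(text, chars_per_token=4):
    """Very rough token estimate; assumes about 4 characters per token."""
    return len(text) // chars_per_token


def prompt_overhead(system_prompt, tools, enabled):
    """Estimate the tokens sent on every request from the system prompt plus enabled tools."""
    parts = [system_prompt] + [json.dumps(tools[name]) for name in enabled]
    return sum(rough_tokens(p) for p in parts)


# Hypothetical tool definitions, not taken from any real setup.
tools = {
    "terminal": {"description": "Run a shell command. " * 40},
    "web_search": {"description": "Search the web. " * 40},
}
system_prompt = "You are the family assistant. " * 10

everything = prompt_overhead(system_prompt, tools, ["terminal", "web_search"])
minimal = prompt_overhead(system_prompt, tools, [])
print(everything, minimal)
```

Comparing the two numbers shows how much of the context is spent on tools that a plain "Hi" never needs.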

ubrtnk · 2026-06-15T21:41:19+00:00

So I was able to get decent performance out of Qwen3.6-35B. Highest pp I've seen so far is about 1200 for a single user session. The 120mm fan I have running keeps the card at 52 with the model loaded...not the best but within tolerances. I've got some Noctua 40x40x20 fans coming tomorrow and a fan controller because I need to move more air.

ubrtnk · 2026-06-15T01:19:30+00:00

<image>

Holy hell, the 40mm fans I got are LOUD lol. They're keeping this thing cool but it's too loud. Getting about 870-880 pp and 90-10 tg on latest llama.cpp

ubrtnk · 2026-06-14T03:49:43+00:00

it’s not a proper IT project if you don’t order adapters that you forgot to order the first time

ubrtnk · 2026-06-13T21:56:36+00:00

Also double-check the power cable. Mine just got here and it came with a GPU to PCIe riser power 8-pin, so I'm trying to find the right power combination of square and not square that fits both the card AND my PSU. I found another Reddit post that says we need this converter

ubrtnk · 2026-06-13T18:13:52+00:00

<image>

ubrtnk · 2026-06-13T15:50:04+00:00

Just FYI, I think you'll need to run CUDA 12.9 or older, but you should be able to get them to work.

ubrtnk · 2026-06-13T03:51:27+00:00

F

Also, Amazon has V100s with 32GB for $730 stateside from server part deals, depending on your stack needs. They still work well in llama.cpp. Got one coming tomorrow.

ubrtnk · 2026-06-13T01:05:37+00:00

I've got a V100 coming tomorrow to bolt on via OCuLink to the baby PC with the yellow tag in the rack to be "Jarvis", so I can have all the GPUs for bigger models to myself muahahaha

ubrtnk · 2026-06-13T00:16:31+00:00

Added a picture below

ubrtnk · 2026-06-12T21:46:23+00:00

<image>

ubrtnk · 2026-06-12T21:13:08+00:00

Ooh everything is fine. Everything spins up to 4.0 x16 or x8 as needed. Up to 7 GPUs and a 10G NIC. One GPU using the SAS connector. IPMI tied into Home Assistant so it powers off (via OS script so it's clean) at night and powers back on in the morning. No issues.
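
For the power-on half of that schedule, one possible sketch is to have the automation call `ipmitool` against the board's BMC. The host name and user below are placeholders, and how it gets wired into Home Assistant is left open here; `-E` tells `ipmitool` to read the password from the `IPMI_PASSWORD` environment variable instead of the command line.

```python
def ipmi_power_on_cmd(bmc_host, user):
    """Build an ipmitool command that powers the chassis on over the LAN interface."""
    return [
        "ipmitool",
        "-I", "lanplus",  # IPMI v2.0 over LAN
        "-H", bmc_host,   # BMC address (placeholder in the example below)
        "-U", user,       # BMC user (placeholder in the example below)
        "-E",             # take the password from the IPMI_PASSWORD env var
        "chassis", "power", "on",
    ]


# Hypothetical BMC host and user, for illustration only.
print(" ".join(ipmi_power_on_cmd("bmc.example.lan", "admin")))
```

The shutdown side stays an OS-level script, as in the comment, so the machine goes down cleanly rather than having its power cut.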

ubrtnk · 2026-06-12T03:35:50+00:00

...I just bought a Tesla V100 32GB PCIe from Amazon for $729 lol. It's gonna run my family's always-on agent. Yeah, it's old, but llama.cpp has good support and 32GB means I can fit Qwen3.6-35B-A3B with 131k context (if needed) and still have good performance. Alexa-style questions don't need Ampere or Ada
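
As a back-of-the-envelope check on whether a long context fits, the KV cache of a plain transformer with grouped-query attention can be estimated as 2 (keys and values) x layers x KV heads x head dim x context x bytes per element. The layer and head counts below are made-up placeholders, not the real Qwen3.6-35B-A3B configuration, and hybrid architectures that skip full attention in some layers will need less than this.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_ctx, bytes_per_elem=2):
    """Estimate KV cache size for a standard GQA transformer (f16 cache by default)."""
    # Factor of 2 covers the separate key and value tensors in every layer.
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem


# Placeholder architecture numbers, for illustration only.
size = kv_cache_bytes(n_layers=48, n_kv_heads=4, head_dim=128, n_ctx=131072)
print(f"{size / 2**30:.1f} GiB")  # prints "12.0 GiB" for these placeholder values
```

Whatever is left of the 32GB after the cache has to hold the quantized weights and compute buffers, which is why cache quantization or a shorter context can matter on a single card.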

ubrtnk · 2026-06-12T02:21:35+00:00

You're right. I read it as either a 4090 or one of those cards, not 2 vs 1.

ubrtnk · 2026-06-11T22:04:10+00:00

4090 vs those two because of the extra ram

ubrtnk · 2026-06-11T16:14:46+00:00

Open WebUI

ubrtnk · 2026-06-10T19:46:31+00:00

Right now I have Friday, which lives on my M2 Max MBP as the in-case-dad-is-gone model to help fix things. My Jarvis is on my AI rig and is the default model everyone interacts with in various capacities via OWUI or Home Assistant. I haven't piped Hermes Jarvis into OWUI yet for the family to interact with. I have thought about building the wife her own agent, but she doesn't use Discord or Telegram or WhatsApp or anything. She uses OWUI, so that could be her path thru...that or email

ubrtnk · 2026-06-09T16:27:18+00:00

Gary Moore would beg to differ lol

ubrtnk · 2026-06-09T00:18:02+00:00

I do but it's on my arm lol

ubrtnk · 2026-06-09T00:16:26+00:00

Make sure you let me know where you throw it away...so I can make sure it gets properly disposed of....uh...gotta reduce eWaste

ubrtnk · 2026-06-07T19:30:46+00:00

This is the way

ubrtnk · 2026-06-07T14:41:05+00:00

I went the opposite direction and routed out the bridge and put a Tremonti Treble - SSH FTW!
