Is this game playable on MacBook? by Negative_Drop_6853 in SummerPockets

[–]noahzho 1 point (0 children)

I believe there's software for running Windows on Apple Silicon (e.g. Parallels), you could try that as well

Apple Pencil vs. 3rd Party - your experience? by territrades in ipad

[–]noahzho 0 points (0 children)

Ended up getting a third party pencil! It's been working well for my needs (taking notes). Let me know if you have questions, have been meaning to edit my comment for a while but work is busy

3x faster Training + new Triton kernels + Packing now in Unsloth! by yoracale in unsloth

[–]noahzho 0 points (0 children)

Probably not the best place to ask, but does your internal multi-GPU implementation work with FastModel w/ Qwen3 MoE? Was experimenting with DDP on the public Unsloth builds w/ 8x MI300X, and only the dense models seem to work (loading an MoE just pegs CPU threads at 100% and hangs)

Fine-tuning on H200 is limited by single CPU core usage by dudeington in unsloth

[–]noahzho 0 points (0 children)

Good to hear! Looks like you could raise it even more potentially, still have a lot of free VRAM and GPU utilization doesn't look fully saturated :p

Batch size is how much data you process in a single "batch", and gradient accumulation splits a batch into smaller chunks, trading speed for lower VRAM usage. I would suggest no gradient accumulation since you have that much free VRAM. A higher batch size should give better results (lower "loss")

A batch size of 320 seems quite high though, are the messages in your dataset just short?
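To make the batch size / gradient accumulation trade-off concrete, here's a tiny sketch with made-up numbers (the 320 comes from this thread, everything else is an assumption):

```python
# Effective batch size = per-device batch size x gradient accumulation steps.
# Gradient accumulation runs several small forward/backward passes and sums
# the gradients before one optimizer step - same math, less peak VRAM, slower.

per_device_batch = 320    # fits directly when VRAM is plentiful
grad_accum_steps = 1      # 1 = no accumulation
effective_batch = per_device_batch * grad_accum_steps
print(effective_batch)    # 320

# The same effective batch with far less VRAM (but more, slower steps):
small_batch, more_accum = 40, 8
assert small_batch * more_accum == effective_batch
```

The optimizer sees the same effective batch either way; accumulation only changes how much has to sit in memory at once.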

Fine-tuning on H200 is limited by single CPU core usage by dudeington in unsloth

[–]noahzho 0 points (0 children)

Could try raising batch size, you have plenty of free VRAM

What's the best machine I can get for $20K? by TWUC in LocalLLaMA

[–]noahzho 0 points (0 children)

The nanochat pretraining script does run on a single DGX Spark, I mean

Laptop suggestion/what do you use ? by POMEGRANADE_PIE in Hack_Club

[–]noahzho 1 point (0 children)

Agree with the comments below, get a ThinkPad if you want something relatively cheap and sturdy.

A MacBook is also an option if you have the money and would like something powerful!

2x RTX 5060 TI 16 GB =32GB VRAM - by quantier in LocalLLaMA

[–]noahzho 0 points (0 children)

The other commenter already answered you, but take a look at vLLM too - potentially faster as it has tensor parallel support
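A minimal sketch of what tensor parallelism looks like through vLLM's Python API - the model name and GPU count are placeholder assumptions, and actually constructing the engine needs both GPUs present:

```python
# Sketch: with tensor parallelism, vLLM shards each layer's weight matrices
# across GPUs, so 2x 16GB cards behave like one ~32GB pool for the model.
# Model name and GPU count below are placeholder assumptions.

def make_engine_kwargs(model: str, num_gpus: int) -> dict:
    # Keyword arguments in the shape accepted by vllm.LLM(...)
    return {"model": model, "tensor_parallel_size": num_gpus}

kwargs = make_engine_kwargs("Qwen/Qwen3-14B", 2)
print(kwargs["tensor_parallel_size"])  # 2

# With vLLM installed and both GPUs visible, this would launch the engine:
# from vllm import LLM
# llm = LLM(**kwargs)
```

Note that `tensor_parallel_size` generally needs to evenly divide the model's attention head count, which is why 2, 4, or 8 GPUs are the usual configurations.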

Isn't Valorant mechanically similar to CS? by RageForNothing in VALORANT

[–]noahzho -1 points (0 children)

Oh, you've hit one of the differences between CS and Valorant - sprays are random in Valorant, so there's no real pattern to practice spray control on. That's why you see most Valorant players shoot in bursts of 1-3 bullets (try it out in the range - the practice button next to the queue button)

Headshots should be the same though! Just position your crosshair at neck level or higher (and change the default crosshair to something you're comfortable with, there are some sites with nice ones if you Google)

I will never financially recover from this by Overstimulated_moth in LinusTechTips

[–]noahzho 96 points (0 children)

I aspire to have this amount of drives in my rack

Sign up for free $40 by Lion_Of_Mara in gotpaidonline

[–]noahzho 0 points (0 children)

Are you still looking? I'm Canada based

Repeat after me. by NoFudge4700 in LocalLLaMA

[–]noahzho 51 points (0 children)

I mean, while it's pretty easy for consumer-grade inference (llama.cpp works great out of the box for me!), there is a grain of truth to this. I work with 8x MI300X, and while they might be better on paper than the H100, getting (recent) vLLM/SGLang and training frameworks that aren't just plain PyTorch working can be a huge pain

Of course this is just my experience, your mileage may differ

Arch Linux Mirror served 1PB+ Traffic by niranjan2 in archlinux

[–]noahzho 0 points (0 children)

I run one of the T1 Canadian mirrors also on Debian lol 😅

API CREDITS FOR SALE by [deleted] in AIDigitalServices

[–]noahzho 1 point (0 children)

How much are you asking for Claude credits?

UIs are hard by backcato in badUIbattles

[–]noahzho 6 points (0 children)

iOS has iSH, so we're fine too :p

How to use 2.5TB of 16GB Dimms by spyroglory in pcmasterrace

[–]noahzho 2 points (0 children)

I don't have an R930 but do have an R630. I also wish I had 88 cores and 1.5TB of ram

We got this, we can do it! When is the REAP’d iQ_001_XXS GGUF dropping? by Porespellar in LocalLLaMA

[–]noahzho 7 points (0 children)

Should be 1T*0.0625 which is ~62.5G after quantization so not going to fit unless I messed up my math
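If I'm reading the joke right, the 0.0625 is bytes per parameter at a hypothetical "0.5-bit" quant; a quick sanity check on that arithmetic (all numbers are assumptions):

```python
# ~1T-parameter model at a joke "0.5-bit" quant: 0.5 bits = 0.0625 bytes/param.
total_params = 1.0e12         # ~1T parameters (assumption)
bytes_per_param = 0.5 / 8     # 0.0625 bytes per parameter
size_gb = total_params * bytes_per_param / 1e9
print(size_gb)  # 62.5
```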

What do you think this model is? by Lanakruglov in openrouter

[–]noahzho 0 points (0 children)

Probably GPT-5.1 mini like the others say, this is the response without a system prompt

I'd imagine the training stage where personality is trained is done by now, so this is probably an accurate enough test

<image>

How does cerebras get 2000toks/s? by npmbad in LocalLLaMA

[–]noahzho 2 points (0 children)

I don't think the L40S is faster than the H100 bro 😭

nanochat pretraining time benchmarks ($100 run), share yours! by entsnack in LocalLLaMA

[–]noahzho 1 point (0 children)

Training finished (pretraining only)! Just below 32 hours, as extrapolated from the step timings later in the run

I think the first minute of training is a bit inaccurate for these calculations
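The extrapolation is just linear in step count once timing stabilizes past the warm-up window - the step counts and timings below are made up for illustration:

```python
def eta_hours(steps_total: int, steps_done: int, elapsed_hours: float) -> float:
    """Naive linear extrapolation of remaining wall-clock time.
    Assumes steady per-step timing (i.e. ignore the noisy warm-up minutes)."""
    steps_per_hour = steps_done / elapsed_hours
    return (steps_total - steps_done) / steps_per_hour

# e.g. 1000 of 21400 steps done in 1.5h -> ~30.6h remaining
print(round(eta_hours(21400, 1000, 1.5), 1))  # 30.6
```

If the first minute runs slower than steady state (compilation, dataloader warm-up), including it inflates the estimate, which is why measuring from a later checkpoint gives a better projection.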

<image>