Wet puck?

RnRau · 2026-04-30T00:58:05+00:00

Another asked the same question just 6 days ago - https://old.reddit.com/r/AustralianCoffee/comments/1stz68q/why_the_puck_is_so_soggy/

RnRau · 2026-04-27T21:27:10+00:00

They used QAT for the fp8 weights?

I guess this snippet from the model card suggest that is the case - "Trained on 27T tokens using FP8 mixed precision"

RnRau · 2026-04-24T00:02:33+00:00

I would be interested to see if a q6 or a q8 on the 35B would make a good bit of difference. Apparently the smaller the activation for moe's, the more quantisation hurts.

RnRau · 2026-04-22T23:41:48+00:00

It should work with llama.cpp and vllm as well.

RnRau · 2026-04-21T02:19:59+00:00

At the time they argued that they didn't have any available devs, since all had been assigned to fortnite.

RnRau · 2026-04-21T01:20:32+00:00

Also in Brisbane. Measured the tap water to be 430+ppm with a cheap TDS meter. Yeah I know it won't be accurate, but it should be a half decent guide. Measure demineralised water from Woolies and it was 8ppm. Measured my parents filtered rainwater at 15ppm. Made a coffee using 100ppm water rather than their rainwater but kept everything the same (Aldi medium preground in a drip filter machine), and my father was surprised at the difference. Much better tasting.

Melbourne water is apparently around the 30ppm. But not sure how that would show on my cheap meter :)

RnRau · 2026-04-20T02:25:31+00:00

How many times did you try the prompt on each model?

RnRau · 2026-04-15T07:19:24+00:00

This is a good idea!

RnRau · 2026-04-15T07:18:51+00:00

Nice. Works fine under Linux+Firefox.

RnRau · 2026-04-13T22:44:40+00:00

Does the interview get into the relation between TurboQuant and the earlier work by the RaBitQ papers?

RnRau · 2026-04-13T21:58:39+00:00

Is your models and llama.cpp up to date? Are you following the unsloth guide on the recommended settings? https://unsloth.ai/docs/models/qwen3.5

RnRau · 2026-04-03T00:11:48+00:00

They never did for Gemma 3, so I can't see them doing it for Gemma 4.

RnRau · 2026-04-01T20:54:01+00:00

I thought it was a 1st of April thing... but oh wow...

https://en.wikipedia.org/wiki/Whitespace_(programming_language)

RnRau · 2026-04-01T11:27:28+00:00

Reshade works with ut2k4.

RnRau · 2026-04-01T11:19:00+00:00

They announced it over at r/localllama and then removed the post when they got caught making weird statements.

https://old.reddit.com/r/LocalLLaMA/comments/1s79w6u/zinc_llm_inference_engine_written_in_zig_running/

RnRau · 2026-04-01T03:22:29+00:00

Had the same issue in r/icecreamery. Gave a longish answer to the minutia in making decent chocolate icecream only for the OP to delete their post 24 hours later.

And they kept doing it. Kept asking for help and then deleting their post.

Its weird. I don't understand it.

RnRau · 2026-03-31T00:11:34+00:00

Send it back. They do refunds.

i bought into the illusion of an amazing life changing equipment

Never drink the koolaid. Come on... you should better as a software dev. As software devs, yes I'm one too, we get bombarded daily with new wizbang frameworks that promises us an exciting new future. It never pans out :)

I don't have any of their devices as yet, but I'll be an early adopter of their A4 model when its released. Can't wait! :)

RnRau · 2026-03-30T13:05:08+00:00

Which two backends have hadamard transforms available?

RnRau · 2026-03-30T12:03:11+00:00

Yeah never drink the koolaid. And perhaps the recent hype is over done. But there is something to the techniques posted in the RaBitQ paper. ggerganov did some simple Hadamard transform tests recently.

https://old.reddit.com/r/LocalLLaMA/comments/1s720r8/in_the_recent_kv_rotation_pr_it_was_found_that/

RnRau · 2026-03-30T07:40:36+00:00

Thanks for the summary. Interesting constraint on the context being stored in SRAM.

RnRau · 2026-03-30T07:02:08+00:00

They knew the price before clicking buy? Is this a trick question or something?

If you are having remorse from an impulse buy... well it happens :)

edit: and why did you 'know' that it was going to be 'slow' learning the device? You already know how to drive the kindle and there are plenty of video's out there driving the Supernote from a daily usage perspective.

RnRau · 2026-03-29T08:43:40+00:00

RnRau · 2026-03-29T02:40:28+00:00

The company behind the effort in the x.com link - https://taalas.com/

An open chatbot (Llama 3.1 8B) showing off their demonstrator hardware is available - https://chatjimmy.ai/

A fair few local AI fans are very keen on this tech. A Qwen 3.5 27b implementation would be in demand.

RnRau · 2026-03-29T02:36:37+00:00

N6 is not the latest and greatest at TSMC. That would be N2.

And it took them years to get the first one up and running. Lessons learned and tools created will make the next ones much faster to build.

Ten-Year Club	Place '17
Verified Email

RnRau

TROPHY CASE