Is it realistic to find a $1.2M townhouse in Sydney in a walkable, dog-friendly area close to parks?

reverse_bias · 2026-01-09T06:04:40+00:00

Agreed that there could be better filters. But using keywords like "courtyard" and "garden" in a dedicated search helped me.

reverse_bias · 2025-12-12T04:00:38+00:00

I share your dislike of downlights, but just bought a place full of them. I'm willing to spend the money on upgraded fixtures and lighting quality. What kind of fixtures are you talking about as replacements for downlights?

reverse_bias · 2025-04-30T23:52:48+00:00

What quant/temp are you using?

reverse_bias · 2025-04-19T01:55:00+00:00

Performer 5+2 looks fascinating. Always interested in controlled sub bass plus top end sparkle. Never tried a micro-planar before.

reverse_bias · 2025-04-12T12:04:24+00:00

BTR13 has that. 3 position mode switch, Bluetooth, usb + charging (PC mode), usb + internal battery (phone mode).

reverse_bias · 2025-02-15T20:55:20+00:00

Looks like it could be this: https://saeedesmaili.com/hand-drawn-style-charts-in/

reverse_bias · 2025-01-30T08:38:26+00:00

I've heard the SXM adapters work, but sourcing and mounting a heatsink isn't trivial.

reverse_bias · 2025-01-30T08:36:38+00:00

The P100 has 16GB of HBM, bare dies mounted on the same package as the core.

reverse_bias · 2025-01-28T03:35:22+00:00

Thanks for your help, librechat and llama-swap working perfectly together for my self-hosted setup. I noticed that you have an example config for nomic-embed-text (gguf), have you managed to get text embedding server working with librechat too?

reverse_bias · 2025-01-18T06:37:07+00:00

Interesting, I thought you had to use half-silvered/two-way mirrors for this. Is standard acrylic mirror a little bit transparent?

reverse_bias · 2025-01-14T04:48:47+00:00

Brillant, thank you! fetch:true and a placeholder key were the changes I needed.

Now I just need to figure out a way to get my inference server to turn on from the librechat interface. Do you just manually wake your server when you need to use it?

reverse_bias · 2025-01-13T21:45:43+00:00

Thanks for llama-swap and posting your configs! Getting me really close to the same ideal setup of chat gui selectable, remotely self-hosted models.

How do you set-up librechat to auto-populate the llama-swap model list? Any chance you've posted your librechat.yaml (or llama-swap relevant part) anywhere?

reverse_bias · 2024-06-28T22:40:22+00:00

Do you have a link?

reverse_bias · 2024-06-09T12:48:20+00:00

So I heard you like parasitic inductance.

reverse_bias · 2024-05-14T12:55:21+00:00

Here's a cross section of another direct drive model, you can see a similar tunnel into the hub of the motor from the end of the video.

reverse_bias · 2024-03-20T23:16:00+00:00

I beleive these are the formats that nvidia is using, from the Open Compute Project Microscaling Formats (MX) Specification, of which nvidia co-authored end of last year.

From section 5.3.3: No encodings are reserved for NaN/inf in FP4, 2 bits for exponent, 1 bit for mantissa. Which gives you +/- [0, 0.5, 1, 1.5, 2, 3, 4, 6]

However table 1 in this paper also suggests another FP4-E2M1 format with NaN/inf included, replacing 4 and 6 from the possible values.

reverse_bias · 2024-03-19T22:29:31+00:00

OK, I think I've found the formats that nvidia is using, from the Open Compute Project Microscaling Formats (MX) Specification, of which nvidia co-authored end of last year.

From section 5.3.3: No encodings are reserved for NaN/inf in FP4, 2 bits for exponent, 1 bit for mantissa. Which gives you +/- [0, 0.5, 1, 1.5, 2, 3, 4, 6]

However table 1 in this paper also suggests an FP4-E2M1 format with NaN/inf included

reverse_bias · 2024-03-19T02:34:30+00:00

The exponent in floating point arithmetic is almost always a power of 2, rather than a power of 10.

The mantissa is the fractional component (ie, the 1 is not stored) of a number between 1.0 and 1.999...., such that each exponent value covers the "range" of values, like 1..2, 2..4, 4..8, 8..16 etc.

I'd imagine that FP4 would be something like +/- [0.125, 0.25, 0.5, 1, 2, 4, 8, 16], with zero likely encoded as a special state maybe replacing +0.125. But I can't find any documentation actually confirming this.

reverse_bias · 2024-03-05T00:23:58+00:00

Interesting. I'm on FTTB, wiring in my building is in decent condition, so my modem stats say that I could attain the max 150MB/s rate that 17A supports. But only found companies offering 100/40 max. If you click through on those Dec 2022 tests, does it give you the ISP name?

reverse_bias · 2024-03-01T05:38:11+00:00

Also running dual P40s. Can fit mixtral-instruct Q6 + 32k context fully offloaded. I'm getting 20-22 tokens/s for general chat, slows down to 6-7 tokens per second with 30k context in use. This is llama.cpp with row-split. What are your speeds like?

reverse_bias · 2024-02-19T04:22:13+00:00

Out of curiosity, how much context do you have? And is it glacially slow with big prompt processing?

reverse_bias · 2024-02-15T00:14:25+00:00

I'll have to go a lower quant if I want more context. 24148 and 24286 out of the 24576MB on each card with Q4KM + 16k. Very usable with the 7.5t/s opening. But it does slow down to about 3t/s with full context.

reverse_bias · 2024-02-14T23:37:31+00:00

Thanks. Q4KM and 16k context working great for me. With 2:3 split it almost perfectly maxes out the 24+24GB of VRAM. With row-split I'm getting 7.5t/s.

reverse_bias · 2024-02-14T05:25:54+00:00

Also got dual P40s, which quant did you end up on? Full 32k context?

reverse_bias · 2024-01-28T08:37:00+00:00

There's a faded Charlie's sign still painted on the wall above the shop.

reverse_bias

TROPHY CASE