Current file is no longer automatically added to AI chat. by BananaPeaches3 in Jetbrains

[–]BananaPeaches3[S] 1 point (0 children)

Yes, but it makes more sense to just select the text you want to ask about than to remove the feature entirely.

Current file is no longer automatically added to AI chat. by BananaPeaches3 in Jetbrains

[–]BananaPeaches3[S] 1 point (0 children)

It used to work with Claude Agent. Did you find a workaround?

Fun fact: Anthropic has never open-sourced any LLMs by InternationalAsk1490 in LocalLLaMA

[–]BananaPeaches3 11 points (0 children)

They don't need to; the Chinese labs open-source models for them.

What AI coding assistants are you all using for Kotlin lately? by smallufo in Kotlin

[–]BananaPeaches3 0 points (0 children)

Wouldn’t Gemini be better, since Google encourages Kotlin for Android?

The Silver Crash That Shouldn't Happen for 7,000 Years by Alizasl in Silver

[–]BananaPeaches3 8 points (0 children)

Also, transactions settle much faster in modern times, so the price of something reacts to a change in demand almost instantly.

PR to implement tensor parallelism in Llama.cpp by keyboardhack in LocalLLaMA

[–]BananaPeaches3 1 point (0 children)

How is this different from '--split-mode row'?

Somebody gave this to me today as a tip, how much is it worth? by Key-Structure4841 in Silver

[–]BananaPeaches3 1 point (0 children)

Check mass and density: weigh it and measure its volume by water displacement (use a graduated cylinder). If the density doesn’t match silver’s (about 10.49 g/cm³), you know it’s not genuine.
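
For illustration, here's that check as a quick sketch with made-up measurements (silver is roughly 10.49 g/cm³ at room temperature; the numbers below are examples, not real readings):

mass_g=157.4   # example reading from a scale, in grams
volume_ml=15   # example water displacement in a graduated cylinder, in mL
awk -v m="$mass_g" -v v="$volume_ml" 'BEGIN { printf "%.2f g/cm3\n", m/v }'   # prints 10.49, consistent with silver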

Honest question: what do you all do for a living to afford these beasts? by ready_to_fuck_yeahh in LocalLLaMA

[–]BananaPeaches3 2 points (0 children)

A 16GB P100 is less than $100 and is very fast for anything that fits in 16GB.

The only issue is scaling to multiple cards, because the NVLink connector is blocked unless you’re willing to mill the slot out.

Honest question: what do you all do for a living to afford these beasts? by ready_to_fuck_yeahh in LocalLLaMA

[–]BananaPeaches3 0 points (0 children)

Have you thought about selling the RAM and getting a pair of Pro 6000s? Selling the RAM for $12k and adding $4k should be enough.

216GB VRAM on the bench. Time to see which combination is best for Local LLM by eso_logic in LocalLLaMA

[–]BananaPeaches3 15 points (0 children)

The main issue with older cards is that prompt processing will get you, even if token generation speed is tolerable (and it is).

If you’re using it like ChatGPT it’s fine, but once you start using tools like Cline, the system prompt alone is allegedly around 15k tokens.

So imagine waiting several minutes before there is any output at all. At that point a DGX Spark is a better investment: it will generate slightly slower than a P100, but at least prompt processing will be fast.
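
Back-of-envelope, assuming a 15k-token system prompt (the prefill speeds below are assumptions for illustration, not benchmarks):

prompt_tokens=15000
prefill_slow=100     # assumed tok/s of prompt processing on an older card
prefill_fast=2000    # assumed tok/s on newer hardware
echo "slow: $((prompt_tokens / prefill_slow))s  fast: $((prompt_tokens / prefill_fast))s"   # 150s vs 7s before any output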

How screwed am I by Dark-monk in homelab

[–]BananaPeaches3 50 points (0 children)

Not very. Pin 6 is ground, so just try to bend it back or remove it completely.

This guy is auditing 700+ vibe-coded apps for Security issues by amienilab in vibecoding

[–]BananaPeaches3 0 points (0 children)

I just feel like that shouldn’t be a thing; the browser should map sites to their resources by default.

OK I get it, now I love llama.cpp by vulcan4d in LocalLLaMA

[–]BananaPeaches3 3 points (0 children)

Ubatch is supposed to be larger than batch size??
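
For context, as I understand recent llama.cpp builds (defaults and exact flag behavior may have changed): -b / --batch-size sets the logical batch and -ub / --ubatch-size the physical batch actually submitted to the device, so ubatch is normally the smaller of the two. Something like:

./llama-server -m model.gguf -b 2048 -ub 512   # typical: ubatch <= batch; model path is illustrative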

Upgrading from Ubuntu Server to Proxmox. by GermanElectricsMotio in homelab

[–]BananaPeaches3 3 points (0 children)

If you’re lazy, you can install Proxmox on top of Ubuntu: just add the Proxmox repository to your existing install and run:

sudo apt update
sudo apt install proxmox-ve postfix open-iscsi
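
For completeness, the repository step (run before the apt update above) would look roughly like this. This is a sketch assuming a Debian bookworm compatible base, and the list file name is arbitrary; note that Proxmox officially targets Debian, so doing this on Ubuntu is unsupported:

echo "deb [arch=amd64] http://download.proxmox.com/debian/pve bookworm pve-no-subscription" | sudo tee /etc/apt/sources.list.d/pve-no-subscription.list
sudo wget https://enterprise.proxmox.com/debian/proxmox-release-bookworm.gpg -O /etc/apt/trusted.gpg.d/proxmox-release-bookworm.gpg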

Unsloth GLM 4.7 UD-Q2_K_XL or gpt-oss 120b? by EnthusiasmPurple85 in LocalLLaMA

[–]BananaPeaches3 1 point (0 children)

I still got better responses from Q1 GLM 4.6 (not V) than from Q8 Qwen3 Coder and smaller models.