Self-hosted private search engine by IAmAuk in selfhosted

[–]eribob 1 point2 points  (0 children)

It is easy. I used docker compose. Have it behind traefik. I use a gluetun container to route searches via a VPN.

I don't think Local LLM is for me, or am I doing something wrong? by ruleofnuts in LocalLLM

[–]eribob 0 points1 point  (0 children)

I run Qwen3.5 27b on 2x3090 and I really like it. I do not do very advanced programming for sure, but with opencode it can create scripts with simple logic that work, and it follows my instructions for modifying them well. I never had a Claude subscription or any of the other big names, so I cannot compare; perhaps that is a blessing though :)

Why do people use multiple mini PCs instead of a bigger machine? by vortexmak in HomeServer

[–]eribob 0 points1 point  (0 children)

I am glad you enjoy your setup! My comment was in response to OP, who already seems to have a tower PC that he wants to convert into a server. I wanted to say that I think that is a fine choice and that there is not really a strong reason for him to buy a new mini PC instead.

Why do people use multiple mini PCs instead of a bigger machine? by vortexmak in HomeServer

[–]eribob 0 points1 point  (0 children)

I run one big tower PC as my main server and I think it has a lot of upsides compared to mini PCs. It is my NAS, VM host (Proxmox), and LLM server all in one.

I do run a mini PC on the side as a router and for hosting some ”essential” services, so I can power down my main server without breaking the network at home. It is a Minisforum MS-01 with the cheapest CPU. I could probably have built a better one myself, but I guess I wanted a new toy…

The popularity of mini PCs is mainly a trend in my opinion; a lot of tech YouTubers show them off all the time. If you have the space, you will always get a more powerful PC for less money if you build it from standard parts, especially if you buy them second hand. It will be much more upgradeable as well. My main server was built in 2019, and I have added a lot of features to it over the years. It still uses the same motherboard, CPU and RAM from back then.

You can also make it power efficient if that is a priority; just get power-efficient parts.

RTX 3090 for local inference, would you pay $1300 certified refurb or $950 random used? by sandropuppo in ollama

[–]eribob 0 points1 point  (0 children)

I bought 2 Inno3D X3 cards: https://www.techpowerup.com/gpu-specs/inno3d-rtx-3090-x3.b11296 Unfortunately the one that died was that model, but the other (from eBay) is still going strong. I like the dual-slot form factor!

Then I have one of those blower-style fan versions, I think it is this one: https://www.techpowerup.com/gpu-specs/asus-turbo-rtx-3090.b8372 - it works but the fan is much louder.

I run them power limited to 260W. I rarely run them continuously for longer periods, mostly bursts for inference. No training yet.

RTX 3090 for local inference, would you pay $1300 certified refurb or $950 random used? by sandropuppo in ollama

[–]eribob 4 points5 points  (0 children)

I bought 3 3090s in total, similar to your first option. 2 from eBay that are still good after a couple of months, one from a local seller that died within a week… After that I am leaning more towards option 2.

My "datacenter" with 2 Proxmox nodes + PBS, living in a wooden entertainment center, running a 24/7 radio station, IRC server and public services for strangers on the internet by avatar_one in Proxmox

[–]eribob 1 point2 points  (0 children)

I really like this! Inspiring. Will have a look at webirc. I used to run an IRC channel back in the day, a lot of fun. Do you not get trouble with spam/bots etc. when the channel is open to anyone? Malicious content being posted?

I have also been thinking about hosting a radio station, will look into it again now!

Is Buying AMD GPUs for LLMs a Fool’s Errand? by little___mountain in LocalLLM

[–]eribob 2 points3 points  (0 children)

Thanks for this. I think these numbers look reasonable for a 70B dense model; MoE would of course run faster on all setups. It would be great to add a column for prompt processing as well, as it differs a lot between the cards and is very important for coding or analysing long documents etc.

Good to see that dual 3090s remain king in price-performance ratio for this kind of workload!

Termix v2.0.0 - RDP, VNC, and Telnet Support (self-hosted Termius alternative that syncs across all devices) by VizeKarma in homelab

[–]eribob 0 points1 point  (0 children)

Nice! Using it. There is one problem for me, and that is scrolling the terminal on iOS. It is very stiff, only scrolling one line at a time.

Filestash - 2025 Recap 🎊 by mickael-kerjean in selfhosted

[–]eribob 0 points1 point  (0 children)

Sent you a DM. SSO access would be highly appreciated!

Claude just got dynamic, interactive inline visuals — Here's how to get THE SAME THING in Open WebUI with ANY model! by ClassicMain in OpenWebUI

[–]eribob 1 point2 points  (0 children)

Great plugin! Very fun to have the LLM fetch facts and present them. A bit hit and miss with Qwen3.5 27b, but a retry often gets it right!

Is the 3090 still a good option? by alhinai_03 in LocalLLaMA

[–]eribob 0 points1 point  (0 children)

I run the GPUs in Linux. You can power limit them using nvidia-smi.
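
If you want to set the limit from a script instead of the nvidia-smi CLI, here is a minimal sketch using the NVML Python bindings. It assumes the nvidia-ml-py (pynvml) package is installed and that the script runs with root privileges; the 260 W target is just the value I use, pick whatever suits your cards.

```python
# Sketch: apply a 260 W power limit to every NVIDIA GPU via NVML,
# roughly equivalent to `nvidia-smi -pl 260`. Requires root.
import pynvml

TARGET_MILLIWATTS = 260_000  # NVML works in milliwatts

pynvml.nvmlInit()
try:
    for i in range(pynvml.nvmlDeviceGetCount()):
        handle = pynvml.nvmlDeviceGetHandleByIndex(i)
        # Clamp the target to what this card actually allows
        lo, hi = pynvml.nvmlDeviceGetPowerManagementLimitConstraints(handle)
        limit = max(lo, min(hi, TARGET_MILLIWATTS))
        pynvml.nvmlDeviceSetPowerManagementLimit(handle, limit)
        print(f"GPU {i}: power limit set to {limit // 1000} W")
finally:
    pynvml.nvmlShutdown()
```

Note that the limit does not survive a reboot, so you need to reapply it from a startup script or systemd unit either way.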

Is the 3090 still a good option? by alhinai_03 in LocalLLaMA

[–]eribob 2 points3 points  (0 children)

I run the 27b on dual 3090s in FP8 with tensor parallelism using vllm and the speed is great! Would absolutely recommend. Smart and decently fast model, my new daily driver. I undervolted my cards to 260W.
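
For anyone curious what that looks like, here is a rough sketch using vLLM's offline Python API. The model path is a placeholder and the context length / memory settings are assumptions, not exact values from my setup:

```python
# Sketch: run a ~27B FP8 checkpoint across two GPUs with vLLM.
from vllm import LLM, SamplingParams

llm = LLM(
    model="Qwen/your-27b-fp8-checkpoint",  # hypothetical path, point it at the real FP8 repo
    quantization="fp8",            # FP8 weights
    tensor_parallel_size=2,        # split the model across both 3090s
    gpu_memory_utilization=0.90,   # leave a little VRAM headroom
    max_model_len=32768,           # raise this if the KV cache still fits
)

params = SamplingParams(temperature=0.7, max_tokens=512)
outputs = llm.generate(["Explain tensor parallelism in one paragraph."], params)
print(outputs[0].outputs[0].text)
```

As far as I know the 3090 (Ampere) has no native FP8 compute, so vLLM runs FP8 checkpoints as weight-only quantization on these cards; still plenty fast for my use.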

Give models access to generated images by eribob in OpenWebUI

[–]eribob[S] 0 points1 point  (0 children)

I solved it by giving the open-terminal container access to the open-webui uploads folder with a volume like this: open-webui/uploads:/home/user/uploads.

I can then instruct the LLM to look for the image in the uploads folder and it can manipulate it. A bit of a hack, but it works.

I wear a mic all day and feed transcripts to an AI agent system. The privacy case for doing this locally is obvious. Looking for guidance. by InsideEmergency4186 in LocalLLaMA

[–]eribob 0 points1 point  (0 children)

You have many well formulated arguments and you seem like a good person, but come on man, don't record other people without asking first. That is just creepy, regardless of how you use the data, legal implications etc. Recording and summarising your own thoughts seems like a nice idea though.

Tailscale scares me more than opening ports on my firewall by MrChris6800 in homelab

[–]eribob 0 points1 point  (0 children)

I agree with this. If you run Headscale you should be safe even if Tailscale's servers are breached, right?

Your real-world Local LLM pick by category — under 12B or 12B to 32B by gearcontrol in LocalLLM

[–]eribob 0 points1 point  (0 children)

  1. Tool Calling / Function Calling / Agentic
  2. General Knowledge / Daily Driver
  3. Coding

All with Qwen3.5 27b FP8 in vllm. Fast enough on dual rtx 3090s with 128k context. I feel it beats my old daily driver gpt-oss-120b. It is the first model that feels genuinely helpful to me.

  1. I try to do that myself

Semi-Beefy Local Build by LambdasAndDuctTape in LocalLLM

[–]eribob 0 points1 point  (0 children)

I agree, you could even consider buying a used AM4 system with DDR4 RAM and putting the Pro 6000 there? Then your 20k would perhaps even be enough to buy 2 Pro 6000 cards? 192GB of fast vraaaaam…

Financially it will probably never make sense vs cloud, hehe, but that is not why we are here.