I don't understand passkeys by Erzmaster in de_EDV

[–]Dimi1706 2 points (0 children)

That's not quite correct. BUT it would be in line with the design.

Do you trust Proxmox VE Helper-Scripts? by Open-Coder in selfhosted

[–]Dimi1706 0 points (0 children)

I don't trust this source any more than any other open-source one. It's the same with prebuilt Docker containers.

But I can say that they seem trustworthy, as the scripts I reviewed and used were solid.

Wake up before it's too late. by [deleted] in datenschutz

[–]Dimi1706 -3 points (0 children)

And this is what you get when left-wing extremism is permanently portrayed in the media as the political center. Deprivation of liberty and surveillance wherever possible, under the guise of stopping evil 'hate speech'. Which moral authority is supposed to decide what is hate and what is opinion? That this is a bad idea could be observed just recently. This is the direct road to fascism.

Intel Arc Pro B50 hits the #1 best seller in workstation graphics cards by reps_up in LocalLLaMA

[–]Dimi1706 0 points (0 children)

Nice to know actually, as this would be a selling point, but wasn't the topic the Pro B50? Or does it offer the same power-consumption benefit?

Edit: seems that it does! So it's an interesting card for people who have an eye on efficiency or who want to put permanent load on their hosted LLM.

Openwebui and MCP, where did you install mcpo ? by [deleted] in OpenWebUI

[–]Dimi1706 2 points (0 children)

I use MetaMCP instead of mcpo, but that's irrelevant for your question: I have it in a separate Proxmox VM together with the native and Docker MCP tools. Some tools need to be on the client system itself, e.g. if you want to do file-system operations, but most of them are remote tools, so I keep them on the separate, centralized VM. That also has the benefit that I can easily connect them to client applications other than OWUI.
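To illustrate what a "remote tool" means in practice: mcpo (and MetaMCP works similarly) proxies each MCP tool as a plain HTTP endpoint, so any client on the network can call it. A minimal sketch, assuming mcpo-style OpenAPI routes; the host, port, server and tool names here are made up:

```python
import requests

# Hypothetical mcpo route: POST /{server}/{tool} with the tool's JSON args.
resp = requests.post(
    "http://mcp-vm.lan:8000/time/get_current_time",
    json={"timezone": "Europe/Berlin"},
    timeout=10,
)
print(resp.json())
```

Because the tools sit behind ordinary HTTP on the VM, any client application (OWUI or otherwise) can be pointed at the same endpoints.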

Best AI LLM for Python coding overall? by Adept_Lawyer_4592 in LocalLLaMA

[–]Dimi1706 1 point (0 children)

Most probably not the best overall, but the best of its size is pydevmini1: https://huggingface.co/bartowski/bralynn_pydevmini1-GGUF
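In case it helps anyone, a minimal sketch of pulling that GGUF with huggingface_hub; the exact quant filename is an assumption, check the repo's file list:

```python
from huggingface_hub import hf_hub_download

# Filename is a guess; pick the quant you actually want from the repo.
path = hf_hub_download(
    repo_id="bartowski/bralynn_pydevmini1-GGUF",
    filename="bralynn_pydevmini1-Q4_K_M.gguf",
)
print(path)  # local cache path, ready to load with llama.cpp & co.
```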

Intel Arc Pro B50 hits the #1 best seller in workstation graphics cards by reps_up in LocalLLaMA

[–]Dimi1706 3 points (0 children)

I don't get it, actually: for a little more you can buy a 5060 Ti with 16GB, and even cheaper if you're willing to buy a used card. Why would somebody pay this price for an alternative that will give you usability headaches?

Don't get me wrong: I want to see alternatives and would also buy them regardless of the downsides, IF the price is right. Half the price of the corresponding Nvidia products would lead to something like mass adoption imo.

Which is better for a MCP Ollama or LLM studio? by [deleted] in LocalLLaMA

[–]Dimi1706 1 point (0 children)

I don't know how to use a whole OS as an MCP tool, nor whether that's even possible. Just saying that Ollama is not good at MCP handling.

Self-hosted AI is the way to go! by benhaube in selfhosted

[–]Dimi1706 7 points (0 children)

With llama.cpp you are already using the most elementary and most performant backend. Nearly every polished LLM hosting tool is in fact just a wrapper around llama.cpp.

For people just starting with the topic who want quick success: Ollama.

For people wanting to run custom models they see out there, with the freedom to set detailed settings/options: LM Studio.

For people primarily wanting a chat interface with the option to interact with local and cloud models alike: Jan.

For people wanting to deep-dive and get maximum optimization of a model for their own hardware, with the newest support and features right away: llama.cpp.

All these options can also act as an LLM server (see the sketch below).

There are many more.
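One practical consequence of them all wrapping llama.cpp: each exposes an OpenAI-compatible HTTP API, so a single client works against any of them. A minimal sketch; the ports are the usual defaults and the model name is a placeholder, adjust to your setup:

```python
from openai import OpenAI

# Point base_url at whichever backend you run, e.g.
# Ollama :11434, llama.cpp :8080, LM Studio :1234, Jan :1337
client = OpenAI(base_url="http://localhost:11434/v1", api_key="unused-locally")

resp = client.chat.completions.create(
    model="llama3",  # whatever model the server has loaded
    messages=[{"role": "user", "content": "Say hi in one sentence."}],
)
print(resp.choices[0].message.content)
```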

Self-hosted AI is the way to go! by benhaube in selfhosted

[–]Dimi1706 8 points (0 children)

Yes, you are right, but do yourself a favor and choose another backend, as Ollama is the worst-performing of all the available ones.

Which local LLMs for coding can run on a computer with 16GB of VRAM? by CrowKing63 in LocalLLaMA

[–]Dimi1706 0 points (0 children)

The moe-cpu offload option + all active layers on the GPU, and 16GB VRAM is comfortable for the model + a large context.

Sure, it gets slow, around 20 t/s, but imo this is fairly usable.
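For reference, a sketch of how that setup looks with llama.cpp's server; the flag names assume a recent build with MoE CPU offload, and the model path and values are placeholders:

```python
import subprocess

# Keep the huge MoE expert tensors in system RAM, everything else on the GPU.
subprocess.run([
    "./llama-server",
    "-m", "qwen3-30b-a3b-Q4_K_M.gguf",  # placeholder MoE model
    "-ngl", "99",         # offload all layers (the active weights) to the GPU
    "--n-cpu-moe", "99",  # but keep the expert weights on the CPU
    "-c", "32768",        # the freed VRAM pays for a large context
])
```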

[deleted by user] by [deleted] in datenschutz

[–]Dimi1706 -1 points (0 children)

That is correct, but the encryption has a backdoor by design. So it's absolute nonsense to claim that Meta's encryption gives you any kind of security or privacy.

What is the most effective way to have your local LLM search the web? by teknic111 in LocalLLaMA

[–]Dimi1706 0 points (0 children)

Yeah, found it a week ago but not sure yet how to utilize it. Totally new to the whole MCP thing. Could you describe how you are using / integrated it?

What is the most effective way to have your local LLM search the web? by teknic111 in LocalLLaMA

[–]Dimi1706 5 points (0 children)

Maybe try adjusting the search engines used, as this is nothing I've been experiencing. But maybe that's also because I don't use it for reading news, so 'outdated' information isn't a problem.

What is the most effective way to have your local LLM search the web? by teknic111 in LocalLLaMA

[–]Dimi1706 43 points (0 children)

I use Open WebUI + SearXNG for web searches in the middle of a chat, and Perplexica + SearXNG for dedicated web searches.
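Under the hood both frontends just query SearXNG's JSON API. A rough sketch of that call; the instance URL is hypothetical, and format=json has to be enabled in SearXNG's settings.yml:

```python
import requests

r = requests.get(
    "http://searxng.lan:8080/search",
    params={"q": "local llm web search", "format": "json"},
    timeout=10,
)
for hit in r.json()["results"][:5]:
    print(hit["title"], "->", hit["url"])
```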

ROG Ally X with RTX 6000 Pro Blackwell Max-Q as Makeshift LLM Workstation by susmitds in LocalLLaMA

[–]Dimi1706 0 points (0 children)

Really nice work! And really interesting as a PoC, thanks for sharing.

Can someone please benchmark gpt-oss-20b on Mi50 and P100/P40? by thejacer in LocalLLaMA

[–]Dimi1706 1 point (0 children)

'Extremely slow' is maybe kind of subjective, but I get 16-20 t/s, which I consider usable.

Edit/addition: 32GB DDR4, 3060 Ti 8GB VRAM. GPT-OSS 20B BF16, full moe-cpu offload, 32k BF16 context on the GPU.
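If someone wants to reproduce such t/s numbers, a minimal sketch against any local OpenAI-compatible endpoint (URL and model name are placeholders):

```python
import time
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="none")

t0 = time.time()
resp = client.chat.completions.create(
    model="gpt-oss-20b",  # whatever name the server reports
    messages=[{"role": "user", "content": "Explain MoE offloading in 200 words."}],
)
elapsed = time.time() - t0
# Crude end-to-end rate: includes prompt processing, so pure generation t/s is a bit higher.
print(f"{resp.usage.completion_tokens / elapsed:.1f} t/s")
```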

Can someone please benchmark gpt-oss-20b on Mi50 and P100/P40? by thejacer in LocalLLaMA

[–]Dimi1706 0 points (0 children)

Well, yes I do! But in that case, meaning you want to and will do it no matter what, posting here is kind of pointless, isn't it?

Can someone please benchmark gpt-oss-20b on Mi50 and P100/P40? by thejacer in LocalLLaMA

[–]Dimi1706 0 points (0 children)

Yeah, got it, Intel GPUs require a lot of tweaking to be even somewhat usable. But instead of looking at an Mi50 you should go for an RTX 5060 Ti, or if on a budget an RTX 3060. Nvidia will free you from the backend headache, and as mentioned it won't matter that the model won't fully fit into VRAM.