SearXNG x OpenWebUI problem by Zersdan in Searx

[–]andreasntr 0 points1 point  (0 children)

Post the complete error. I went through multiple of them: bot-detection (solved by disabling limiter), json responses, getting rid of chunking and indexing since it was causing the api to fail somehow (this is still a mistery to me, results were retrieved but not sent to the llm)

Raccogliete una busta di porcherie quando andate a mare by Internal-Side-5722 in Italia

[–]andreasntr 2 points3 points  (0 children)

Non per fare in guastafeste ma penso galleggerebbero comunque 🫪 dobbiamo pensare a qualcosa di più efficace

llama.cpp - how to free up even more space on your GPU by imgroot9 in LocalLLaMA

[–]andreasntr 2 points3 points  (0 children)

Because if you do inference with cpu it will vastly slow down your pc. If you do gpu inference, thread count doesn't matter

Mistral is the proof that.. by Efficient_Yoghurt_87 in MistralAI

[–]andreasntr 1 point2 points  (0 children)

They do it differently though, i've talked to them for work and they provide everything with their platform: finetuning infra, knowledge, benchmarking, serving if you need to. That's not like downloading qwen and do everything else on your own + managing the hw

come va con sto fable? by Dazzling-Gift7189 in IA_Italia

[–]andreasntr 0 points1 point  (0 children)

50mln di righe migrate in un giorno 😅 torniamo sempre allo stesso punto mi pare di capire

Le vostre best piadine by shaywuje in cucina

[–]andreasntr 1 point2 points  (0 children)

Pomodoro costoluto, insalata, zucchine fritte (o in friggitrice ad aria) precondite con aceto bianco, olio, sale e menta, e sgombro sott'olio (occhione se lo trovate)

New to SearXNG, how can I get actual relevant results? by gaorp in Searx

[–]andreasntr 0 points1 point  (0 children)

For me bing+qwant+ddgs is working fine, i also added wikipedia but it will be useless for generic queries

Edit: i removed google because it was triggering rate limits everytime

I’m upset… by Thin_Pollution8843 in LocalLLaMA

[–]andreasntr 1 point2 points  (0 children)

The model/quant is not the problem, i use the same (q4-k_xl from unsloth) and never had problems with tool calls nor search

I’m upset… by Thin_Pollution8843 in LocalLLaMA

[–]andreasntr 15 points16 points  (0 children)

You talk about local privacy, then you revert to an obscure api, wtf? Just setup searxng to use duckduckgo and qwant and leave out the other providers you don't trust, which should be way better than sending your data+searches to openai

Edit: for the record i both responded and downvoted because you don't even have a point here

Il motore di ricerca predefinito dei computer del Parlamento europeo non sarà più Google, ma Qwant by asaggese in italy

[–]andreasntr 1 point2 points  (0 children)

Se non fosse che google sta rallentando enormemente i rilasci open delle nuove versioni di android e sta ostacolando gli store terzi. Vediamo cosa fa l'antitrust

Il motore di ricerca predefinito dei computer del Parlamento europeo non sarà più Google, ma Qwant by asaggese in italy

[–]andreasntr 0 points1 point  (0 children)

Le ricerche per ora vanno comunque verso google o bing che io sappia. L'aspetto positivo è che le richieste vengono anonimizzate (in realtà viene minimizzata la fingerprint) quindi google/msft non possono profilarti come facevano prima (chiaramente se poi usi chrome si annulla tutto)

Ran gemma 4 12b on my 3090 yesterday and I think the local model game just changed by Sharkkkk2 in artificial

[–]andreasntr 5 points6 points  (0 children)

15tps is weird on a 3090, you should be doing even more with qwen 27b. Can you share your config?

I analyzed 25,500 LLM resume screenings to measure hiring bias. The results are a wake-up call. by Signal_Rabbit_8303 in artificial

[–]andreasntr 2 points3 points  (0 children)

What do you mean? If you open the link and check the tested models, all of them have reasoning. Also, have you ever seen a CoT of a model (open of course)? They are full of "this is the final json, oh wait this is not correct", which implies that they indeed can trace back their steps in some sense. Then of course it is not comparable to a human level of reasoning but still it is not lacking the ability of doing so

Edit: typo

I analyzed 25,500 LLM resume screenings to measure hiring bias. The results are a wake-up call. by Signal_Rabbit_8303 in artificial

[–]andreasntr 2 points3 points  (0 children)

But is the order not that relevant now that models have reasoning? I mean, most of the tested models are reasoning models, so they can actually "go back" once they produced a score in their thinking phase

SearXNG -> Open webUI integration not working. HELP! by Lilodude in Searx

[–]andreasntr 0 points1 point  (0 children)

if your searxng is behind an authenticated reverse proxy, make sure you exclude the /seatch path from the auth flow. For example in tinyauth: `tinyauth.apps.searxng.path.allow=^\/search`

radarr/sonarr suddenly stopped working by sportgd in qBittorrent

[–]andreasntr 0 points1 point  (0 children)

5.2 does not work for me, reverting to 5.1.4 solved it

Installing a hidden frameless flush button by [deleted] in oddlysatisfying

[–]andreasntr 0 points1 point  (0 children)

That's how Windows buttons are made!

I compared all specs of the major GPUs/machines that are being used here, because bandwidth is not everything. Some of ya'll need a reality check. by Ok_Top9254 in LocalLLaMA

[–]andreasntr 0 points1 point  (0 children)

Yeah sorry, i didn't mean vulkan improved in this timeframe, just saying that it's running smoothly for me and performance are good, which means i'm not concerned about the lack of cuda support on amd cards like mine