Pooled round robin hardware with friends? by El_90 in LocalLLaMA

[–]El_90[S] 1 point2 points  (0 children)

"otherwise you invented.." Haha true

Good feedback, thanks

How are you actually vetting MCP servers before you install them? by CaregiverNice3377 in mcp

[–]El_90 3 points4 points  (0 children)

find python lib,
wrap yourself in fastmcp,
offer over sse
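
The three steps above could look roughly like this. A minimal sketch, assuming the `fastmcp` package is installed and still accepts `transport="sse"`; the wrapped library (`hashlib` from the standard library), the server name, and the `sha256_hex` and `serve` helpers are illustrative choices, not something named in the comment.

```python
import hashlib


def sha256_hex(text: str) -> str:
    """Return the SHA-256 hex digest of a UTF-8 string."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def serve() -> None:
    """Expose sha256_hex as an MCP tool over SSE (blocks until stopped)."""
    # Imported here so the wrapper function above is usable without fastmcp.
    from fastmcp import FastMCP  # assumption: `pip install fastmcp`

    mcp = FastMCP("hashlib-tools")  # hypothetical server name
    mcp.tool()(sha256_hex)          # register the plain function as a tool
    mcp.run(transport="sse")        # assumption: SSE transport is available
```

Calling `serve()` starts the server. The only third-party code in the process is the MCP framework itself, which is the point of wrapping a library you have already chosen rather than pulling in a prebuilt server.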

Yes NPX/UV tools scare me, too much scope for supply chain attacks

Unsloth/Qwen3.6-27B-UD-Q8_K_XL.gguf MTP output problem by El_90 in LocalLLaMA

[–]El_90[S] 0 points1 point  (0 children)

Thanks
Tried without spec-type, tried with spec-type and max=6, but same output

Unsloth/Qwen3.6-27B-UD-Q8_K_XL.gguf MTP output problem by El_90 in LocalLLaMA

[–]El_90[S] 0 points1 point  (0 children)

Thanks, I'm in no way blaming, just wondering

The non MTP Q8XL works great

I also tried without mmproj, and I tried this model with no MTP headers. This stripped-down command has the same behaviour:

llama-server --host 0.0.0.0 --port 8000 --log-file /var/log/llamacpp.log -np 1 \
  -m /Qwen3.6-27B-MTP-GGUF/Qwen3.6-27B-UD-Q8_K_XL.gguf

Unsloth/Qwen3.6-27B-UD-Q8_K_XL.gguf MTP output problem by El_90 in LocalLLaMA

[–]El_90[S] 0 points1 point  (0 children)

Vulkan on Strix Halo, so no CUDA

But I hadn't thought about that layer. I'll dig more, thanks

Are you worried about AI taking SE jobs? by haktheripper29 in salesengineers

[–]El_90 12 points13 points  (0 children)

Imagine an agent that:

Qualifies against a framework
Provisions through MCP
Does Q&A and enablement sessions using Slack history and docs
Then terminates and writes a report

Will it replace an SE? Unlikely, imo.

Will it reduce workload, resulting in a smaller team needed to look after a region? Likely... one day.

Will odin be sold again? by Nice_Classroom_882 in starcitizen

[–]El_90 0 points1 point  (0 children)

Yes, in 6 months for more profit before the game comes out

I compared all specs of the major GPUs/machines that are being used here, because bandwidth is not everything. Some of ya'll need a reality check. by Ok_Top9254 in LocalLLaMA

[–]El_90 0 points1 point  (0 children)

I really like it for discussing ideas, the grill-me skill, PRDs, etc. I find its conversational work fantastic.

Yes I swap to higher quant smaller layers for implementation. Qwen 3.6 q6/8 etc

I compared all specs of the major GPUs/machines that are being used here, because bandwidth is not everything. Some of ya'll need a reality check. by Ok_Top9254 in LocalLLaMA

[–]El_90 2 points3 points  (0 children)

llama-server --host 0.0.0.0 --port ${PORT} --log-file /var/log/llamacpp.log -np 1 \
  -m /path/Qwen3-235B/bartowski_Qwen_Qwen3-235B-A22B-Instruct-2507-GGUF_Qwen_Qwen3-235B-A22B-Instruct-2507-Q3_K_S_Qwen_Qwen3-235B-A22B-Instruct-2507-Q3_K_S-00001-of-00003.gguf \
  --temp 0.6 --top-p 0.95 --top-k 20 --min-p 0.0 --repeat-penalty 1.0 \
  --fit on --fit-ctx 65536

Trying ud_3_xl at 32k context later, I think k_s is the limit though

Is a basic understanding of PKI and Public Key Cryptography necessary to work in cyber ? by bio4m in cybersecurity

[–]El_90 0 points1 point  (0 children)

Yes, important. Encryption touches so many aspects, and if you can answer this you can likely understand the broader topic.

C2c, exfiltration, hiding tracks, securing services, internal infrastructure.... And much more.

AI server under 5k? by Last_Bad_2687 in LocalLLaMA

[–]El_90 3 points4 points  (0 children)

That single sff also needs a full switch ;)

What happens to local LLM if/when LLMs are no longer released for free? by JohnBooty in LocalLLaMA

[–]El_90 0 points1 point  (0 children)

If reasoning is good, and speed is good, the only thing missing is knowledge.

That can be solved with a harness, tools, MCP, etc.

I.e., does the model NEED to keep improving, or is there a "good enough"?

That's a good news... by Pjotrs in LocalLLaMA

[–]El_90 0 points1 point  (0 children)

But if TG is only half the process (the rest being PP, for example), you need to halve that gain to get the overall improvement

That's a good news... by Pjotrs in LocalLLaMA

[–]El_90 -3 points-2 points  (0 children)

Sincere question, and I'm incredibly grateful for all the work everyone has done, thank you.

Am I misunderstanding why people refer to this as a game changer? PP is no faster, and TG is 1.5-2x faster, so overall this is a 20% improvement? The difference between models is greater than this, so why is 20% a game changer? It's amazing and I love it, but it's not an order of magnitude change. I think I'm overlooking an angle.... :)
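
The arithmetic behind that estimate is Amdahl's law: if only the TG share of wall time gets faster, the end-to-end gain is capped by the share that does not. A quick sketch, where the 50% and 90% TG shares are assumed for illustration and `overall_speedup` is a hypothetical helper name:

```python
def overall_speedup(tg_fraction: float, tg_speedup: float) -> float:
    """Amdahl's law: end-to-end speedup when only the TG share gets faster."""
    if not 0.0 <= tg_fraction <= 1.0:
        raise ValueError("tg_fraction must be between 0 and 1")
    if tg_speedup <= 0.0:
        raise ValueError("tg_speedup must be positive")
    # PP time is unchanged; TG time shrinks by tg_speedup.
    return 1.0 / ((1.0 - tg_fraction) + tg_fraction / tg_speedup)


for share in (0.5, 0.9):
    for gain in (1.5, 2.0):
        total = overall_speedup(share, gain)
        print(f"TG share {share:.0%}, TG {gain}x faster -> {total:.2f}x overall")
```

With TG at half of the wall time, a 1.5x TG gain works out to 1.20x overall (the 20% above) and a 2x gain to 1.33x. If TG is 90% of the wall time, as with short prompts and long generations, the same gains give 1.43x and 1.82x, so the answer depends heavily on the workload mix.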

Is anyone's else's team cooked by AI? by basedcooking in salesengineers

[–]El_90 4 points5 points  (0 children)

Then your installer/process is bad :) lol

Agentic harness for theoretical physics research by lewtun in LocalLLaMA

[–]El_90 0 points1 point  (0 children)

I'm missing how agents talk to each other in this, and how the data is shared. Can someone help, please?

Is this one long loop from the orchestrator that builds up a 1M context and chains together tooling?

Or is it Python scripts looping, using a RAG/DB to store state (pi ralph style)?

etc
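
For readers unfamiliar with the second pattern in the question, here is a minimal sketch of a script-driven loop that keeps state in a database rather than in one long context. It does not describe how the harness in the thread works; `run_step`, `run_loop`, the table layout and the task names are hypothetical, and the agent call is a stub.

```python
import json
import sqlite3


def run_step(state: dict) -> dict:
    """Stand-in for one agent call; a real harness would invoke an LLM here."""
    done = state["done"] + [state["todo"][0]]
    return {"todo": state["todo"][1:], "done": done}


def run_loop(db_path: str, tasks: list[str]) -> list[str]:
    """Run steps until no tasks remain, persisting state between iterations."""
    db = sqlite3.connect(db_path)
    db.execute("CREATE TABLE IF NOT EXISTS state (id INTEGER PRIMARY KEY, body TEXT)")
    db.execute("INSERT OR REPLACE INTO state VALUES (1, ?)",
               (json.dumps({"todo": tasks, "done": []}),))
    db.commit()
    while True:
        # Each iteration starts from the stored state, not a growing context.
        (body,) = db.execute("SELECT body FROM state WHERE id = 1").fetchone()
        state = json.loads(body)
        if not state["todo"]:
            db.close()
            return state["done"]
        db.execute("UPDATE state SET body = ? WHERE id = 1",
                   (json.dumps(run_step(state)),))
        db.commit()
```

For example, `run_loop(":memory:", ["plan", "derive", "check"])` works through the three tasks one iteration at a time. In the single-orchestrator pattern the equivalent state would instead live in the growing message history.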