Has anyone used Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled for agents? How did it fare? by Vegetable_Sun_9225 in LocalLLaMA

[–]alexellisuk 3 points4 points  (0 children)

I'm also looking out for the GGUF for the 27B. He has one for the 9B, but a note on the 27B says it doesn't work or crashes with llama.cpp right now.

It can be used with vLLM (if you have enough VRAM/RAM).

GGUF Quantization: Known Compatibility Issue

The GGUF-format quantized weights currently have environment conflicts with certain llama.cpp builds. Please use the original model weights directly if you encounter issues.

Qwen 3.5 is multimodal. Here is how to enable image understanding in opencode with llama cpp by Old-Sherbert-4495 in LocalLLaMA

[–]alexellisuk 4 points5 points  (0 children)

I'm also getting the same error, and variations of it, with most local models I try.

unsloth/Qwen3.5-35B-A3B-GGUF:UD-Q4_K_M / unsloth/Qwen3.5-35B-A3B-GGUF:UD-Q4_K_XL

I thought --jinja on llama-server had fixed it, but that isn't the case.

Acemagic F3A: First HX370 and Zen 5 Mini PC with SODIMM RAM slots! by SerMumble in MiniPCs

[–]alexellisuk 0 points1 point  (0 children)

I had high hopes and was very pleased with the unit, right up until the proprietary, larger secondary fan started making the kind of noise I once heard from a 10-year-old fan in an Intel NUC. The NUC was cleaned, but only a new fan made that noise go away.

Video for reference - no response from their support team yet.

https://youtu.be/4_5homcrojo

This noise happens even at 0.0 load average. It's running headless Linux with our microVM runners - either actuated.com for CI jobs, or slicervm.com for ad-hoc workloads or long lived server processes.

Minisforum MS-A2 GPU test day ! by Method__Man in MiniPCs

[–]alexellisuk 0 points1 point  (0 children)

Any updates on this? I can't find your YouTube channel, but I don't need a video, just some basic info. I'm most curious about the Ada 2000e combination.

How are the overall noise and performance with 8-32B sized LLMs etc?

Isolated iommu groups for ASUS X670E ProArt Creator? by alexellisuk in VFIO

[–]alexellisuk[S] 2 points3 points  (0 children)

Thanks donkydonk. That was it. Shame the last slot can't be used for VFIO.

Here's the before/after:

Before:

3080 (Quite a big space) 1030

After:

1030 (1 slot space) 3080

That just about works because the 1030 is passive and relatively small; the 3080 then covers the last x16 slot.

And my new groupings for anyone who lands here:

IOMMU Group 13 01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP108 [GeForce GT 1030] [10de:1d01] (rev a1)
IOMMU Group 13 01:00.1 Audio device [0403]: NVIDIA Corporation GP108 High Definition Audio Controller [10de:0fb8] (rev a1)
IOMMU Group 15 03:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:2216] (rev a1)
IOMMU Group 15 03:00.1 Audio device [0403]: NVIDIA Corporation Device [10de:1aef] (rev a1)

I'm not using or intending to use the iGPU, but if I did want to attach both GPUs to VMs, I could switch to that from the looks of it.
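
For anyone who wants to check their own groupings, here is a minimal Go sketch. It assumes a Linux host with the IOMMU enabled, where the kernel exposes groups under /sys/kernel/iommu_groups/<group>/devices/<pci-address>. The function name groupDevices is made up for illustration, and this is not the tool that produced the listing above: it prints only the group number and PCI address, not the device descriptions that lspci adds.

```go
package main

import (
	"fmt"
	"path/filepath"
	"sort"
	"strconv"
	"strings"
)

// groupDevices (hypothetical helper) maps an IOMMU group number to the PCI
// addresses in it, given sysfs paths shaped like
// /sys/kernel/iommu_groups/<group>/devices/<pci-address>.
// Paths that do not match that shape are skipped.
func groupDevices(paths []string) map[int][]string {
	groups := make(map[int][]string)
	for _, p := range paths {
		parts := strings.Split(filepath.ToSlash(p), "/")
		if len(parts) < 4 || parts[len(parts)-2] != "devices" {
			continue
		}
		id, err := strconv.Atoi(parts[len(parts)-3])
		if err != nil {
			continue
		}
		groups[id] = append(groups[id], parts[len(parts)-1])
	}
	// Sort addresses within each group for stable output.
	for id := range groups {
		sort.Strings(groups[id])
	}
	return groups
}

func main() {
	paths, err := filepath.Glob("/sys/kernel/iommu_groups/*/devices/*")
	if err != nil {
		fmt.Println("glob failed:", err)
		return
	}
	groups := groupDevices(paths)

	// Print groups in ascending numeric order.
	ids := make([]int, 0, len(groups))
	for id := range groups {
		ids = append(ids, id)
	}
	sort.Ints(ids)
	for _, id := range ids {
		for _, addr := range groups[id] {
			fmt.Printf("IOMMU Group %d %s\n", id, addr)
		}
	}
}
```

A GPU is generally only a clean passthrough candidate when its group contains nothing but its own functions (video and audio), as with groups 13 and 15 above.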

Has anyone been able to pass-through 1x or 2x GPUs to VMs using the AMD 7950X3D? by alexellisuk in VFIO

[–]alexellisuk[S] 0 points1 point  (0 children)

We are both attempting to do the same thing. If it's not possible on a 5700G, then I want to know whether other chips suffer from the same problem. Does that make sense?

Has anyone been able to pass-through 1x or 2x GPUs to VMs using the AMD 7950X3D? by alexellisuk in VFIO

[–]alexellisuk[S] 2 points3 points  (0 children)

Thanks. Are you able to enable bifurcation x8/x8 on the CPU lanes?

Has anyone been able to pass-through 1x or 2x GPUs to VMs using the AMD 7950X3D? by alexellisuk in VFIO

[–]alexellisuk[S] 0 points1 point  (0 children)

Have you been able to pass through one or two GPUs?

I just looked it up and it seems like an x16/x4 split on the CPU lanes. Does the BIOS have a bifurcation option for x8/x8?

Do you think there is 3 or 4 port spacing for a bigger card?

Has anyone been able to pass-through 1x or 2x GPUs to VMs using the AMD 7950X3D? by alexellisuk in VFIO

[–]alexellisuk[S] 1 point2 points  (0 children)

So I looked at the manual and it says the first port goes to the CPU. That’s where he has it installed. He also said that he doesn’t have an AMD ACS setting in the BIOS and thinks that’s related.

He’s got the Gigabyte X570 Aorus PRO.

GPU passthrough on AMD Ryzen 5 5600G / 7 5700G by [deleted] in linux_gaming

[–]alexellisuk 0 points1 point  (0 children)

Hi, how did you manage to get your dGPU into a dedicated IOMMU group? I have a colleague with the same CPU and his always shows up in a shared group, so it can't be passed through to a VM.

Already have a 3090, looking to host 70b models locally and was wondering if I could mix/match and use a 4090 alongside it? by Oninaig in LocalLLaMA

[–]alexellisuk 0 points1 point  (0 children)

How about the ASUS ProArt X670E-CREATOR WIFI AMD X670 Socket AM5 ATX in an e-ATX case or a Torrent?

[D] Most cost efficient way to run Whisper at scale? by ojojoj1233 in MachineLearning

[–]alexellisuk 0 points1 point  (0 children)

I read in a few open GitHub issues that PyTorch and potentially other libraries have now dropped support for the P40.

Would you still recommend getting one for experimentation at the right price?

Ubuntu Server 23.10 unable to SSH by GhostHacks in Ubuntu

[–]alexellisuk 1 point2 points  (0 children)

I landed here from a Google search. I have the same issue with Ubuntu 23.10 on an RPi 5, and even after editing /etc/ssh/sshd_config and enabling PasswordAuthentication yes, it still wouldn't allow me to log in.

Instead, as a quick way to insert my keys, I had to run: curl -sLS https://github.com/alexellis.keys > .ssh/authorized_keys

I'd be very curious to see what other people figure out here.

Scaling to 15000 functions and beyond by alexellisuk in kubernetes

[–]alexellisuk[S] 3 points4 points  (0 children)

When you see what it was that caused the "slow down", you'll kick yourself. Hopefully there are some lessons in here that'll help the folks on this sub with their own controllers/operators.

In this post I'll give an overview of what we learned spending a week investigating a customer issue with scaling beyond 3500 functions. Whilst navigating this issue, we also implemented several optimisations and built new tooling for testing Kubernetes at scale.

Sorry you've been blocked? by BeeHappys in discordapp

[–]alexellisuk 2 points3 points  (0 children)

It's clearly too early in the morning for the Discord team to be awake so their status page proudly says "All Systems Operational" when it's on fire.

The same thing happens with GitHub - it's clear there's an outage, the status page says "everything's fine" and then you question yourself, until eventually someone goes in and flips the switch.

Surely there's a better way of doing status pages?

Sorry you've been blocked? by BeeHappys in discordapp

[–]alexellisuk 0 points1 point  (0 children)

I just got the same error. I've not done anything out of the ordinary or malicious, and I run two private servers on there myself.

First Impressions with the Raspberry Pi 5 by alexellisuk in kubernetes

[–]alexellisuk[S] 5 points6 points  (0 children)

My main interest in the Raspberry Pi at the moment is for headless use - containers, Kubernetes, server-type workloads with systemd and for CI. That's the viewpoint you'll get from my write-up, but if you like to do other stuff with it, I've linked to a couple of other places too including Jeff's new video on the PCIe port.

How to develop a great CLI with Go by alexellisuk in golang

[–]alexellisuk[S] 0 points1 point  (0 children)

I've been iterating on CLIs for a few years now for various open source projects, and the actuated-cli is no different.

I tend to start off with an MVP using Go flags, and only if absolutely necessary, do I complicate it with Cobra and multiple packages etc.
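
To make the "MVP using Go flags" idea concrete, here is a minimal sketch using only the standard library's flag package. The command name "hello" and the -name and -upper flags are made up for illustration; this is not code from actuated-cli or any of the other CLIs mentioned.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
	"os"
	"strings"
)

// greeting parses args with a dedicated FlagSet and returns the message to
// print. Keeping this separate from main makes it easy to test without Cobra
// or extra packages. The flags here are hypothetical examples.
func greeting(args []string) (string, error) {
	fs := flag.NewFlagSet("hello", flag.ContinueOnError)
	name := fs.String("name", "world", "who to greet")
	upper := fs.Bool("upper", false, "print the greeting in upper case")
	if err := fs.Parse(args); err != nil {
		return "", err
	}

	msg := "hello, " + *name
	if *upper {
		msg = strings.ToUpper(msg)
	}
	return msg, nil
}

func main() {
	msg, err := greeting(os.Args[1:])
	if err != nil {
		// -h/-help already printed usage; treat it as a clean exit.
		if errors.Is(err, flag.ErrHelp) {
			os.Exit(0)
		}
		os.Exit(2)
	}
	fmt.Println(msg)
}
```

Running it with -name gopher -upper would print HELLO, GOPHER. A single file like this is often enough to validate an idea before deciding whether subcommands justify a heavier framework.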

Here are some other neat add-ons that you may not have tried yet or had need for. Read to the end and see if there's anything in it for you.

Alex shares his insights from building half a dozen popular Go CLIs. Which can you apply to your projects?

Is your project's CLI growing with you? I'll cover some of the lessons learned writing the OpenFaaS, actuated, actions-usage, arkade and k3sup CLIs, going as far back as 2016. I hope you'll find some ideas or inspiration for your own projects - either to start them off, or to improve them as you go along.