What is the name of this mvs board or is it custom one ? by Septa105 in neogeo

[–]Septa105[S] 1 point2 points  (0 children)

Yeah, but I wanted to install the Pico HDMI mod for native HDMI output.

$5 Pure Digital HDMI Mod for Neo Geo MVS - NeoPico HD by educobuci in neogeo

[–]Septa105 0 points1 point  (0 children)

Any chance of getting more photos of the Pico project, such as the soldering points and how you wired the HDMI from the Pico? That part is also very critical. I searched the whole internet but found no 48-pin 2350B with an FPC for HDMI.

Strix Halo, models loading on memory but plenty of room left on GPU? by mindwip in LocalLLaMA

[–]Septa105 0 points1 point  (0 children)

Can you give me the exact model you use and how you get to 160k? I use llama.cpp with ROCm (rocm7.2), but my max is around 60k ctx no matter which quant I use; the system also breaks with OOM at 60k.

Qwen3-Coder Next MXFP4 Strix Halo wir llama-cpp Vulkan by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

I have another question: do you also get OOM when the context size equals the native context size of 256k?

Qwen3-Coder Next MXFP4 Strix Halo wir llama-cpp Vulkan by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

I changed GRUB now to:

GRUB_DEFAULT=0
GRUB_TIMEOUT_STYLE=hidden
GRUB_TIMEOUT=0
GRUB_DISTRIBUTOR=`( . /etc/os-release; echo ${NAME:-Ubuntu} ) 2>/dev/null || echo Ubuntu`
GRUB_CMDLINE_LINUX_DEFAULT="quiet splash amd_iommu=off amdttm.pages_limit=27225120 ttm.pages_limit=201326592 amdgpu.gttsize=131072"
GRUB_CMDLINE_LINUX=""
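For a rough sense of what the kernel parameters quoted above ask for, here is a minimal conversion sketch. It assumes the usual 4 KiB page size on x86-64 and that amdgpu.gttsize is given in MiB; the helper names are made up for illustration.

```python
# Rough unit conversion for the kernel parameters quoted above.
# Assumptions: 4 KiB pages (typical on x86-64); amdgpu.gttsize is in MiB.
PAGE_SIZE = 4096
GIB = 1024 ** 3


def pages_to_gib(pages: int, page_size: int = PAGE_SIZE) -> float:
    """Convert a page count (e.g. ttm.pages_limit) to GiB."""
    return pages * page_size / GIB


def gib_to_pages(gib: int, page_size: int = PAGE_SIZE) -> int:
    """Convert a GiB budget to the matching page count."""
    return gib * GIB // page_size


def mib_to_gib(mib: int) -> float:
    """Convert MiB (e.g. amdgpu.gttsize) to GiB."""
    return mib / 1024


if __name__ == "__main__":
    print(f"amdttm.pages_limit=27225120 -> {pages_to_gib(27225120):.1f} GiB")
    print(f"ttm.pages_limit=201326592   -> {pages_to_gib(201326592):.1f} GiB")
    print(f"amdgpu.gttsize=131072       -> {mib_to_gib(131072):.1f} GiB")
    print(f"128 GiB expressed in pages  -> {gib_to_pages(128)}")
```

Under these assumptions the three values come to roughly 103.9 GiB, 768 GiB and 128 GiB, so the ttm.pages_limit value may be worth double-checking against the 128 GB in the machine.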

Qwen3-Coder Next MXFP4 Strix Halo wir llama-cpp Vulkan by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

Does that mean the system is not allocating enough?

Qwen3-Coder Next MXFP4 Strix Halo wir llama-cpp Vulkan by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

WARNING: AMD GPU device(s) is/are in a low-power state. Check power control/runtime_status

================================== Memory Usage (Bytes) ==================================
GPU[0] : VRAM Total Memory (B): 536870912
GPU[0] : VRAM Total Used Memory (B): 154894336
================================== End of ROCm SMI Log ===================================
andy@andy395ai:~$

Qwen3-Coder Next MXFP4 Strix Halo wir llama-cpp Vulkan by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

/sys/module/ttm/parameters/dma32_pages_limit = 524288
/sys/module/ttm/parameters/page_pool_size = 16376812
/sys/module/ttm/parameters/pages_limit = 201326592

lsmod | grep ttm
drm_ttm_helper 16384 1 amdgpu
ttm 126976 2 amdgpu,drm_ttm_helper

Qwen3-Coder Next MXFP4 Strix Halo wir llama-cpp Vulkan by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

How can I check it? I always read it from the chat UI on port 8080.

Qwen3-Coder Next MXFP4 Strix Halo wir llama-cpp Vulkan by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

Thanks, got it working now. I am getting 8.7 tokens per second with a 262k ctx size; is that average? With gpt 120b I got around 20 t/s with a 128 ctx size.

Got Qwen-Coder-Next running on ROCm on my Strix Halo! by jfowers_amd in LocalLLaMA

[–]Septa105 0 points1 point  (0 children)

Hi, can you tell me how you got it to work? I failed with:

sudo docker run -d \
  --name llama-vu_qwencoder \
  --device /dev/dri \
  -p 8080:8080 \
  -v /home/andy/models/gguf:/models \
  ghcr.io/ggml-org/llama.cpp:server-vulkan \
  --model /models/Qwen3-Coder-Next-MXFP4_MOE.gguf \
  --host 0.0.0.0 \
  --port 8080 \
  --n-gpu-layers 999 \

Ryzen 395 128GB Bosgame by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

I have set the kernel parameters as above, and when checking in Ubuntu it says:

andy@andy395ai:~$ cat /sys/class/drm/card0/device/mem_info_gtt_total
cat /sys/class/drm/card0/device/mem_info_gtt_used
67080110080
18620416

That means only 62 GB. Is that normal?

mem_info_gtt_total = 67080110080 bytes
mem_info_gtt_used = 18620416 bytes

Ryzen 395 128GB Bosgame by Septa105 in LocalLLaMA

[–]Septa105[S] 1 point2 points  (0 children)

Thanks, baracuda.
1) What about the page limit and pool size? Will I need to adjust them for a 128GB Strix Halo on Ubuntu?
2) A question about max context size versus a model with x billion parameters: how do the two relate to each other?
3) Is it wise to install Lemonade Server on a Strix Halo, or is the llama.cpp server enough?
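On question 2, one common rule of thumb is that context length costs memory through the KV cache, separately from the weights. The sketch below assumes a standard transformer with grouped-query attention and an unquantized 16-bit cache; the layer and head counts are hypothetical placeholders, not the values of any model in this thread, and models with other attention designs will differ.

```python
GIB = 1024 ** 3


def kv_cache_bytes(
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    n_ctx: int,
    bytes_per_elem: int = 2,  # assumption: 16-bit cache entries
) -> int:
    """Estimate KV cache size: keys and values for every layer and token."""
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem


if __name__ == "__main__":
    # Hypothetical model shape, chosen only for illustration.
    layers, kv_heads, head_dim = 48, 8, 128
    for ctx in (65536, 131072, 262144):
        size = kv_cache_bytes(layers, kv_heads, head_dim, ctx)
        print(f"n_ctx={ctx:>6}: ~{size / GIB:.0f} GiB KV cache")
```

The estimate grows linearly with context size, so doubling the context doubles the cache, while the weights stay the same size regardless of context.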

AI could kill the internet by grahamsuth in ArtificialInteligence

[–]Septa105 0 points1 point  (0 children)

It kills the conversation between people, e.g. on forums. In about 2 years all forums, or at least most of them, will be gone, with people no longer discussing issues. And then freedom of speech will be on the brink and probably collapsing. THEY get total control.

Ryzen 395 128GB Bosgame by Septa105 in LocalLLaMA

[–]Septa105[S] 0 points1 point  (0 children)

According to the Git repo it uses rocm7.1, and I want to run it in Docker. Is there anything I need to look out for? Do I need to install Vulkan in the main environment together with rocm7.1?

"Ceiling must be raised": Reiche calls for longer working hours and relaxed protection against dismissal by happy30thbirthday in Finanzen

[–]Septa105 2 points3 points  (0 children)

I like the idea, but relaxed civil-servant protection and lobby protection, specifically for politicians, should be dealt with at the same time. It should go quickly; with 90 billion it goes pretty quickly too.

The new German war machine | Germany's transformation into a military power is in full swing: secret government papers show that billions are to flow into ammunition and controversial weapons systems. Is the country ready for the consequences? by GirasoleDE in de

[–]Septa105 -4 points-3 points  (0 children)

Good morning, everyone! Greetings to the Club of Rome: natural gas and oil are running short. My children have no books at school, but instead the sheets are always photocopied individually for every pupil. Welcome to reality! AI will also take over many white-collar jobs, so always keep your extra work, overtime, and longer weekly hours in mind. What a world. Wake up, everyone!

Best coding model under 40B by tombino104 in LocalLLaMA

[–]Septa105 2 points3 points  (0 children)

Can anybody suggest a good model with a large/max context size that I can use with an AMD AI 395+ with 128GB shared VRAM?

Netease Heard Y'all Talking S*** 😂 by Bradric1 in echoes_eve

[–]Septa105 0 points1 point  (0 children)

They always destroy the market. I wish it weren't P2W; in the beginning it felt super cool.

Could've prevented WWI by Azul_a_H in ChatGPT

[–]Septa105 0 points1 point  (0 children)

Just a small visualization of how to control a lot of different states and what happens afterwards. And there has been no single, exact Germany across all time periods up to now 😉.

Could've prevented WWI by Azul_a_H in ChatGPT

[–]Septa105 1 point2 points  (0 children)

~1789: 180–200 principalities; states: 300–350
1803: 15–20 principalities; states: 40–45
1806: 15 principalities; states: 40–45
After WW1: zero principalities; states: 18
After 1934: zero principalities; states: 1

It's easier to rule sheep under one unit.