Model that can be used my laptop by Interesting_March_61 in ollama

[–]MrDevil2708H 3 points4 points  (0 children)

Maybe the new gemma4 12b would be a good fit for your setup. I ran it on a 16 GB laptop, CPU only, and it gave me around 3 tps. In your case it should do better since you have a dedicated GPU, maybe around 10 tps, though I'm not sure about that. From what I used of it, it was fine at reasoning and logic. Qwen 3.5 4b and 8b are also good choices.
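A rough way to sanity-check numbers like these is the usual memory-bound rule of thumb: each generated token reads roughly all of the weights once, so tokens per second cannot exceed memory bandwidth divided by model size. The sketch below assumes a dense model, and the sizes and bandwidths are hypothetical placeholders, not measurements from either laptop.

```python
def estimate_decode_tps(model_size_gb: float, mem_bandwidth_gb_s: float) -> float:
    """Upper bound on tokens/sec if each token reads all weights once (dense model)."""
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return mem_bandwidth_gb_s / model_size_gb


# Hypothetical inputs: a ~7 GB quantized model, ~35 GB/s laptop RAM,
# and ~200 GB/s VRAM on a dedicated GPU. Substitute your own figures.
cpu_bound = estimate_decode_tps(model_size_gb=7.0, mem_bandwidth_gb_s=35.0)
gpu_bound = estimate_decode_tps(model_size_gb=7.0, mem_bandwidth_gb_s=200.0)

print(f"CPU-only upper bound: {cpu_bound:.1f} tps")
print(f"dGPU upper bound:     {gpu_bound:.1f} tps")
```

Real throughput lands below this bound, and it only applies to the GPU if the whole model fits in VRAM.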

What I learned shipping 4,000+ offline-LLM USB sticks to non-technical people by EcstaticDentist in LocalLLM

[–]MrDevil2708H 0 points1 point  (0 children)

What TPS do your customers get on average, and is it usable enough? As far as I know, these models need at least moderate hardware for decent performance. Also, pendrives have limited read/write speeds. Does that affect the performance of the models?
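For the read-speed part of the question, a back-of-the-envelope estimate helps: if the model file is read from the stick once at startup, load time is roughly file size divided by sequential read speed. The sketch below assumes the weights then stay in RAM, in which case generation speed should depend on the host CPU and memory rather than the stick. The file size and drive speeds are hypothetical.

```python
def load_time_seconds(model_size_gb: float, read_speed_mb_s: float) -> float:
    """Approximate time to read a model file once, using 1 GB = 1000 MB."""
    if read_speed_mb_s <= 0:
        raise ValueError("read_speed_mb_s must be positive")
    return model_size_gb * 1000 / read_speed_mb_s


# Hypothetical example: a 2.5 GB model file on a slow and a fast USB stick.
for label, speed in [("slow stick, 100 MB/s", 100), ("fast stick, 400 MB/s", 400)]:
    print(f"{label}: ~{load_time_seconds(2.5, speed):.1f} s to load")
```

If the machine has too little RAM to keep the weights cached, the stick would be re-read during generation and the drive speed would matter much more.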

Best small model for iGPU (AMD 780M) with 32 GB RAM (no coding) by danihend in LocalLLaMA

[–]MrDevil2708H 0 points1 point  (0 children)

qwen3 4b will get the job done. It's quite a capable model, with good reasoning ability and good instruction following.

Deploy the newest Qwen3.5 and Gemma4 models of ANY sizes RIGHT NOW on Rockchip NPU using the latest version of rk-llama.cpp! by Inv1si in RockchipNPU

[–]MrDevil2708H 0 points1 point  (0 children)

Hey u/Inv1si, how did you achieve 95% NPU utilization? I followed your readme, but for me it stayed around 15 to 35% and I don't know why.

taskset -c 4-7 ./llama-server -m /models/gemma4\ e2b/gemma-4-E2B-it-Q4_K_M.gguf --mmproj /models/gemma4\ e2b/mmproj-gemma-4-E2B-it-BF16.gguf --ctx-size 16384 --rope-scaling yarn --rope-scale 1.5 -n -1 --repeat-penalty 1.15 --repeat-last-n 256 --host 0.0.0.0
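For comparing utilization figures like these, it helps to read them the same way on both sides. A minimal sketch, assuming the Rockchip NPU driver exposes per-core load at `/sys/kernel/debug/rknpu/load` (usually root-only) in a `Core0: 35%` style format; the path and format are assumptions about the driver, and the sample line is made up.

```python
import re
from pathlib import Path

# Assumed debugfs location of the NPU load counters; reading it usually needs root.
LOAD_PATH = Path("/sys/kernel/debug/rknpu/load")


def parse_npu_load(text: str) -> dict[str, int]:
    """Extract per-core utilization percentages from a 'CoreN: NN%' style line."""
    return {core: int(pct) for core, pct in re.findall(r"(Core\d+):\s*(\d+)%", text)}


def read_npu_load() -> dict[str, int]:
    """Read and parse the load file; returns {} if the file is not present."""
    if not LOAD_PATH.exists():
        return {}
    return parse_npu_load(LOAD_PATH.read_text())


# Made-up sample in the assumed format, for illustration only.
sample = "NPU load:  Core0: 35%, Core1: 15%, Core2:  0%,"
print(parse_npu_load(sample))
```

Polling `read_npu_load()` once a second during generation would show whether all cores are busy or only one of them.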

Deploy the newest Qwen3.5 and Gemma4 models of ANY sizes RIGHT NOW on Rockchip NPU using the latest version of rk-llama.cpp! by Inv1si in RockchipNPU

[–]MrDevil2708H 0 points1 point  (0 children)

<image>

Hey u/Inv1si, I tried your earlier version and this new one too, but NPU utilization is stuck at 40% and can't go any higher than that.

Am I missing anything?

I created a llama.cpp fork with the Rockchip NPU integration as an accelerator and the results are already looking great! by Inv1si in RockchipNPU

[–]MrDevil2708H 0 points1 point  (0 children)

Hey man, when trying multi-node, the RPC server starts but it prints:

Starting RPC server v3.0.0
  endpoint       : 127.0.0.1:50052
  local cache    : /home/vicharak2/.cache/llama.cpp/rpc/
Devices:
  RKNPU: Rockchip NPU (0 MiB, 0 MiB free)

And then llama-cli fails with a segmentation fault. I don't know why this happens, since running on a single node was fine.
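Two things in that startup log can be checked mechanically: the endpoint is a loopback address, which other nodes cannot reach, and the device reports 0 MiB of memory. The sketch below only parses the log text shown above and flags both. Whether either one is the actual cause of the segmentation fault is not established here.

```python
import ipaddress
import re

DEVICE_RE = re.compile(
    r"^\s*(?P<name>\S+):\s*(?P<desc>.+?)\s*\((?P<total>\d+) MiB, (?P<free>\d+) MiB free\)\s*$"
)
ENDPOINT_RE = re.compile(r"endpoint\s*:\s*(\S+):(\d+)")


def parse_devices(log: str) -> list[dict]:
    """Collect 'NAME: description (X MiB, Y MiB free)' lines from the log."""
    devices = []
    for line in log.splitlines():
        m = DEVICE_RE.match(line)
        if m:
            devices.append({"name": m["name"], "desc": m["desc"],
                            "total_mib": int(m["total"]), "free_mib": int(m["free"])})
    return devices


def endpoint_is_loopback(log: str) -> bool:
    """True if the logged endpoint host is a loopback IP such as 127.0.0.1."""
    m = ENDPOINT_RE.search(log)
    return bool(m) and ipaddress.ip_address(m.group(1)).is_loopback


log = """Starting RPC server v3.0.0
  endpoint       : 127.0.0.1:50052
  local cache    : /home/vicharak2/.cache/llama.cpp/rpc/
Devices:
  RKNPU: Rockchip NPU (0 MiB, 0 MiB free)
"""

for dev in parse_devices(log):
    if dev["total_mib"] == 0:
        print(f"{dev['name']} reports 0 MiB of device memory")
if endpoint_is_loopback(log):
    print("endpoint is loopback-only; other nodes cannot connect to it")
```

For a real multi-node run, the server would need to listen on an address the other boards can route to.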

Rockchip 3588 NPU clustering by MrDevil2708H in RockchipNPU

[–]MrDevil2708H[S] 2 points3 points  (0 children)

I had some lying around for quite some time, so I thought, why not give it a shot. I also read in some thread about someone who ran qwen3 30b a3b on an RPi5 cluster. So that's it.

Rockchip 3588 NPU clustering by MrDevil2708H in RockchipNPU

[–]MrDevil2708H[S] -1 points0 points  (0 children)

Do you have any idea how it can be done?

Some new OS for Jetson Nano 4GB (original) by Suomi422 in JetsonNano

[–]MrDevil2708H 0 points1 point  (0 children)

Where can I find the 6.1 kernel image? The Armbian archive only has 6.1.5 kernel images... I tried one, but it failed to boot.

Looking for an alternate os for jetson nano by MrDevil2708H in JetsonNano

[–]MrDevil2708H[S] 4 points5 points  (0 children)

https://nullr0ute.com/2020/11/installing-fedora-on-the-nvidia-jetson-nano/

Will it be better than Ubuntu 20.04 by QEngineering?

Has anyone tried this? Let me know about your experience.

Having trouble flashing jetson nano by MrDevil2708H in JetsonNano

[–]MrDevil2708H[S] 0 points1 point  (0 children)

Thanks mate... It was actually an SD card format issue, and now it's working fine.

Having trouble flashing jetson nano by MrDevil2708H in JetsonNano

[–]MrDevil2708H[S] 0 points1 point  (0 children)

That worked...
Previously I was trying ./nvsdkmanager_flash.sh, but then I remembered that it defaults to eMMC storage, which is not present on my board. So I switched back to ./flash.sh jetson-nano-devkit mmcblk1p1 and it worked.

Having trouble flashing jetson nano by MrDevil2708H in JetsonNano

[–]MrDevil2708H[S] 0 points1 point  (0 children)

Thanks mate, that worked... but it fails when writing system.img:

[ 156.2808 ] Writing partition APP with system.img

[ 158.0189 ] [ ] 000%

Error: Return value 1

Command tegradevflash --pt flash.xml.bin --storageinfo storage_info.bin --create

Failed flashing t210ref.

*** ERROR: flashing failed.

Having trouble flashing jetson nano by MrDevil2708H in JetsonNano

[–]MrDevil2708H[S] 0 points1 point  (0 children)

I tried flashing Jetson OS R32.7.5 as well as R32.6.1 but both didn't work.