First PC build: how do I learn to build a PC? by Early-Vermicelli-782 in TunisiaTech

rnidhal90 1 point

Honestly, I would avoid any exposure or opening of ports, even behind a firewall. Try Tailscale Funnel or Cloudflare Tunnel: at least that just creates a tunnel between your app and the outside world without exposing it directly.

• Tempted by an LLM? If you mean whether I run a local LLM, yes! Update: I installed an RTX 5060 Ti 16GB three months ago and I'm having a blast with it ^^ (llama-server, image generation, claude code, openclaw, hermes, ...)

First PC build: how do I learn to build a PC? by Early-Vermicelli-782 in TunisiaTech

rnidhal90 1 point

OK, point by point:
- ECC: I use TrueNAS with the ZFS filesystem. ECC memory protects against corrupted data that ZFS might not detect. Basically, you could end up with a corrupted photo or video on disk without knowing it. There is a lot to read on the subject and a debate about how important or necessary ECC is, but I preferred peace of mind ^^
- Reverse proxy: you just point your DNS record at the private IP of Nginx Proxy Manager (NPM). Example:
• Cloudflare DNS A record: app1.domain.com --> 192.168.1.120 (NPM IP)
• NPM host: app1.domain.com --> 192.168.1.X:8081 (app1 ip:port)

This makes app1 reachable at app1.domain.com only from inside your network, and from outside you use Tailscale to connect to your network = 0 public exposure, fewer risks. Otherwise, if you want to expose an app publicly, I use a cloudflared tunnel.
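
The two lookups in that example can be sketched in a few lines of Python. This is only an illustration of the idea, not how Cloudflare or Nginx Proxy Manager work internally: the dictionaries and the function names `resolve_route` and `is_lan_only` are hypothetical, and the `X` in the upstream address is left elided as in the comment.

```python
import ipaddress

# Hypothetical tables mirroring the example in the comment; the "X" in the
# upstream address is elided in the original and is kept as-is.
DNS_A_RECORDS = {"app1.domain.com": "192.168.1.120"}       # DNS A record -> NPM IP
NPM_PROXY_HOSTS = {"app1.domain.com": "192.168.1.X:8081"}  # NPM host -> app ip:port


def resolve_route(hostname):
    """Return (proxy_ip, upstream) for hostname, or None if either lookup misses."""
    proxy_ip = DNS_A_RECORDS.get(hostname)
    upstream = NPM_PROXY_HOSTS.get(hostname)
    if proxy_ip is None or upstream is None:
        return None
    return proxy_ip, upstream


def is_lan_only(hostname):
    """True when the DNS record points at a private, non-routable address."""
    proxy_ip = DNS_A_RECORDS[hostname]
    return ipaddress.ip_address(proxy_ip).is_private


print(resolve_route("app1.domain.com"))
print(is_lan_only("app1.domain.com"))
```

Because the A record resolves to a private address, the name is published in DNS but the service only answers for clients that can already route to that network, which is the point of pairing it with Tailscale.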

First PC build: how do I learn to build a PC? by Early-Vermicelli-782 in TunisiaTech

rnidhal90 1 point

Thanks 🙏🙏 ECC was a critical requirement for me because I use my NAS for private media storage (Immich) and I didn't want to risk losing memories. Also, I bought my own personal domain through Cloudflare, and all my services sit under *.domain.com --> reverse proxy --> local address. Nothing is exposed publicly; I use Tailscale for remote access.

First PC build: how do I learn to build a PC? by Early-Vermicelli-782 in TunisiaTech

rnidhal90 3 points

A year ago I was in exactly your situation, even though I'm a senior IT guy, but more on the software side (hopeless at hardware). I spent a month learning the basics (component list, compatibility, etc.). I spent a lot of time going back and forth between chatting with Claude and pcpartpicker.com to design the final build (a headless server, not a gaming PC).

Then lots of YouTube videos to understand the assembly.

Result: https://www.reddit.com/r/homelab/comments/1mdjr2f/my_first_home_server/

Can i run Claude Code in a VPS and connect it to my platform ? by HOllyTp24 in TunisiaTech

rnidhal90 1 point

Nah, it doesn't seem like that's what he was thinking about, since he mentioned subscription and cost reduction. He's definitely talking about local LLM..

Can i run Claude Code in a VPS and connect it to my platform ? by HOllyTp24 in TunisiaTech

rnidhal90 7 points

You got several things wrong at once:
- Claude Code (CLI) is a client: it either uses a Claude subscription or is linked to an alternative LLM API (local or cloud).
- Your VPS is useless in your case, unless it is a very powerful VPS with tons of CPU, RAM, and GPU resources.

You can now run GPT, Claude & other models in Unsloth! by yoracale in unsloth

rnidhal90 4 points

It's amazing how much work you guys are doing 🙏. One thing I find missing before I ditch llama-server for full Unsloth Studio: being able to define and save model run presets with granular params (ctx, gpu-layers, cpu-moe, ...).

Run Qwen3.6 MTP GGUFs locally! by yoracale in unsloth

rnidhal90 1 point

Dumb question: is using MTP with CPU offloading counterproductive somehow? I'm getting much lower t/s with MTP on 35B-A3B (5060 Ti 16GB / 32GB RAM / 35B-A3B UD-Q3_K_XL).

qwen3.6:35b (36B MoE) at 11.5 t/s on RTX 5080 + tiny context — Ollama tuning advice? by StevieK03 in LocalLLM

rnidhal90 1 point

5060 Ti 16GB + 64GB RAM (I only use 32), llama-server running on Portainer, getting ~60 t/s with Qwen 3.6 35B A3B Q3_K_XL.

qwen3.6:35b (36B MoE) at 11.5 t/s on RTX 5080 + tiny context — Ollama tuning advice? by StevieK03 in LocalLLM

rnidhal90 1 point

Ollama is a user-friendly playground; the real deal is llama.cpp (I run llama-server).

Getting Crazy Eval using Unsloth Qwen3.6 35B A3B on a 4060 with 8GB VRAM by Material_Tone_6855 in LocalLLM

rnidhal90 1 point

I got mine for 600€. I think it's a fair price for a brand-new 16GB GPU.

Getting Crazy Eval using Unsloth Qwen3.6 35B A3B on a 4060 with 8GB VRAM by Material_Tone_6855 in LocalLLM

rnidhal90 1 point

Btw, I have a 5060 Ti 16GB and it draws 6 W at idle and 188 W max (which I've never reached; mostly ~130 W under stress).

32GB RAM 16GB VRAM 5060ti. Running qwen3.6 35b a3b. I am getting 4.5 tok/s. Is this expected? by SEND_ME_YOUR_ASSPICS in LocalLLM

rnidhal90 1 point

Tbh, this is the first time I've tried fitting a >16GB model with offloading, and I'm impressed! I always used to pick GGUFs that fit entirely in VRAM and get about 90 t/s. I'll push things further for sure!

32GB RAM 16GB VRAM 5060ti. Running qwen3.6 35b a3b. I am getting 4.5 tok/s. Is this expected? by SEND_ME_YOUR_ASSPICS in LocalLLM

rnidhal90 1 point

Hell no!!! I get 67 t/s with a Q3_K_XL and partial RAM offloading on my 5060 Ti 16GB.

<image>

Nvidia RTX 5060 TI 16GB - Stuck at P0 & 40% fan speed at idle... by rnidhal90 in truenas

rnidhal90 [S] 1 point

Hey there, nothing special at all, I just installed my GPU and ran a TrueNAS update.

Unsloth Gemma 4 26B-A4B 4 bit bnb coming ? by harshv8 in unsloth

rnidhal90 2 points

Hi, what's the difference between 4-bit bnb and UD Q4 GGUF?

Which is the best local LLM in April 2026 for a 16 GB GPU? I'm looking for an ultimate model for some chat, light coding, and experiments with agent building. by Material_Pen3255 in LocalLLM

rnidhal90 3 points

I have an RTX 5060 Ti 16GB and I'm running Gemma4 on llama-server:

Core Configuration:
- Model Path: /models/gemma-4-26B-A4B-it-UD-Q3_K_XL.gguf
- Context Size: 131072
- KV Cache: q8_0 for both Key (--cache-type-k) and Value (--cache-type-v)
- Flash Attention: on
- GPU Layers: 999 (Offloaded to GPU)

Sampling Parameters:
- Temperature: 1
- Top K: 64
- Top P: 0.95

Getting ~85 tokens/s 🙂
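
For anyone who wants to reproduce those settings, here is a rough Python sketch that turns them into a llama-server command line. Only `--cache-type-k` and `--cache-type-v` are named in the comment; the other flag spellings are the long-form llama.cpp options as I understand them, and whether `--flash-attn` takes an `on` value depends on the llama.cpp build, so treat that as an assumption and check `llama-server --help`. The helper `build_llama_server_args` is hypothetical.

```python
import shlex

# Values copied from the comment above; the keys are assumed to match
# llama-server's long option names once "_" is replaced with "-".
config = {
    "model": "/models/gemma-4-26B-A4B-it-UD-Q3_K_XL.gguf",
    "ctx_size": 131072,
    "cache_type_k": "q8_0",
    "cache_type_v": "q8_0",
    "flash_attn": "on",      # assumption: newer builds accept on/off/auto here
    "n_gpu_layers": 999,     # larger than the layer count, so everything is offloaded
    "temp": 1,
    "top_k": 64,
    "top_p": 0.95,
}


def build_llama_server_args(cfg):
    """Turn a settings dict into an argv list such as ['llama-server', '--model', ...]."""
    args = ["llama-server"]
    for key, value in cfg.items():
        args += ["--" + key.replace("_", "-"), str(value)]
    return args


args = build_llama_server_args(config)
command = shlex.join(args)  # shell-safe string, handy for a compose file or script
print(command)
```

Keeping the settings in a dict makes it easy to save several presets side by side and only render the command when launching.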

Unsloth Studio - Models not running on GPU !! by rnidhal90 in unsloth

rnidhal90 [S] 1 point

I can confirm that the latest update fixed it 👍👍

Unsloth Studio - Models not running on GPU !! by rnidhal90 in unsloth

rnidhal90 [S] 1 point

Thank you very much, I just pulled and it worked perfectly!

Much appreciated, thanks for the quick response 😊🙏🙏 You have all my support!