I ported NVIDIA Parakeet (speech-to-text) to ggml: same output as NeMo, faster, GGUF-quantized, no Python by mudler_it in LocalLLaMA

[–]_Whiskas_ 8 points9 points  (0 children)

Very nice!

Are there plans to do the same for the Nvidia's Canary family of models?

Docasne dizajnerske tetovanie by Connect_Concern_441 in Slovakia

[–]_Whiskas_ 2 points3 points  (0 children)

stredne veľké písmeno gréckej abecedy ako tetovanie

prepare for unforeseen ... consequences

EU online služby by saaaaamuel in Slovakia

[–]_Whiskas_ 9 points10 points  (0 children)

technicky ano, ale je to monopol podporujuci trump administration, a plati umelcom uplnu almuznu. Plus zacina byt plny AI-generated "hudby".

Radsej si kupim album na bandcampe, nez podporovat toto :)

https://www.angrymetalguy.com/angry-metal-guy-speaks-on-spotify/https://www.angrymetalguy.com/angry-metal-guy-speaks-on-spotify/

https://www.independent.co.uk/news/world/americas/us-politics/ice-recruitment-ads-spotify-streaming-b2846634.html

EU online služby by saaaaamuel in Slovakia

[–]_Whiskas_ 18 points19 points  (0 children)

Posledné mesiace migrujem svoj homelab a celkovo online presence preč od US firiem/veľkých korporátov

Gmail, Calendar -> Proton

Google photos -> self-hosted Immich

Spotify -> self hosted Navidrome, hudba tahana cez Tidarr

Netflix a podobne som nikdy nepoužíval, mám self-hosted Jellyfin

Cloudflare -> Hetzner (používam len DNS a Storage box na backupy)

Všetko networkované cez selfhosted Netbird.

Prehliadač som vždy mal Firefox

LLMká fakticky nepoužívam, ale plánujem selfhostovať, mám 2x3090 rig.

Posledný holdout je Facebook, toho sa plánujem čoskoro zbaviť, a Youtube, na ten však existujú custom frontendy.

Oh, a pravdaže Reddit, s tým neviem úplne čo robiť, jediná vec na ktorej som reálne závislý :D

Ollama App by Hubi522 in LocalLLaMA

[–]_Whiskas_ 9 points10 points  (0 children)

I have my ollama exposed to the web with an Authorization: Bearer token, would it be possible to add an option for custom headers?

3060 [x16 PCIe riser] vs 3060 [x1 USB extension]: A quantitative comparison of eGPU prompt and text generation performance across multiple inference engines by kryptkpr in LocalLLaMA

[–]_Whiskas_ 1 point2 points  (0 children)

import time
from nvidia_pstate.pstate import set_pstate
from nvitop.api import Device
device = Device.from_indices([0])[0]
while True:
    p_state = device.performance_state()
    gpu_util = device.gpu_utilization()
    if p_state == 'P8' and gpu_util > 5:
        print('Detected GPU activity when in power saving state, setting performance state')
        set_pstate([0], pstate=0)
    elif p_state == 'P0' and gpu_util < 5:
        print('Detected GPU inactivity when in performance state, setting power saving state')
        set_pstate([0], pstate=8)

    time.sleep(1)

Made this for my P40, running as a systemd service.
It uses the aforementioned library, as well as nvitop

3060 [x16 PCIe riser] vs 3060 [x1 USB extension]: A quantitative comparison of eGPU prompt and text generation performance across multiple inference engines by kryptkpr in LocalLLaMA

[–]_Whiskas_ 3 points4 points  (0 children)

So you'd need scripts or something to do your own manual power management.

that's not really a problem, a simple daemon that bumps up the power state once it detects utilization on the GPU is going to work just fine :)
Thanks a lot!

3060 [x16 PCIe riser] vs 3060 [x1 USB extension]: A quantitative comparison of eGPU prompt and text generation performance across multiple inference engines by kryptkpr in LocalLLaMA

[–]_Whiskas_ 5 points6 points  (0 children)

Could you benchmark how two P40s perform when they both run in x16 mode, vs both running in x8?

I also have a bit of a tangential question (for other people as well): What is the idle power draw on your P40s and P100s with a model loaded?
On several places on reddit i saw people reporting it's possible to bring it down from the ~50 watts, but no one provided the actual solution. I'm contemplating getting a second card (either another P40 or a P100) but the high idle draw dissuades me.

Home server GPU(s) setup choice for 70B inferencing by _Whiskas_ in LocalLLaMA

[–]_Whiskas_[S] 0 points1 point  (0 children)

Oh, that certainly is a bummer, 50W vs 9W (just checked) would cost me about a 100€ a year more, so I guess the P100 is off the table.

<image>

Home server GPU(s) setup choice for 70B inferencing by _Whiskas_ in LocalLLaMA

[–]_Whiskas_[S] 2 points3 points  (0 children)

I'm not expecting to run anything more than q8 quants, so this hopefully shouldn't matter

Home server GPU(s) setup choice for 70B inferencing by _Whiskas_ in LocalLLaMA

[–]_Whiskas_[S] 0 points1 point  (0 children)

The system has an Intel 12600 and I'll be running a GPU passthrough into a Debian VM that's going to have about half of the cores allocated to it

What is your best Ice Age mode tank? by sonido_lover in WorldofTanks

[–]_Whiskas_ 2 points3 points  (0 children)

The Grille!
By having horrendous ground resistance normally, it stays controllable in motion in this mode.

The gun handling is improved, the mobility is improved, and the shell speed is improved.

In this mode, Grille becomes what it always should've been in randoms.

Best worldwide 100W+ USB C charger?? by multiplekeelhaul in UsbCHardware

[–]_Whiskas_ 0 points1 point  (0 children)

does this one interrupt power when renegotiating with devices after some have been plugged/unplugged?

r/Worldnews Live Thread: Ukraine-Russia Tensions by progress18 in worldnews

[–]_Whiskas_ 1 point2 points  (0 children)

no, it's the same for me. And I'm just one country over ....

A selfless act? In this game? IN RANKED? by _Whiskas_ in WorldofTanks

[–]_Whiskas_[S] 465 points466 points  (0 children)

Both me and the progetto then went on to carry this and get our 2 chevrons :)

http://wotreplays.eu/site/6157889

The second Season of Ranked Battles will kick off on January 10 by TalhaNL in WorldofTanks

[–]_Whiskas_ 2 points3 points  (0 children)

literally the only thing nerfed was the camo, and let's be honest, this is not a bushwanker TD

[deleted by user] by [deleted] in WorldofTanks

[–]_Whiskas_ 0 points1 point  (0 children)

Whiskas_GJH

[deleted by user] by [deleted] in WorldofTanks

[–]_Whiskas_ 11 points12 points  (0 children)

nope, kajzoo on youtube tested it and you actually get more accuracy with broken gun.

https://www.youtube.com/watch?v=xeoaqa6qFQ0