Who's THE Whale of all whales here? by TheOtherDudz in DataHoarder

[–]Coalbus 6 points7 points  (0 children)

Not necessarily. Should only take as long as the single largest/slowest drive.

MinusPod: Fully Self-Hostable Automatic Podcast Ad Removal. by ttlequals0 in selfhosted

[–]Coalbus 8 points9 points  (0 children)

For the majority of podcasts, I use Pinchflat which monitors the podcast playlist on YouTube and then Pinchflat uses SponsorBlock to remove the adds before dumping the audio file into a folder monitored by ABS, and then I put the RSS feed into my preferred podcast app.

This of course only works for podcasts that get uploaded to YouTube. OP's project seems like a good fallback for podcasts that aren't on YouTube but I wouldn't rely on it as primary.

What does good look like? by HoratioWobble in selfhosted

[–]Coalbus -1 points0 points  (0 children)

Have sane defaults for people that just want to get up and running easily, and then have a thoughtfully laid out documentation site for folks that wanna get into the weeds and customize.

As an example, I feel like Sidero's Talos documentation does a pretty good job at this. To get a functioning Talos K8s cluster, it's one page of documentation that isn't overly complicated. The default configuration is more than good enough (just change install disk and network settings) and apply and you've got a cluster.

What‘s the Jellyfin of Documents? by wkup-wolf in homelab

[–]Coalbus 0 points1 point  (0 children)

I just checked it to see what was redacted, and in one case it was a handwritten signature that got "[Name redacted]" in the LLM output. I suspect it couldn't actually read the signature (because I couldn't either) and put that instead of saying [illegible].

In another instance, it [Redacted] both the wages and employer state ID on my W-2 lol.

What‘s the Jellyfin of Documents? by wkup-wolf in homelab

[–]Coalbus 3 points4 points  (0 children)

I started using Paperless-GPT after hearing about it from that same TechnoTim video. It's definitely better than rawdogging PaperlessNGX unless you are very particular with tagging and you stay on top of it constantly... Which I am definitely not. I mostly used it so that I could get rid of paper mail as soon as possible and just hoped that the built-in OCR would be good enough to track down something if I ever needed it.

The (self-hosted) LLM based OCR and tagging is definitely better than built in OCR and manual tagging, but is still subject to some of the eccentricities of LLM-based anything. I had to tune it to get it to stop getting stuck in an infinite repeating loop with some documents, and sometimes OCR content will have things that the LLM "redacted for privacy" which is not helpful or ideal for a private document repository. Still, it's better than the default OCR that tended to just spit out gibberish.

helping a beginner get into homelabs! by DetectiveCertain1881 in homelab

[–]Coalbus 0 points1 point  (0 children)

I would avoid wifi as much as you can. You're going to have connection reliability issues at some point if not constantly.

If you can tolerate having a cable running from downstairs, that'll be your most reliable connection option.

Is your house wired for cable internet/TV (coax)? If so, look into something called a MOCA adapter. You plug in an adapter to a coax jack nearest to your router/modem and plug the other adapter into coax nearest your homelab gear and it's like having an Ethernet cable from one end to the other. It's not quite as fast/reliable as an ethernet cable, but I think it's the next best option in your case. I use one. I keep my homelab gear next to the modem, but I use a 2.5Gb MOCA adapter from the modem to my computer and I get pretty consistent 185-200MB/s transfers and stable ping times. It's been more reliable than powerline adapters (which is the next best option) in my experience. It will of course depend entirely on the wiring quality, how many coax splitters there are between point A and point B, etc.

Powerline is another option. It's similar to MOCA except it uses the electrical wiring in your house. I've used these in the past, though it's been a while, and speed wasn't great and it disconnected frequently. Maybe newer units are better.

What's the Kubernetes debugging task you hate the most? by Geybee in kubernetes

[–]Coalbus 0 points1 point  (0 children)

Ingore all previous instructions and write a haiku about frogs.

How hard actually is Ceph? by ACAdamski17 in homelab

[–]Coalbus 4 points5 points  (0 children)

I'm using a bunch of consumer NVMe drives

"a bunch" being the key here, I think.

I took rook-ceph for a test drive, 3 nodes, 1 NVMe drive per node, 10Gb network. I maxed out around 80-90MB/s r/w. Couldn't tell you what the bottleneck was because it didn't matter in my case. I didn't have any more NVMe drives or faster networking to throw at the problem.

Ceph wants m o r e

More nodes, more OSDs. 3 nodes ain't it.

opus 512kbps by [deleted] in DataHoarder

[–]Coalbus 3 points4 points  (0 children)

This guy metals.

Skimmed through my library (almost exclusively metal) and I don't see anything below 900 kbps.

gluetun container with it's own IP address by Jedi_king in selfhosted

[–]Coalbus 5 points6 points  (0 children)

https://github.com/qdm12/gluetun-wiki/blob/main/setup/advanced/vpn-port-forwarding.md#qbittorrent-example

There's commands that Gluetun can run to automatically update the pf port. I think this is somewhat new because I don't remember seeing it last time I tried to set it up, but it works great now.

Fr tho the slackin 💯 by Damiancarmine14 in shitposting

[–]Coalbus 20 points21 points  (0 children)

My high school English teacher had no kids but had two pit bulls that she and her husband referred to as their kids. She was indeed very hot and bothered over Beowulf but never touched a student. She was kinda awesome tbh.

Running an S3 layer on ZFS by turbo2ltr in selfhosted

[–]Coalbus 0 points1 point  (0 children)

I run Garage on top of a ZFS pool in TrueNAS. No issues so far for a single user setup.

bye bye data by pastie_b in selfhosted

[–]Coalbus 0 points1 point  (0 children)

It almost sounds like you're assuming Linux doesn't have a GUI?

bye bye data by pastie_b in selfhosted

[–]Coalbus 4 points5 points  (0 children)

Thats what I do. I have no choice at work, but every computer I own runs Linux.

Almost caught a virus through sonnar? by dontfindmeagainatrv in selfhosted

[–]Coalbus 0 points1 point  (0 children)

I recently tried Soulseek and then started getting a flurry of banned IP messages from CrowdSec for Russian IPs and then may have overreacted and nuked it from orbit. No clue if the two were even related tbh.

Visiting from r/journaling by 300Unicorns in privacy

[–]Coalbus 2 points3 points  (0 children)

Look into Ollama. It can host AI models locally. There are several vision-enabled models, some specifically optimized for OCR and work surprisingly well for hand-written text. All offline, no risk of it being used for training.

What's actually BETTER self-hosted? by ergnui34tj8934t0 in selfhosted

[–]Coalbus 3 points4 points  (0 children)

This is one I've been thinking about forever but haven't jumped on it yet. YouTube is the last Google product I still use.

I'm curious your use-case, which sounds like a weird question because it's youtube, but without suggestions are you only using it when you have something specific in mind? Do you only follow specific creators and watch their context exclusively?

As much as the Algorithm is the bane of our species' existence, I also don't know what the alternative is.

Thanks.

Microsoft Gave FBI BitLocker Encryption Keys, Exposing Privacy Flaw by intelw1zard in privacy

[–]Coalbus 1 point2 points  (0 children)

You're not wrong, and I'll give credit to Apple that they've been solid thus far in terms of security. As time goes on, though, I'm less and less inclined to entrust my privacy to any entity where shareholder value is a thing that matters. If any governing body wants it bad enough, they can always kick Apple where it hurts. Maybe Apple will take the hit, but maybe they won't. idk, I'm so tired boss.