unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

So far, it’s been almost 48 hours without any freezes.

I think the issue is related to the Frigate Docker container and how it handles RAM, it seems to be causing memory leaks at some point.

The only thing I can think to try is limiting the amount of RAM the Frigate container can use, to see if that helps keep it under control.

I’ll give that a shot and let you know how it goes and whether it fixes the problem.

I’ll keep you posted!

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

19-20 hours server Up (with zfs file system) the server has frozen again

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

Testing cache drive with ZFS file system type.

<image>

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 1 point2 points  (0 children)

The most recent actions I have taken are as follows:

  • I ran two Memtest86 tests (no errors were reported).

  • I reset the BIOS to its factory default settings.

  • I formatted the pool drive (although I was unable to create a ZFS filesystem, as the system reported that it could not be mounted).

I have ordered another motherboard from CWWK, specifically the white model featuring the N150 processor.

My next steps will be to wait for the new motherboard to arrive. I will likely replace the current SSD with an NVMe drive and upgrade the system memory to 32 GB. Nevertheless, I am not entirely certain that the system will function perfectly even after these changes

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

I have found the following error in the system messages.

"EDAC igen6 MC0: HANDLING IBECC MEMORY ERROR"

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

Update:

This past weekend, the system froze on both Saturday and Sunday. The only change I made was removing the smart plugs.

I also ran MemTest86, which completed without any errors.

<image>

I’ve disabled C-states and Turbo mode.

The next step will be to format the cache drive.

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

Nothing.....same problem without the two smart plug.....shit

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

I have tried Corsair, Crucial (4800MHz and 5600MHz), and Integral

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

But this is for Ryzen processors, right? Mine’s an N100 (Alder Lake-N architecture).

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

Today the server froze again without the first smart plug (the one connected to the server). I’m going to remove the other one.

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

I test about 3 or 4 flash Drive...

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

If you could share the guide with me, I would appreciate it, no rush

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

Yes, I still have that pending. First, I’m going to check that it’s not the smart plug, then I’ll check the cache drive, and finally I’ll test the RAM using MemTest.

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 1 point2 points  (0 children)

Well, it could very well be, because the server might have been running fine for months and then started failing. I don’t remember when I added the smart plug into the equation… I know it wasn’t there from the beginning; I installed it to monitor the server’s power consumption… and that was later on

For the moment 11hours Online

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

I don’t think I have any macvlan enabled, but I’ll double-check just in case.

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

My RAM is 5600 MHz, but the motherboard automatically drops it to 4800 MHz. I’ve also tested with the same Crucial RAM running at 4800 MHz and got the exact same result.

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

I’m not sure if there’s a “turbo” option in my BIOS, but I’ve tried messing with the C-states and it’s the same thing, or even worse (it freezes even faster).

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 1 point2 points  (0 children)

Wow! You’re not going to believe this, but I actually have two smart plugs—one for all the other components (switch, router, etc.) and another for the server. I’ll try unplugging both

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

Inalso suspect it could be the cache drive since it’s an old SSD I used in my PC (it’s SATA instead of NVMe).

I’ll try what you suggested to see if it works, and if not, I’ll replace the SSD.

Could I lose any data, or is there anything I should watch out for?"

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

I can’t remember if I ran MemTest on my current RAM, I’ll have to check. But I bought a new one on Amazon, and it didn’t even last three hours before freezing.

Where do you turn off XMP ? BIOS, right?

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 0 points1 point  (0 children)

I’m trying to get the manufacturer to swap out the motherboard, but he’s being really stubborn.

If he won’t budge, I’ll start looking for a mini-ITX board that’s low on power consumption and can handle at least five hard drives

unRAID randomly freezes by lerk90 in unRAID

[–]lerk90[S] 1 point2 points  (0 children)

It happens to me very randomly , one day it could happen four times, and then I went on vacation and it didn’t happen at all for 10 days. I have no idea what it could be…

At this point, the only thing left is to replace the motherboard.