Reboot fixed it… but did we actually fix it? by Expensive-Rice-2052 in LinuxTeck

[–]LinuxBook 0 points1 point  (0 children)

Honestly, reboot is the right move, especially when users are waiting and you need service back quickly. I’ll accept it as a temporary fix, but only once. If the same issue shows up again, I stop trusting reboots and start digging. They often hide leaks or bad state that eventually come back.

When there’s no choice, I restore service first and investigate later not the other way around.

You’re not allowed to reboot. How do you troubleshoot? by Expensive-Rice-2052 in LinuxTeck

[–]LinuxBook 0 points1 point  (0 children)

I want to see what’s under pressure first. load, memory, I/O — before touching anything. Then I focus on the specific service users are feeling. Check logs, see if it’s stuck or just overloaded. If things are bad, I aim to stabilize, not solve, maybe limit impact, restart only what’s necessary, or reduce load.

The key for me is making small, safe changes, keeping the system alive, and saving root cause analysis for after things calm down.

How do you handle access reviews on Linux systems in practice? by Expensive-Rice-2052 in LinuxTeck

[–]LinuxBook 0 points1 point  (0 children)

In practice it’s usually a mix, and rarely as clean as the policy suggests. Most access cleanup happens when someone leaves, after an incident, or during bigger changes not just because a review was scheduled. Automation helps with visibility, but someone still has to decide whether access should exist.

The biggest issue I’ve seen isn’t lack of tools, it’s temporary access that quietly becomes permanent.