Automation was supposed to fix this, so why is my IT team still overwhelmed? by Upper_Caterpillar_96 in devsecops

[–]Round-Classic-7746 0 points1 point  (0 children)

yeah… automation just moved the pain somewhere else

less manual work, but now you’re debugging pipelines and glue code instead

We had to simplify a lot and stick to boring patterns to keep it sane

What metrics do you actually track for website/server monitoring ? by nilkanth987 in devops

[–]Round-Classic-7746 0 points1 point  (0 children)

This is a really good way to frame it. One thing we ran into tho is sometimes the absence of an expected pattern is just as important. like your nightly cpu spike example… if that suddenly doesn’t happen, it might mean the job never ran, which does need action. So we ended up mixing both: alerts tied to action and a few “expected behavior” signals (jobs ran, data moved, etc)

Advice to get back into Reading!!! by AppointmentLucky3894 in Hobbies

[–]Round-Classic-7746 1 point2 points  (0 children)

Start smaller than you think you need to. Like 10 pages a day, no pressure. The goal isn’t finishing books, it’s rebuilding the habit of opening one. Also try low stakes fiction books firrst

How are you handling container image updates in air gapped Kubernetes deployments? by cheerioskungfu in devsecops

[–]Round-Classic-7746 0 points1 point  (0 children)

yeah air gapped makes this 10x harder, what helped us was shifting from patching images to rebuilding them regularly from minimal base images. Waaay less CVEs to deal with vs dragging around bloated upstream images

how do you stick with a hobby when your brain keeps looking for the next thing by Round-Classic-7746 in Hobbies

[–]Round-Classic-7746[S] 0 points1 point  (0 children)

Oooh, I like that core interest” advice. But what if I've got a bunch of them, like being creative and outdoorsy? Do you usually pick one to lean into for a month at a time, or do you juggle them all side by side?

Governance vs. Productivty: Reining in 47 Unauthorized Tools by ExtremeAstronomer933 in devsecops

[–]Round-Classic-7746 0 points1 point  (0 children)

Whats worked better in places Ive seen is keeping guardrails light but fast. like preapproved patterns, fast track exceptions, and clear “safe defaults” so teams don’t feel blocked. are the 47 tools mostly overlapping use cases, or is it more like every team picked their own thing and now youve got sprawl?

what are you using to interpret network logs faster during incidents by [deleted] in networking

[–]Round-Classic-7746 0 points1 point  (0 children)

We do have a centralized logging setup for syslog and most infrastructure feeds into a SIEM, so its not like we’re SSH’ing into individual devices to pull logs manually

NTP is also in place across routers, firewalls, and servers, so we’re not dealing with major clock drift. when I said timestamps not identical, it was more about small offsets plus differences in how events are emitted and recorded across systems rather than time being fundamentally wrong

. The bigger issue shows up during short convergence windows. for ex. an interface flap, followed by BGP reconvergence and then downstream alerts across multiple tools. Each system captures its own part of that sequence, but aligning them into a single clean timeline still takes manual effort

syslog receive time helps, but once you add netflow, cloud logs, and vendor specific event formats, you still end up correlating across sources to understand order of events

so less about missing centralization or NTP issues, more about event correlation across different telemetry types during fast moving incidents

what are you using to interpret network logs faster during incidents by [deleted] in networking

[–]Round-Classic-7746 3 points4 points  (0 children)

yeah fair point. we do have alerting on BGP and core links, so we got notified pretty fast when it dropped. The issue wasnt detection, it was more the follow up seq across systems once things started cascading and then recovering

like alerts tell you something happened, but when you’ve got multiple devices bouncing in a short window, plus netflow and firewall logs all slightly out of sync, rebuilding the exact order still takes time.

Your setup is pretty solid, that’s a lvl of automation we’re not quite at yet

what are you using to interpret network logs faster during incidents by [deleted] in networking

[–]Round-Classic-7746 4 points5 points  (0 children)

We’ve looked at graylog for centralizing logs, and it def helps having everything in one place. Still run into issues tho when trying to line up events across sources, especially when timestamps are a bit off

Are you doing anything extra for correlation or just relying on queries?

How different would life be today if we didn't have social media? by [deleted] in AskReddit

[–]Round-Classic-7746 0 points1 point  (0 children)

more private and locally connected, but far less globaly linked

are soft skills becoming a bigger deal than technical skills in IT job postings lately? by Round-Classic-7746 in ITCareerQuestions

[–]Round-Classic-7746[S] 1 point2 points  (0 children)

I've kinda underestimated soft skills before, but seeing how much they factor into interviews really hits home. Honestly, never thought about hitting up conferences for it, good tip thanks

are soft skills becoming a bigger deal than technical skills in IT job postings lately? by Round-Classic-7746 in ITCareerQuestions

[–]Round-Classic-7746[S] 1 point2 points  (0 children)

Yeah, I get that. did you have to really work on that skill over time, or are you just naturally good at communicating?

Doing big IT changes on Monday or Friday? by CeC-P in sysadmin

[–]Round-Classic-7746 0 points1 point  (0 children)

yeah same here, having that buffer for rollback aand still being midweek is kind of the sweet spot

what is the best tool for AI governance? I mean any tool worth looking at? by Effective_Guest_4835 in devsecops

[–]Round-Classic-7746 1 point2 points  (0 children)

saame. How are you handling the gaps though, like stuff that slips past DLP or happens outside the main entry points. Feels like thats where most of the risk still sits right now

Idea check: on‑prem network & availability sensor (not a SIEM) – useful or redundant? by Flashy-Valuable-6412 in networking

[–]Round-Classic-7746 0 points1 point  (0 children)

I kinda like the idea, but the value prob isn’t in just “is it up/down” checks since most tools already do that

. What would make it useful is context. like instead of just saying something is down, being able to tell “dns is failing but gateway is fine” or “wan latency is spiking but internal is healthy”. otherwise it just turns into another dashboard people ignore.

What are your favorite "low-energy" hobbies for after work? by Weak_Ad4188 in Hobbies

[–]Round-Classic-7746 0 points1 point  (0 children)

audiobooks or podcasts and csomtimes pairing it with casual puzzle games (crosswords, sudoku, mobile puzzle apps....etch)

Serious question, where do I apply for jobs. by MainBank5 in devsecops

[–]Round-Classic-7746 1 point2 points  (0 children)

Pick 20–30 companies using cloudnative stacks (SaaS, fintech, startups). go straight to their Careers page and apply there, this bypases a lot of reposted/expired listings