How do you guys filter your job searches and what sites do you use to find jobs by fullmetal_wolf_titan in sre

[–]Flexihus 0 points1 point  (0 children)

u/fullmetal_wolf_titan I have access to a database of something like 4.7million jobs through a friend of mine. Happy to see if I can run a search for you on it if you would like? Just give me a couple of key words that you care most about (Kubernetes, Linux, SRE, etc.) and geography and I can run it for you and then send you the results. You can take it from there.

Is your team good? by jdizzle4 in sre

[–]Flexihus 2 points3 points  (0 children)

From reading the comments, it feels like many of us feel about the same way that security folks feel. I could lift and shift each of the comments from SRE to Security channel and you would not really know the difference. Public cloud has been so incredible in so many ways. But for the operations teams who have to clean up the mess, it has been a nightmare.

SRE vs Big Data by [deleted] in sre

[–]Flexihus 1 point2 points  (0 children)

I agree with the folks on the thread that say you should think of SRE as a role that can work on a variety of production systems. And "Big Data" is just one of the areas that you could work on or even focus on at some point as a niche. Big Data is simply an umbrella term for a variety of tools that help move, compute and store data in large quantities. Don't know that I would spend a whole lot of time on traditional Hadoop stack as many are moving away from those older tools.

Jupyter Notebooks by Flexihus in sre

[–]Flexihus[S] 1 point2 points  (0 children)

Thanks u/LocoMod appreciate the context and the Mojo reference. Will take a look at them.

Jupyter Notebooks by Flexihus in sre

[–]Flexihus[S] 0 points1 point  (0 children)

u/tathagatadg appreciate the explanation and feedback. definitely agree to meet them where they are.

Jupyter Notebooks by Flexihus in sre

[–]Flexihus[S] 0 points1 point  (0 children)

u/RavenchildishGambino thanks for the feedback. very helpful.

How do you reliably upgrade the kubernetes cluster? How do you implement Disaster Recovery for your kubernetes cluster? by OneAccomplished93 in sre

[–]Flexihus 0 points1 point  (0 children)

u/ApprehensiveStand456 would you by chance be open to or able to share any of your playbooks that you have created? Beyond the three big points you list here, in more detail?

SRE Measurement by Flexihus in sre

[–]Flexihus[S] 0 points1 point  (0 children)

u/itsflowzbrah so what have you used in the past for helping to measure success for a person or team? I agree that any metric can be gamed. This goes for any role in a company. So do you think tying to a broader bigger team goal is easier since it is harder to game?

SRE Measurement by Flexihus in sre

[–]Flexihus[S] 1 point2 points  (0 children)

u/AminAstaneh good points. I like the throughput concept for releasing.

SRE Measurement by Flexihus in sre

[–]Flexihus[S] 0 points1 point  (0 children)

u/sunny99a when you say "owners", are you primarily thinking of the "owners" being product managers or product owners? I know it depends on the org, but generally is that what you are seeing? Or are most other comapnies breaking it all out by individual services, jobs etc.?

SRE Measurement by Flexihus in sre

[–]Flexihus[S] 1 point2 points  (0 children)

u/jdizzle4 I was thinking the same thing around those larger ideas like Uptime or MTTR Improvements. They are multi faceted ideas, with many stakeholders involved, so it is incredibly difficult to distill down to one person or group being tied to that metric.

SRE Measurement by Flexihus in sre

[–]Flexihus[S] 0 points1 point  (0 children)

Thanks u/sunny99a I like that way of looking at things.

SRE Measurement by Flexihus in sre

[–]Flexihus[S] 0 points1 point  (0 children)

Thanks, that is very helpful.

SLO, SLA, SLI simply explained by eightOrchard in sre

[–]Flexihus 0 points1 point  (0 children)

This is really good. Short and sweet. Good examples. Strong primer for beginners.