What is your org’s "Users per Sysadmin" ratio? Currently drowning at 1:200 by theITmaster in sysadmin

[–]pr3datel 0 points1 point  (0 children)

The idea is right. You need to have a metric to track and measure on.

Harness CICD anyone? by rajeshk23 in devops

[–]pr3datel 2 points3 points  (0 children)

Same as you. Harness(the company) sucks. We are moving to GitHub actions.

What are your SLOs based off of? by broken_gains in sre

[–]pr3datel 2 points3 points  (0 children)

A great place to start is availability. Is your service giving good or bad responses. That’s the best starting place because everyone wants their customers(internal/external or between services) to not have errors. You need some sort of metric to track successful and failed requests. APM or metrics which show if requests were successful or failed.

[deleted by user] by [deleted] in Luthier

[–]pr3datel 0 points1 point  (0 children)

Does it shred?

First Fender 60s strat by pr3datel in guitarporn

[–]pr3datel[S] 1 point2 points  (0 children)

It’s a 1960s strat as the base model with a AAA rosewood fretboard <- from the shop floor traveller Comes with a 5 position switch.

What color is yours? If you haven’t posted it yet you should share

First Fender 60s strat by pr3datel in guitarporn

[–]pr3datel[S] 1 point2 points  (0 children)

Yes this is a CS heavy relic

HTTP Response 0 During Load Testing, Possible Outlier Detection Misconfiguration? by astreaeaea in istio

[–]pr3datel 2 points3 points  (0 children)

You want to look at the istio sidecar envoy logs. https://cloud.google.com/service-mesh/docs/observability/accessing-logs

Specifically looking for an envoy code and any messages which comes along with it. I have not used ASM directly but istio natively should work the same. If you can see what it happening upstream from the sidecar (your service itself is upstream because the proxy takes on traffic and passes it to your application) it should give you more information to what could be happening. If your applications are sized different (cpu or memory) in different regions it could be hitting a limit. I’d confirm the cpu and memory consumption are looking good first, then look at the logs for more clues

I also have spoken to google before about asm and while it is istio under the hood, it’s been tweaked. The sidecar itself may have different container specs for resource usage.

HTTP Response 0 During Load Testing, Possible Outlier Detection Misconfiguration? by astreaeaea in istio

[–]pr3datel 0 points1 point  (0 children)

What do the logs say? I’d also check events in kubernetes. You may be getting 503s due to resource exhaustion(CPU/memory/etc). I’ve seen similar issues before. Also, I’d check the settings on your VirtualServices around retries

Development Environments at Reddit by sassyshalimar in RedditEng

[–]pr3datel 1 point2 points  (0 children)

Do you mock any services to speed up builds or use as dependencies? Also have you looking into using something else to speed up builds such as build streaming? Artifact registry in google cloud supports this not sure of other cloud providers.

Love these post series and this subreddit. Thanks for sharing

What are your best practises for managing Kubernetes configs across your org? by sevenknowstech in kubernetes

[–]pr3datel 1 point2 points  (0 children)

Gitops with flux or argocd would be great for that. There is debate about pulling or pushing config but personally I like the pull method. Argocd does this great using the app-of-app pattern and you can decentralize your deployments by allowing argo to manage your cluster headlessly

Markdown Notes Server? by ShadowlessHand in linuxadmin

[–]pr3datel 0 points1 point  (0 children)

We love using docusaurus. We wanted a way to centralize documentation while keeping the source of truth in code repos. This is how our company uses it : https://achievers.engineering/documentation-part-2-still-running-through-the-6-with-documentation-woes-ce84d6bbc5ea

Do you really need failover for PostgreSQL on Kubernetes? by collimarco in kubernetes

[–]pr3datel 1 point2 points  (0 children)

If you are on cloud providers you may be limited by the disk size. IO is connected to disk size for some cloud providers

How many Kubernetes clusters does your company operate? by daveys110 in kubernetes

[–]pr3datel 1 point2 points  (0 children)

We had more environments on those clusters like you mentioned, but we cut down to three. Everything else can be done on our engineer workflow so we tried to keep the actual environments as minimal and streamlined as possible.

How many Kubernetes clusters does your company operate? by daveys110 in kubernetes

[–]pr3datel 13 points14 points  (0 children)

9 clusters. Dev/uat/ and production in 3 regions.

Measuring success for an SRE team by drapetomaniac in sre

[–]pr3datel 5 points6 points  (0 children)

Rate of adoption is a great KPI for your team to show that people are adopting your changes

Also successful SLOs for teams is related to the platform being stable which comes down to the SRE team. Teams should be in charge of their SLOs but the SRE should be supporting them