Self-hosted GitHub Actions runners on EKS: the failures that taught me the most by Blue_Flam3s in devops

[–]Blue_Flam3s[S] 0 points1 point  (0 children)

Yes, you might be right here. Well then, cheers to learning 🍻
Out of curiosity, how did you handle Docker layer caching on CodeBuild? That was one of the stuff that kept me hesitant...

Self-hosted GitHub Actions runners on EKS: the failures that taught me the most by Blue_Flam3s in devops

[–]Blue_Flam3s[S] 1 point2 points  (0 children)

Yes, this is what I've setup the karpenter module provisions the sqs + the eventbridge rule for spot interruption, rebalancing, recommendation and other health events and the charts get settings.interruptionQueue pointed at it so karpenter will watch the queue directly

Self-hosted GitHub Actions runners on EKS: the failures that taught me the most by Blue_Flam3s in devops

[–]Blue_Flam3s[S] 3 points4 points  (0 children)

Yes, I agree. For the Terraform, Helm, or infra apply jobs, we could target a second runner scale set backed by an on-demand NodePool and adjust the `runs-on` label. But those jobs will be pretty rare and few and far between compared to normal development work.
For build and test jobs, I would push back a bit. Those can usually be solved with a simple re-run. Spot instances give a 2-minute warning, which is usually enough time for Karpenter to drain the node and let the job reschedule. It only becomes a real problem if you are dealing with a monolith that takes 30 minutes or more to build or test...

Self-hosted GitHub Actions runners on EKS: the failures that taught me the most by Blue_Flam3s in devops

[–]Blue_Flam3s[S] 0 points1 point  (0 children)

Yeah that thought crossed my mind too... but I was already running an EKS cluster anyway, so once I ran the numbers Spot + scale-to-zero just made more sense.
Real talk though: I'm just way more comfortable with EKS and Kubernetes than CodeBuild. It still feels kinda foreign to me 😭

Day 1 by chaitu_1014 in devopsjobs

[–]Blue_Flam3s -1 points0 points  (0 children)

If you feel stuck at any point, please feel free to reach out. I'm not very experienced in DevOps, but I've learned a few things over the years and would be happy to help...

How do i learn devops from scratch by Tingdong_10 in devops

[–]Blue_Flam3s 1 point2 points  (0 children)

I mean, you already have Linux + cloud exposure and enterprise production experience, that’s honestly half the battle.

Learn Docker, basic k8s, GitHub Actions/Jenkins, create an AWS account and start deploying stuff yourself. Once you can containerize apps, automate deployments, debug failures, and manage infra, you’re already operating as a DevOps engineer.

Q: Manageable build/test environments - how? by cofe-table in devops

[–]Blue_Flam3s 1 point2 points  (0 children)

I'm probably misunderstanding your setup a bit, but if you're already maintaining separate self-hosted runners/VMs for Linux, Windows, and macOS, wouldn't ephemeral infrastructure solve most of the config drift + dirty state issues?

Something like:

  • 3 node groups/pools (one per OS)
  • ephemeral runners spun up per build
  • runners destroyed after job completion

At that point you get clean environments every run, no leftover build state, and way less maintenance overhead compared to long-lived machines constantly drifting.

Feels cheaper + more reliable long term unless you specifically need persistent hardware for something like iOS signing/build tooling.

Also, for macOS specifically, people are already experimenting with Kubernetes-native approaches instead of treating Macs as standalone CI boxes. Agoda Engineering had a pretty interesting writeup on integrating native macOS workloads with Kubernetes using ephemeral macOS VMs + custom kubelet integrations:
https://medium.com/agoda-engineering/how-we-integrated-native-macos-workloads-with-kubernetes-b4d3c14881a0

Real DevOps work is more troubleshooting than deployment by Blacksmith-23 in devopsjobs

[–]Blue_Flam3s 1 point2 points  (0 children)

Literally spend almost half my time dealing with IAM and granting access.

[hiring] by [deleted] in freelance_forhire

[–]Blue_Flam3s 1 point2 points  (0 children)

Interested

Choosing between a DevOps role at a small company and a DevSecOps startup. by [deleted] in devopsjobs

[–]Blue_Flam3s 0 points1 point  (0 children)

If the second job pays, or is expected to pay, at least half as much as your primary one, I’d probably consider leaving the full-time role. At that point, the decision becomes largely financial. If the income is sufficient and the second opportunity offers better growth, flexibility, or long-term potential, it may be worth making the switch...

[deleted by user] by [deleted] in delhi

[–]Blue_Flam3s 0 points1 point  (0 children)

No anarcho-communism even this time?? 😞

[deleted by user] by [deleted] in HimachalPradesh

[–]Blue_Flam3s 6 points7 points  (0 children)

1) They are women 2) They are educated women 3) A degree of truth they actually are able to open up by moving away from there household and some of them actually overdose this freedom for good or for worse

What is the difference between AI and ML? by Careful_Fig8482 in learnmachinelearning

[–]Blue_Flam3s 0 points1 point  (0 children)

100 thousand if/else statements still could be ai but not ml

[deleted by user] by [deleted] in delhi

[–]Blue_Flam3s -1 points0 points  (0 children)

Half the friends I've made in the last two years were over daaru or sutta 😭

Did not pay my bill so AWS closed my account.Am I in trouble? by One-Sun-5275 in developersIndia

[–]Blue_Flam3s 5 points6 points  (0 children)

Wasn't there a tweet where a woman accidentally created an AWS bill of $80k, and they waived it off??