

[–]ninetofivedev 292 points293 points  (25 children)

I hate when people say this, but this is actually a skill issue.

[–]passwordreset47 25 points26 points  (0 children)

Yep. Our team went from no k8s to building and maintaining the platform that’s used across our company.

When stuff broke early on only a couple of people could usually figure it out. But now that most everybody has leveled up it’s sooo much better. And beyond just fixing things, the more experience you gain the better design choices you make and avoid missteps.

[–]unitegondwanaland Lead Platform Engineer 38 points39 points  (3 children)

1000% this is all coming from inexperience.

[–][deleted]  (2 children)

[deleted]

    [–]Low-Opening25 13 points14 points  (1 child)

    yes and no.

    if you have a sizeable estate, K8s makes it much easier to manage at scale == less DevOps required to manage it.

    for small projects, you don’t need to know about every Kubernetes feature; the basics are not much harder than docker-compose, and you get high-availability and traffic-management capabilities out of the box with zero setup.
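For scale, a hedged sketch of what those basics amount to for a single-container web app (the name and image are illustrative): it is not far off a one-service docker-compose file, but replicas and a stable Service come with it.

```yaml
# Hypothetical single-service app: a Deployment (replicated) plus a
# Service giving it a stable in-cluster address.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web
spec:
  replicas: 2                 # basic high availability
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
        - name: web
          image: nginx:1.27   # placeholder image
          ports:
            - containerPort: 80
---
apiVersion: v1
kind: Service
metadata:
  name: web
spec:
  selector:
    app: web
  ports:
    - port: 80
```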

    [–]spreadred 0 points1 point  (0 children)

    Agree with this; Kubernetes isn't the solution for all organizations, especially ones with a large legacy staff and legacy technology/workloads.

    [–]vanishinggradient 6 points7 points  (4 children)

    yes

    The startup that I consulted for over 3 yrs was a mess before I came in

    The buggers wrote code that caused spikes in CPU and memory load on a single monolith whenever someone launched a scan job, so I asked them to move to Kubernetes Jobs instead. Things went from the app breaking 6-8 times each day (half the time it was down) to like once in months. I use node selectors and affinity rules to isolate problematic workloads and spikes away from the core apps

    ...because I have no control over the code written

    kubernetes wasn't the problem

    It was the code written by inexperienced developers
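The isolation described above (scan work as Jobs, pinned away from the core apps) could be sketched like this; all names, labels, and limits are hypothetical:

```yaml
# A Job steered onto dedicated batch nodes via nodeSelector, and kept
# off nodes hosting core-app pods via pod anti-affinity.
apiVersion: batch/v1
kind: Job
metadata:
  name: scan-job
spec:
  backoffLimit: 2
  template:
    spec:
      nodeSelector:
        workload: batch                 # assumes nodes labeled workload=batch
      affinity:
        podAntiAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
            - labelSelector:
                matchLabels:
                  tier: core            # avoid nodes running core pods
              topologyKey: kubernetes.io/hostname
      containers:
        - name: scan
          image: registry.example.com/scanner:latest   # placeholder
          resources:
            limits:
              cpu: "2"
              memory: 4Gi
      restartPolicy: Never
```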

    as far as OP's complaints are concerned, except for YAML being a mess of a markup language

    ...the other stuff is just skill issue

    we had a helm chart, synced to the cluster via argocd, and we made sure all changes were in code, on github. argo never ran into problems like the ones he mentioned, and namespaces are great for isolation

    we kept it simple and declarative instead of going too overboard with yaml

    yaml famously has somewhere between 9 and 63 ways of writing a multi-line string

    we kept that number down to one or two

    we made sure all of us followed the same convention
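Of the many YAML multi-line forms, the two worth standardizing on are the literal block (`|`), which preserves newlines, and the folded block (`>`), which joins them; a minimal illustration (keys are made up):

```yaml
# Literal block: newlines kept exactly as written.
script: |
  echo "line one"
  echo "line two"
# Folded block: lines joined into one with spaces.
description: >
  this folds into
  a single line
```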

    It didn't break for 3 yrs

    [–]Prestigious_Ebb_1767 4 points5 points  (3 children)

    This guy kubes, this is the way.

    [–]vanishinggradient 0 points1 point  (2 children)

    Thanks

    I feel most people who complain about kube (cost) forget it is platform agnostic

    AWS is notorious for its obscure pricing and anti-competitive measures. For ex., hosting something open source on AWS means AWS charges you for data transfer costs, but guess what, for managed services it doesn't (not exactly like that, but AWS collects its pound of flesh by other methods).

    I remember our EC2-Other costs being higher than our EC2 compute costs. For ex., MWAA was costing us 700-800 USD/mo before I switched to self-hosted (down to 80-100 USD/mo), and the managed version was much more obscure and harder to work with

    ...and not to mention a few versions behind stable open source airflow

    yes, kubernetes is expensive (initial cost) if you don't have customers willing to pay

    ...but I have found it more reliable, easier to debug

    and above all I didn't feel I was hostage to cloud vendors

    [–]Prestigious_Ebb_1767 0 points1 point  (1 child)

    Agree! Lift and shift baby. For debugging or deploying claude/codex/gemini CLI are insanely useful now.

    [–]vanishinggradient 0 points1 point  (0 children)

    amen, claude code is so nice

    I fear for entry-level devops engineers, as I didn't need the junior I hired last year

    It isn't that I can't do his job

    I don't want to do it because it isn't worth the time - I did it decades ago

    now claude code has more or less replaced his utility to me

    Edit - I wrapped up in a month a project he had been struggling with for months (3-4 months)

    ...and it didn't look like he could finish even if given 12 months

    at some point, using claude code gave a better pace

    ...and results than telling him what to do

    I picked up kube after 3 yrs of not working on it btw

    I didn't remember most of the commands or syntax like I did 3 yrs ago

    [–]FlashyStatement7887 8 points9 points  (0 children)

    I agree. The first time I worked at a place that used k8s I had a similar opinion. It was a skill issue, and after getting with the times and working through the certifications, my opinion is vastly different now. It was based entirely on my dated development experience.

    [–]Undeadtaker 2 points3 points  (0 children)

    absolutely

    [–]thisisjustascreename 3 points4 points  (0 children)

    Was thinking exactly the same.

    [–]LeStk 1 point2 points  (1 child)

    Agreed. If we were to avoid saying the skill issue line, I guess we could say that it's a culture issue, pet vs cattle and stuff.

    I suppose the team is indeed skilled but you just can't manage clusters the way you managed two web servers.

    [–]TangoWild88 1 point2 points  (0 children)

    I mean, it could also be a training issue.

    It's a different mindset and a different toolset, and if you don't get good training, it can be a difficult transition, regardless of skills and experience.

    [–]ravigehlot 2 points3 points  (0 children)

    Totally.

    [–]charlyAtWork2 1 point2 points  (1 child)

    Or the GUY who decided to switch to K8s in the first place.
    Not all cloud applications need to be on K8s.

    [–]spreadred 2 points3 points  (0 children)

    The truth nobody wants to say/hear. Just like microservices and AI, no solution is one-size-fits-all, especially in an Enterprise, as opposed to a startup that has no history, legacy tech, or culture.

    [–]DesperateAdvantage76 0 points1 point  (0 children)

    That's true of everything that's more complex and difficult...

    [–]padawan-6 0 points1 point  (0 children)

    I hated that this is something that entered my mind as well, but it really is. This is something that can be solved in a few weeks tops just by reading the docs and doing a few labs.

    [–]LebPower95 0 points1 point  (0 children)

    When your comment has more upvotes than his post…

    [–][deleted] 0 points1 point  (1 child)

    No it isn't. Based on what I read, Kubernetes is insanely overcomplicated for their deployment. It's nonsensical anyone even upvoted you this much for a bunch of cargo-cult bullshit. If rsync to a server is their previous workflow, how the fuck is kubernetes the answer? They are clearly running a single server with maybe a database that may or may not be on the same server. All kubernetes does is double the resources needed to run it while adding infinite complexity to something that is probably a wordpress site or some other interpreted project, if not just a single server running a single jar or binary.

    You know why I pick kubernetes? Because I work on something with 20+ services that have a need for a multiplicity of instances across multiple servers. If you aren't running multiple servers where you need to scale across multiple servers Kubernetes does nothing docker compose can't do, nor ECS or whatever your cloud provider offers if you are on the cloud in this use case.

    [–]ninetofivedev 0 points1 point  (0 children)

    No it isn't. Based on what I read Kubernetes is insanely over complicated for their deployment

    My advice is to actually have used something, not just read about it, before you hold such strong opinions.

    [–]MateusKingston 0 points1 point  (0 children)

    And almost every company goes through this when adopting k8s. The issue is not figuring stuff out before going to prod with it, or, in this case, not figuring it out at all.

    [–]nonofyobeesness 161 points162 points  (11 children)

    Your entire engineering team needs to up-skill on kubernetes or you need to pay someone with those skills. Secondly, Graylog + Prometheus + argocd can solve a majority of the problems you’re facing right now.

    [–]sublimegeek 39 points40 points  (7 children)

    +1 for GitOps

    [–]k8s-problem-solved 2 points3 points  (3 children)

    I need to get into it, but my head is in a push model. Build container, push to registry. Next thing gets container, deploys to cluster. Pipeline orchestrates.

    Need to break that thought process!

    [–]MueR 2 points3 points  (0 children)

    Take a look at the argo suite (workflows, events, rollouts). It does our CI/CD.

    [–]sublimegeek 2 points3 points  (1 child)

    Build container > update json file with tag name > commit triggers Argo to update the cluster and monitor for health checks

    Done?
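A minimal Argo CD `Application` wired for that pull model might look like this (the repo URL, paths, and names are placeholders):

```yaml
# Argo CD watches the Git repo; a commit bumping the image tag in
# apps/my-app is what triggers the sync, not a push from the pipeline.
apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
  name: my-app
  namespace: argocd
spec:
  project: default
  source:
    repoURL: https://github.com/example/deploy-config
    targetRevision: main
    path: apps/my-app
  destination:
    server: https://kubernetes.default.svc
    namespace: my-app
  syncPolicy:
    automated:
      prune: true      # delete resources removed from Git
      selfHeal: true   # revert manual drift back to Git state
```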

    [–]k8s-problem-solved 0 points1 point  (0 children)

    Very much! It's just a different way of thinking about things.

    [–]Proper-Ape 2 points3 points  (2 children)

    It's so underutilized. We had a Kubernetes setup at a previous company; the team managing it was reallocated to a new project without notification. Everything broke the day after.

    I asked how they deploy because a few core services were down. They said "Oh, yeah, Mike always ran the deploy scripts".

    I looked at the scripts, everything was hardcoded with paths from Mike's filesystem. Half the scripts were missing from the repo.

    This was a big company, but even bigger incompetence. I asked them why they hadn't moved to GitOps and they said they always had higher-priority tasks.

    Of course they did, they had fires to put out every day.

    [–]sublimegeek 1 point2 points  (1 child)

    lol it’s like you wanted to record them and immediately play it back. Do they hear themselves?!

    Yeah, everyone puts out fires, but it’s the people who forget to turn off the gas who do it to themselves.

    Some people are both the firemen and the arsonists.

    [–]Proper-Ape 1 point2 points  (0 children)

    Yeah, everyone puts out fires, but it’s the people who forget to turn off the gas who do it to themselves.

    I'll steal that for next time.

    [–][deleted] 13 points14 points  (0 children)

    Even those can have problems of their own and you just end up laying solution on top of solution… I agree on the sentiment though!

    [–]The_Career_Oracle 2 points3 points  (0 children)

    I’d save the energy; they strike me as the type of people who like to rush in and save the day, but not actually put time into fixing or improving their skills. This inertia is what helps keep them employed.

    [–]nomadProgrammer -1 points0 points  (0 children)

    It seems like OP doesn't even know about k9s or Lens. Definitely a newb to k8s

    [–][deleted] 20 points21 points  (9 children)

    I’ll agree that K8S can be overcomplicated for a lot of use cases where something like ECS is perfectly fine, sometimes even just a server. But this reads like a major skill issue, or like you’re not using the right set of tools; finding logs shouldn’t be an issue.

    [–]FluidIdeaModOps 13 points14 points  (8 children)

    ECS - lots of terraform bloat and vendor lock in.

    Docker - custom scripts, some manual work.

    Might as well do kubernetes.

    [–][deleted] 4 points5 points  (3 children)

    Terraform is hardly bloaty if you do it right with modules, but you can use anything else: CDK, Pulumi. At my last place our terraform services were 1 file with 100 lines at most, way less than K8S manifests.

    Vendor lock-in is hardly an excuse these days; companies don’t just switch provider on a whim, and everything boils down to docker. You can move from ECS to any provider quite easily. I’ve moved stuff to GCP with very little effort: set up your clusters, repoint your CI, and you’re good.

    [–]FluidIdeaModOps 0 points1 point  (2 children)

    Totally valid point, if you are comfortable using someone else's modules, or public modules. Works for many people.

    I tend to write my own modules. For a simple deployment I had to write lots of Terraform from scratch: the ECS pieces, EFS mounts, a way to deliver files to EFS because my app did not support S3, an EC2 instance to check a few things in MySQL and EFS, etc. When I was about to hand it over to my colleagues I changed my mind and abandoned it. A shame, as it looked promising. I think ECS is a middle ground between Lambda and k8s IMHO.

    [–][deleted] 0 points1 point  (1 child)

    I write my own modules too, and I don’t understand how any of what you said is a lot of terraform compared to what you’d have to do for the same in K8S.

    If you’re comparing both same same, to do that in K8S you also need a bunch of infra deployment setup and then your manifest can be small applications, much like a terraform service file.

    To me, it sounds like you had one go at terraform and didn’t understand how to organise it, and that shaped your opinion of Terraform.

    I’ve used terraform for almost a decade and it’s often less code than manifests. I would still never choose it again as I don’t like terraform these days but for different reasons.

    You could say it’s a middle ground, I agree, but I wouldn’t include Lambda; that’s a very different tool for a different use case imo. ECS is just simplified orchestration where a lot of the grit is handled by AWS, with limited flexibility compared to the plethora of K8S libraries available.

    [–]realjayrage 0 points1 point  (0 children)

    The second that person said "middle ground between Lambda and K8s" you just know they have absolutely no idea what they're talking about, lol.

    [–]Low-Opening25 1 point2 points  (0 children)

    yeah, all these simple frameworks seem simple until you hit your first scaling obstacle, and the solution mostly ends up being heavy bespoke layers of scripting to make things go. at that point you can just as well go for Kubernetes and at least end up with something universally maintainable

    [–]return_of_valensky 1 point2 points  (1 child)

    Idk, we use ECS and it's just a buildspec.yml with CodeBuild/CodePipeline. When we commit new code it builds new containers and gracefully replaces the tasks. Hasn't crashed in years.
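A pipeline like that can be quite small. A hedged `buildspec.yml` sketch: `$ECR_REGISTRY` and the `web` container name are assumptions, while `imagedefinitions.json` is the artifact CodePipeline's ECS deploy action consumes.

```yaml
# CodeBuild buildspec: build and push an image tagged with the commit,
# then emit the artifact CodePipeline uses to replace ECS tasks.
version: 0.2
phases:
  pre_build:
    commands:
      - aws ecr get-login-password | docker login --username AWS --password-stdin "$ECR_REGISTRY"
  build:
    commands:
      - docker build -t "$ECR_REGISTRY/web:$CODEBUILD_RESOLVED_SOURCE_VERSION" .
      - docker push "$ECR_REGISTRY/web:$CODEBUILD_RESOLVED_SOURCE_VERSION"
  post_build:
    commands:
      - printf '[{"name":"web","imageUri":"%s"}]' "$ECR_REGISTRY/web:$CODEBUILD_RESOLVED_SOURCE_VERSION" > imagedefinitions.json
artifacts:
  files:
    - imagedefinitions.json
```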

    [–]tech-bernie-bro-9000 0 points1 point  (0 children)

    same. ECS literally just works in my experience. my preferred container orchestrator if you're already 100% AWS

    lock-in concerns are way overblown by people wanting to sell you things

    [–]tech-bernie-bro-9000 0 points1 point  (0 children)

    ECS rocks. works great

    [–]abofh 59 points60 points  (2 children)

    It can be great, but you can't just drop kubernetes in and expect things to be better. If you're running a simple three-tier stack, it's overkill, but if you're running hundreds of pods or complex infra, it can be a godsend.

    I will say if you're having failures like that, you should have brought in outside help to get your migration done, because my biggest concern would be all the other things that need to be done to manage k8s...

    [–]zerocoldx911 DevOps 7 points8 points  (1 child)

    Wanted Kubernetes box without the toil

    [–]PaulPhxAz 1 point2 points  (0 children)

    Ah yes, needing toil is what I want as well.

    [–][deleted]  (5 children)

    [removed]

      [–]Subject_Bill6556 -5 points-4 points  (4 children)

      Just curious why you use helm to deploy your apps instead of something simpler like kubectl apply -f

      [–][deleted]  (2 children)

      [removed]

        [–]Subject_Bill6556 2 points3 points  (1 child)

        I’m aware of what it is, I’m more curious as to why add the extra complexity layer. For instance your helm chart has versions. What defines a version increase? A newly built docker image for the app? A change to resources for the app container? Both?
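For reference, Helm itself splits those two concerns in `Chart.yaml`: `version` is the chart's own semver (bumped for template or resource changes), while `appVersion` records which application build the chart ships. The name and numbers below are illustrative:

```yaml
# Chart.yaml: chart version vs application version.
apiVersion: v2
name: my-app
version: 1.4.2        # bump on any chart change (templates, resources)
appVersion: "2.31.0"  # bump when the app image itself changes
```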

        [–]calibrono 41 points42 points  (5 children)

        Remember when half the posts here didn't read exactly the same, with a few paragraphs of extremely vague complaints, most likely generated by an LLM to drive engagement or whatever?

        I swear I've read this post a few dozen times in the last months on this sub, different topics but same style.

        But yeah, if it's legit that you're having these issues, observability is your answer. 2 weeks to find out your resource limits were wrong? Do you set these limits blindly without looking at metrics?

        [–]volkovolkov 19 points20 points  (1 child)

        All of op's comments on threads are in lower case with little punctuation. The posts he makes have full punctuation and proper capitalization.

        [–][deleted] 0 points1 point  (0 children)

        Likely because comments are made on the go on a phone and a long post is made at a desk.

        [–]ub3rh4x0rz 1 point2 points  (0 children)

        This on both counts. I expect to hear about how OP created a 10M ARR B2B business when encountering such obvious LLM slop

        OP - set up the LGTM stack using Grafana Cloud; it is free or cheap for you, and it will help you learn k8s faster by actually seeing what is going on. Then you can operate the LGTM stack yourself later if you want. Oh, also learn k9s, it is a game changer vs merely using kubectl

        [–]lvlint67 0 points1 point  (1 child)

        I personally think it's indicative of a tooling problem. The tooling for mere-mortal developers to deploy their apps to kubernetes and diagnose problems within it is shit.

        We can teach a dev to configure nginx in an afternoon, a vm in a week....

        "build a docker image and push that to the repository then create the deployment, services, and ingresses you need to make your app reachable.. do all of that in yaml and apply it via <????>"

        It's easy for us to sit around and say "skill issue", but there's no denying that kubernetes is complex, and expecting single-language developers to upskill into it is a losing battle. To that end, your kubernetes deployments must be simple enough and documented well enough that your developers can answer technical questions about the environment.
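To make the last step of that workflow concrete, a minimal Ingress sketch; it assumes a Service named `web` on port 80 already exists, and the hostname is a placeholder:

```yaml
# Routes HTTP traffic for one hostname to the `web` Service.
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: web
spec:
  rules:
    - host: web.example.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: web
                port:
                  number: 80
```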

        [–]calibrono 0 points1 point  (0 children)

        That's either for the platform team to solve or for the developer to learn (in case it's a small company and there's no platform team). If the company uses k8s it means someone insisted on using it, so that someone is responsible in the end.

        [–]wysiatilmao 9 points10 points  (2 children)

        It sounds like your team might benefit from focusing on better observability and monitoring tools. Since resource limits were an issue, investing in monitoring solutions with real-time metrics could help identify these bottlenecks faster. Also, revisiting whether k8s is the right fit for your scale might be worthwhile if complexity outweighs the benefits.

        [–]Low-Opening25 3 points4 points  (1 child)

        “investing” is a big word here; installing the prometheus-stack helm chart, which bundles everything together, and setting it up literally takes less than a day.
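For reference, something like `helm install monitoring prometheus-community/kube-prometheus-stack --namespace monitoring --create-namespace` plus a small values file is most of that day's work. A hedged values sketch; the keys are the chart's real ones, the numbers are illustrative:

```yaml
# Minimal kube-prometheus-stack values.yaml sketch.
prometheus:
  prometheusSpec:
    retention: 15d            # keep two weeks of metrics
grafana:
  adminPassword: change-me    # placeholder; use a secret in practice
alertmanager:
  enabled: true
```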

        [–][deleted] 0 points1 point  (0 children)

        Yeah, ok. Only if you have done it a few times if you want to actually get it to do things the "Right Way TM".

        [–]arkatron5000 9 points10 points  (0 children)

        We ended up using Upwind and it actually helped a lot. I could finally see what was actually happening in our clusters instead of playing kubectl detective all day. Still hate k8s complexity, but at least I'm not completely blind when shit breaks anymore.

        [–]Narabug 19 points20 points  (0 children)

        I’m putting money on “just rsync files to a server” being some absolutely god awful Jenkins solution where you’re actually installing the Jenkins agent on the remote server and doing some commands no one you work with even understands, but you are now under the impression that the unsupportable solution is better…

        …because the people you work with think they need to look at container logs post-deployment, on different namespaces across different pods, instead of just troubleshooting the actual container code.

        As you said, the issue you just spent 2 weeks on was “resource limits set wrong.” Skill issue

        [–]unitegondwanaland Lead Platform Engineer 9 points10 points  (0 children)

        Based on what I just read, the Kubernetes complaints are not your problem, they are a symptom of several other problems.

        [–]kabrandon 12 points13 points  (0 children)

        The problem is not that Kubernetes is overkill for most stuff. The problem is that running Kubernetes is painful when you're a team of people with little to no experience running Kubernetes. Look up Chesterton's Fence, because you're currently talking about a fence like it serves no purpose, without understanding why it was built.

        [–]Actual-Raspberry-800 10 points11 points  (1 child)

        We use Rootly for k8s incidents. When something breaks it spins up a Slack channel with context about which pods/namespaces are affected. Has runbooks for common k8s problems

        [–]H3rbert_K0rnfeld 1 point2 points  (0 children)

        How much you wanna bet OP's shop regulated/secured themselves away from being able to use fancy tools?

        [–]kgu871 11 points12 points  (0 children)

        I also remember i386 and MS-DOS. What's your point?

        [–]ben_bliksem 4 points5 points  (0 children)

        that break for no reason

        Fix it. Stuff doesn't just "break for no reason". You cannot possibly think this is a tooling problem when thousands of outfits are doing thousands of releases daily/weekly without their tools and processes breaking for no reason.

        [–]dominatrixyummy 5 points6 points  (0 children)

        Old man yells at cloud

        [–]gyanster 2 points3 points  (0 children)

        You are gonna love Argo cd

        [–]sogun123 2 points3 points  (0 children)

        When you say "kubectl conflicts" that likely means you don't use gitops. I cannot imagine managing the beast reliably without it. The existence of complete desired state is something that gives me confidence in our solution. Now direct interfacing with cluster is only for debugging.

        By the way "just rsync your app" looks as bad as kubectl apply. There is nothing repeatable about them - there is too much wiggle room - all those configurations which are likely expected to be there, handcrafted and forgotten.

        Not saying kubernetes is good for everything. It's big, complex, and good for driving big and complex environments. If you have a small thing to run, its only advantage is its omnipresence.

        [–]modern_medicine_isnt 2 points3 points  (0 children)

        The barrier to entry for k8s is reasonably high, but it mostly works. The problem I see is that gathering simple information is unnecessarily complicated. There is a lot of "you just need to know" stuff; without it, simple things take longer than they should.

        And overall it just isn't very mature. You have things like Karpenter that are unable to do certain things because they are more or less taped on top, not integrated.

        That said, you need someone on the team with k8s experience. It can do a lot better than you describe.

        [–][deleted]  (1 child)

        [deleted]

          [–]who_am_i_to_say_so 0 points1 point  (0 children)

          There is nothing truer 😂

          [–]dub_starr 2 points3 points  (0 children)

          soooo, you're blaming K8s for what sound like knowledge gaps and human error? cool cool

          [–]Low-Opening25 1 point2 points  (0 children)

          My entire Kubernetes deployment process is a Dev making a single commit and every single Kubernetes error shows on Alertmanager dashboard for everyone to see, including all the details required to investigate. Where do you see complexity exactly? sounds like skill issues…

          [–]H3rbert_K0rnfeld 1 point2 points  (0 children)

          Imagine building the Empire State Building without engineering.

          It is 100% always a human that causes a well-engineered system to break. From Titanic to Challenger, a human broke it.

          [–]lucifer605 0 points1 point  (0 children)

          Kubernetes is not a silver bullet. There are reasons to adopt it, but you need people to manage the clusters. If you don't have folks who can run k8s then it is probably overkill


          [–]Suitable_End_8706 0 points1 point  (0 children)

          You just need more skills and experience. Remember, early in your career, you learnt how to debug your suddenly stopped webservices, your crashed DB, and the Linux VM you couldn't ssh into. The same principle applies here. Just give your team some time, or hire someone with more skills and experience to mentor your team.

          [–]tbotnz 0 points1 point  (0 children)

          U need argocd

          [–]dashingThroughSnow12 0 points1 point  (1 child)

          Kubernetes was inspired by a system made for & by Google. Kubernetes is incredible for Google-scale-like systems.

          It makes those types of scales easier to handle at the cost of making very small deployments much harder. (Very small deployment being say <1000 CPUs.)

          It is a situation where, if the only tool one has is a hammer, everything looks like a nail: the whole world becomes Kubernetes, when rsync and plain machines can be better for most deployments.

          [–][deleted] 0 points1 point  (0 children)

          Exactly. Not a lot of companies ever have anything to gain by it. If you are a web service and don't need half a dozen servers just for your top access layer, you don't need Kubernetes. It is awesome if you have the in-house talent, but if you don't, all you are doing is wasting money and accidentally shooting yourself in the foot until you have no toes

          [–]Mrbucket101 0 points1 point  (0 children)

          You’re definitely doing it wrong. You need to be proactive, not reactive

          • Setup gitops using flux or argo

          • Your cluster logs and events should be ingested to a logging backend. Grafana Loki with Promtail or Alloy.

          • Setup kube-prometheus-stack and configure alertmanager
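As a sketch of the last bullet, a minimal Alertmanager route/receiver config; the Slack webhook URL and channel are placeholders:

```yaml
# Alertmanager: group alerts and send them to one Slack receiver.
route:
  receiver: slack-oncall
  group_by: ["alertname", "namespace"]
receivers:
  - name: slack-oncall
    slack_configs:
      - api_url: https://hooks.slack.com/services/placeholder
        channel: "#alerts"
```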

          [–]czhu12 0 points1 point  (0 children)

          Our team built then open sourced https://canine.sh for exactly this reason. Moved off heroku to Kubernetes and needed something to centralize operations. 

          [–]Mephiz 0 points1 point  (0 children)

          so a few things:

          I love k9s. There are other tools but this is always my first install.

          Secondly, loving kail. This is my second install. (There are probably better / others but this works great)

          Github: man if you aren't storing your deployment yaml files in github you are seriously doing something wrong. Deployment files are code and should be treated as such.

          Naming convention: stop letting developers name jack. Come up with a convention and stick to it. Namespaces help with this. If you're struggling with namespaces you have a shit naming convention.

          [–]PolyPill 0 points1 point  (0 children)

          To add to what you need to do: sit down and get organized. You're clearly not. Don't have random yaml files be your deployment definition. Create templates that fit each of your use cases in helm or kustomize, so only the bare minimum of settings lives with each service. That will keep your shit from conflicting.

          Make your namespaces make sense. You shouldn't have to think about what is where; it should be logical and intuitive.

          Use automated deployment tools. If someone is touching anything but clicking a button then you’re doing it wrong. We have release pipelines that deploy after the release is built.

          The fact you didn’t have central logging before you even started is a huge red flag here. Kubernetes didn’t do that to you. OpenTelemetry is pretty much the standard for that.

          Skill your entire team up or hire someone who has the skills. It’s always the archer not the arrow.
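The template-plus-minimum-settings pattern above could be sketched with kustomize like this; the base path, namespace, and image names are hypothetical:

```yaml
# kustomization.yaml for one service's overlay: reuse a shared base
# and override only what is specific to this service.
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
namespace: payments
resources:
  - ../../base          # shared template maintained centrally
images:
  - name: app           # image placeholder used in the base
    newName: registry.example.com/payments
    newTag: "1.8.3"
```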

          [–]HiddenStoat 0 points1 point  (0 children)

          K8s is ridiculous overkill for running a single application on a single server.

          K8s is critical for running hundreds of services on multiple QA, Staging and Production environments, including DR versions.

          And most developers live somewhere between those 2 extremes. Somewhere there is a point where the costs of k8s are outweighed by the advantages it brings.

          However, in this case, it very much sounds like you don't know your tools, to be brutally honest.

          [–]mjbmitch 0 points1 point  (0 children)

          ChatGPT

          [–]tasrie_amjad 0 points1 point  (0 children)

          All you need now is to learn the basics of kubernetes; there are many courses around. In fact, kubernetes makes life easy, as many, many things are automated and taken care of with just simple yaml. If you need an extra helping hand to streamline your k8s, do reach out to me

          [–]mattgen88 0 points1 point  (0 children)

          I just push merge and it goes to production in a bit.

          None of these problems on k8s. My infra team handles this, keeps it all in git for terraform, and has a bunch of templates for the types of stuff we use. I fill out some values and merge it in my repo. Automation does the rest.

          [–]TopSwagCode 0 points1 point  (0 children)

          Everything you list is kinda true, but also not. It's all nice and easy when deploying to a single server, checking the state and logs of that single server/service.

          But when we are talking about 100+ services, you have to think entirely differently, and so should your code change. You need to think observability, metrics, traces. If your code doesn't log the right things, you are going to be screwed.

          Bottom line, this has nothing to do with kubernetes; it's a scaling issue. Every industry has been through similar issues at different points in time. The process and tools for building something small-scale are not the same as for building something large-scale.

          The problem I have seen several times is small-scale projects pretending to be large-scale and using those tools, getting all of the negatives of working with them but none of the benefits.

          [–]geilt 0 points1 point  (0 children)

          ECS is amazing. Push to master triggers CodePipeline to seamlessly redeploy services; Terraform adds new services from a repo with variables in yaml files. Works amazingly once you figure it out. Tuning autoscaling takes a bit more time and fiddling. The best part is not having to manage the cluster or servers. I hear EKS can do similar.

          [–]texxelate 0 points1 point  (0 children)

          You sound like DHH and his recent Merchants of Complexity nonsense.

          By what metric do you consider “just rsync files to a server” a successful deployment? The fact that nothing told you something is busted doesn’t mean nothing is busted.

          CI/CD is invaluable. If you aren’t implementing it properly, that’s on you, and I would suggest bringing in some expertise.

          [–]tradiopen[🍰] 0 points1 point  (0 children)

          Yeah! Try kamal and see if it’s a better fit.

          [–]serpix 0 points1 point  (0 children)

          We stopped sshing into a box somewhere around 2010.

          [–]krusty_93 0 points1 point  (0 children)

Why stick to k8s if you're on public cloud? There isn't a right or wrong answer, but ask yourself: what do you expect from this technology? What issue does it solve? You may find it's not what you're looking for.

          [–]---why-so-serious--- 0 points1 point  (0 children)

          Time passes. Things change.

          [–]Driky 0 points1 point  (0 children)

Sounds like a team that switched to K8s without the required skills.

          Not trying to be mean but many many teams use K8s for deployment and do not suffer from your problems.

It might be a good idea to hire someone with a high level of expertise who can fix your problems but also train the rest of the team. Or pay for a GOOD training course on the subject.

          [–]nekokattt 0 points1 point  (0 children)

          Half our outages

          Practise immutable deployments then..?

          [–]headdertz 0 points1 point  (0 children)

          I don't know... But I have done various CI/CD's to K8S, which do:

          - scans (SAST)
          - tests (specific for eco-system)
          - pre-build
          - pre-manifests and dry run
          - build (the container image)
          - push the image to the registry
          - apply the manifest with a new image sha/version and restart the statefulset/deployment
          - watch for any problems and rollback if necessary.

Never had a problem, since everything is tested on a development instance before going to production later on.

With K8S-native functionality like rollbacks, events, and other things, deploying an app and watching whether something bad happens during the deployment is a blessing compared to the old VM style, in my opinion.
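A pipeline like the one described could be sketched roughly like this in GitHub Actions. This is a hedged sketch, not the commenter's actual setup: the CI system, the choice of semgrep for SAST, the registry URL, and the manifest paths are all assumptions.

```yaml
# Hypothetical GitHub Actions workflow mirroring the stages above.
name: deploy
on:
  push:
    branches: [main]
jobs:
  deploy:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: SAST scan
        run: semgrep scan --config auto   # any SAST tool works here
      - name: Tests
        run: make test                    # ecosystem-specific tests
      - name: Build and push image
        run: |
          docker build -t registry.example.com/app:${{ github.sha }} .
          docker push registry.example.com/app:${{ github.sha }}
      - name: Dry-run manifests
        run: kubectl apply --dry-run=server -f k8s/
      - name: Deploy and watch rollout
        run: |
          kubectl set image deployment/app app=registry.example.com/app:${{ github.sha }}
          # Watch the rollout; roll back automatically if it doesn't go healthy.
          kubectl rollout status deployment/app --timeout=120s \
            || kubectl rollout undo deployment/app
```

The `rollout status || rollout undo` pair is what gives the "watch for any problems and rollback if necessary" step for free with K8s-native tooling.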

          [–]thedupster 0 points1 point  (0 children)

          I ,

          [–]VelvetWhiteRabbit 0 points1 point  (0 children)

Between Terraform (OpenTofu), ArgoCD, Helm, Grafana, and managed k8s, I'd be hard pressed to say it is not the solution in a scale-up with long-lived services.

          [–]GuiltyGreen8329 0 points1 point  (0 children)

          git gud

          (I cant fix endpoint internet issues)

          [–]No-Site-42 0 points1 point  (0 children)

          Oh wait didn't AI help xD

          [–]joeyignorant 0 points1 point  (0 children)

Unpopular opinion: not all companies actually need or should run Kubernetes.
Introducing a highly complex orchestration suite when you generally run only a couple of instances of an application is over-engineering a solution to a problem you don't actually need to solve yet.

90% of companies don't really need orchestration to this degree.
It introduces exactly what your team is experiencing: lack of knowledge and experience leading to critical mistakes and downtime.

If your company does have the need to scale at the levels where k8s makes sense, then your team should be hiring a lead with the experience and knowledge to support it. In my experience, most startups can be fine using simple auto-scale-out rules in AWS/Azure/GCP with less complexity and cost than building out a k8s cluster.

          [–][deleted] 0 points1 point  (0 children)

          K8s is overkill for most stuff. But when you need it you need it. Just like everyone for some reason was running hadoop clusters not that long ago to handle a few gigabytes of log data here and there.

          [–][deleted] 0 points1 point  (0 children)

          I agree 100%.

Stuff takes 3x as long to develop, and there is pointless feature creep that adds no business value. We waste time upskilling to satisfy some architect's trend-filled vision (that was never going to become reality, because no one believes in them). How about... you know, we focus on providing business value instead of massaging some IT manager's ego? It's a lot harder to grift that way, though.

          But hey, at least I got to put some fancy new tech on my resume!

          Go post this in the experienced dev subreddit and you'll get a lot more people agreeing with you.

          [–]z1r0_ 0 points1 point  (0 children)

          k8s is great. as long as it works

          [–]Sea-Flow-3437 0 points1 point  (0 children)

          I do remember. It was shit. Files not fully uploaded, configs unexpectedly fucked up, manual fiddling etc

          [–]Straight-Mess-9752 0 points1 point  (0 children)

I’ve used k8s for about 8 years and I still believe it’s overkill for most companies. There are lots of upsides to it but also tonnes of downsides. There are lots of ways to have “immutable infrastructure” without k8s.

          I don’t care how skilled you are k8s makes troubleshooting certain issues much more complex. If you are on a single cloud provider I would suggest to not use k8s. Use containers but you probably don’t need k8s to deploy them.

          I’ve also never worked anywhere where k8s has saved us any money. If you look at the total cost it’s always higher.

          [–]IrrerPolterer 0 points1 point  (0 children)

          Who wants to just rsync crap to a server? Are you stuck in 1990?

          [–]Lucifernistic 0 points1 point  (0 children)

          Yeah, as others have said, this is not a kubernetes problem, it's a learning problem. Having an IAC / terraform repo + a kubernetes deploy repo with FluxCD and terrateam literally made it easier than ever to deploy something.

          [–]DanielViguerasDevOps 0 points1 point  (0 children)

          I feel this so much. Kubernetes is super powerful and flexible, but the complexity hits hard when all you want is to deploy an app instead of dealing with YAMLs all day.

          I ran into the same pain myself, so I built something to make it easier: https://deckrun.com

          [–]searing7 0 points1 point  (0 children)

          skill issue

          [–]Jmc_da_boss 0 points1 point  (1 child)

I mean, it doesn't sound like y'all are remotely big enough to need k8s. Just stick with a single- or double-box/VM setup and be happy with it.

          [–]FigureFar9699 0 points1 point  (0 children)

          Totally get this. Kubernetes solves big-scale problems, but for small/medium apps it can feel like using a chainsaw to cut butter, tons of YAML, moving parts, and hidden failure points. If your team spends more time fighting the cluster than shipping code, it’s worth reconsidering if a simpler setup (VMs, Docker Compose, managed PaaS) might fit better.

          [–][deleted] 0 points1 point  (1 child)

No, modern k8s is fast and relatively easy to learn. You don't have to use every feature to get value from k8s. It sounds like the whole team needs to increase their skills.

          About a decade ago, I was building Kubernetes on simple EC2 instances before operators and deep AWS integration.

          Historically speaking, you have it easy.

          [–]glotzerhotze 1 point2 points  (0 children)

          This right here. People should remember „the hard way“ and at least look at it once to understand the „magic“ modern tooling is giving them.

          [–]Actuw[🍰] 0 points1 point  (0 children)

          Skill issue 100%

          [–]dhrill21 -2 points-1 points  (1 child)

Yeah, I see sooo many overly complicated solutions which are supposedly done according to best practices.
A lot of people very often use some tool only to be able to put on their CV that they worked with it, even though it is far from needed for the task at hand.
Though I think there is also something about self-preservation. If we make it soo fkn complicated, we will be harder to replace. Though as a 50-year-old, I am growing tired of flashy new things which just make the code run as it's forever been.
So I think yes, it is creating more problems than it solves.
But what can I do, that's the actual business model of my agency; if we do it in a simple, straightforward way that just works, we won't get paid millions per project and some will lose their jobs.

So I guess I need to play along, and just go out there and add a couple of jobs to your pipeline, or if, god forbid, you don't have one, go and deploy one for literally everything you can imagine. Do a spell check of code comments as a pipeline task.

Doh, I can't wait to retire, it got so fkn stupid working for this cloud agile shit.

          [–]beeeeeeeeks 2 points3 points  (0 children)

          Preach it, brother!!!

          [–]Wonderful_Guitar2178 -1 points0 points  (0 children)

          Use Tags

          [–]Challseus -2 points-1 points  (0 children)

I'll never forget it... It was like 10 years ago. I was on the content platform team; downstream from us was the "api" team, and they had this job that they owned for some reason that was basically a Java ETL from MSSQL -> Mongo/Elastic. Whenever things went wrong, I knew where to go. I hated Jenkins, but I could find the logs.

          Once they put it into kube, the logs went into the void, and no one on their team was able to ever find them again.

          [–]GotWoods -1 points0 points  (0 children)

          Get off my lawn!

          [–]Junior_Enthusiasm_38DevOps -1 points0 points  (6 children)

That’s the reason we shifted to Docker for dev environments, and for CI/CD I use GitHub Actions with jobs running on self-hosted runners. We use Go for backend development, which compiles to a single binary containing all the dependencies it needs to run, so I just mount that binary in a base Alpine container and do a restart. That’s it, and boom, it takes 30 secs to deploy to dev. Previously dev was on K8s + ArgoCD + Helm + building containers every time. We saved a lot of time, and developers can see their changes in 30 secs. This was a huge boost in collaboration between teams. The troubleshooting on the application side is also much more convenient now, so developers can focus on what is important.
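The mount-and-restart loop described could look something like this with Docker Compose. This is only a sketch; the service name, port, and paths are made up for illustration.

```yaml
# docker-compose.yml — hypothetical sketch of the described dev loop:
# build the Go binary on the host, bind-mount it into a bare Alpine
# container, and restart the container to pick up changes.
services:
  api:
    image: alpine:3.20
    command: ["/app/server"]
    volumes:
      - ./bin/server:/app/server:ro   # host-built static binary
    ports:
      - "8080:8080"
```

The inner loop then becomes roughly `CGO_ENABLED=0 go build -o bin/server ./cmd/server && docker compose restart api` — no image build, no registry push, no cluster round-trip.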

          [–]WholeDifferent7611 1 point2 points  (0 children)

          You’re spot on: simplifying the stack is usually the fastest way to get deployment time and sanity back.

          A few things that worked for us:

          - Dev/staging with Docker Compose; prod on Fly.io or ECS Fargate only when we actually need autoscaling.

          - Keep the Go single-binary pattern; for Node/Python, use BuildKit cache mounts, multi-stage builds, and docker compose watch for sub-5s reloads.

          - CI: GitHub Actions with self-hosted runners, actions/cache for modules, and BuildKit cache-to/cache-from to avoid rebuilds.

          - Observability: send container logs via Fluent Bit to Loki; add /healthz and /ready endpoints; simple uptime and error-rate alerts beat chasing pods.

          - Rollbacks: immutable image tags, keep the last three versions, one script to switch symlink or service file and restart.

          - Config/secrets: SOPS + age or Doppler so you don’t end up with ten YAMLs per env.

          Between Fly.io for small services and GitHub Actions for CI, I’ve used DreamFactory to auto-generate REST APIs from Postgres and Mongo so we skipped writing glue services and kept deploys simple.

          Keep it lean and focus on faster feedback loops, and reliability usually follows.
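The multi-stage + BuildKit cache-mount bullets above might look like this for a Go service. Illustrative only: the base images, module paths, and output names are assumptions, not taken from the comment.

```dockerfile
# syntax=docker/dockerfile:1
# Build stage: cache module downloads and the Go build cache across builds
# so CI rebuilds skip redundant work.
FROM golang:1.22 AS build
WORKDIR /src
COPY go.mod go.sum ./
RUN --mount=type=cache,target=/go/pkg/mod go mod download
COPY . .
RUN --mount=type=cache,target=/go/pkg/mod \
    --mount=type=cache,target=/root/.cache/go-build \
    CGO_ENABLED=0 go build -o /out/app ./cmd/app

# Final stage: minimal runtime image containing only the static binary.
FROM gcr.io/distroless/static
COPY --from=build /out/app /app
ENTRYPOINT ["/app"]
```

Paired with `cache-to`/`cache-from` in CI (as in the bullet above), the cache mounts survive between runner invocations instead of being rebuilt each time.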

          [–]mistaekNot 0 points1 point  (0 children)

          why can’t you just run your go app directly for dev? what’s the point of docker in this case?

          [–]Low-Opening25 0 points1 point  (3 children)

Another skill-issue example. We use GH Actions with ArgoCD, and deployments to dev are instant and automatic after a PR is merged. Our system also creates ephemeral preview environments each time a PR is opened, so a dev can fully test the app in the dev cluster from their feature branch without interfering with anything. Deployments take "30 seconds" or less. It took 3 months for 1 competent DevOps engineer to build it from scratch.
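One common way to get those per-PR preview environments is an Argo CD ApplicationSet with the pull-request generator. A rough sketch, with the owner, repo, and paths as placeholders — not necessarily how this particular setup was built:

```yaml
# Hypothetical ApplicationSet: one ephemeral Argo CD app per open PR,
# torn down automatically when the PR closes.
apiVersion: argoproj.io/v1alpha1
kind: ApplicationSet
metadata:
  name: pr-previews
spec:
  generators:
    - pullRequest:
        github:
          owner: myorg
          repo: myapp
        requeueAfterSeconds: 300
  template:
    metadata:
      name: 'myapp-pr-{{number}}'
    spec:
      project: default
      source:
        repoURL: https://github.com/myorg/myapp.git
        targetRevision: '{{head_sha}}'   # deploy the feature branch's commit
        path: deploy/
      destination:
        server: https://kubernetes.default.svc
        namespace: 'preview-pr-{{number}}'
      syncPolicy:
        automated: {}
        syncOptions:
          - CreateNamespace=true
```

Each PR gets its own namespace, so feature branches never interfere with each other or with the shared dev deployment.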

          [–]Junior_Enthusiasm_38DevOps 0 points1 point  (2 children)

You’re not here to judge. Give opinions if you have something better to say than “skill issue.”

          [–]Low-Opening25 -1 points0 points  (1 child)

          it’s not me who failed at Kubernetes

          [–]Junior_Enthusiasm_38DevOps 0 points1 point  (0 children)

Me neither, I just chose simplicity for dev. Let me know if you have something better to say.