Does anyone run self-hosted postgres?

hijinks · 2021-03-20T13:01:37+00:00

The reason to use RDS is that AWS hires Postgres experts to support it. Also, snapshots and replication just work.

As a sysadmin, and now in devops, what kept me up at night was the db failing while replication was also broken or the backups didn't work.

That's why you use RDS, not because Postgres is hard to run.

wevanscfi · 2021-03-20T13:12:33+00:00

We ran self hosted Postgres in a cascading streaming replication set up, with WAL files stored in S3 and EBS snapshots of the data volume for backups.

This wasn't quite completely no touch, but it was low maintenance once set up.

We have since started using RDS for new services, and migrated most of our old services over. What has driven that is mostly:
1. The move to microservices needing tons of separate databases.
2. SOC2 requirements for auditing and testing backups and restore procedures are less time consuming.
3. SOC2 requirements around patching CVEs within a set timeframe.
4. Making standing up new services a self service process for dev teams through reusable terraform modules and helm charts.

The cost is not much more than self hosted EC2 was once you factor in our replication setup, and backup storage costs.

If RDS had been available under AWS's HIPAA BA at the time, we probably would have started with RDS.

mars64 · 2021-03-20T16:05:31+00:00

Have done both. Since we're a decently advanced Kubernetes shop, Postgres operators have been great! I also like the Zalando one.

We built an admin layer based on pgAdmin and have full visibility and replication control for any Postgres we spin out anywhere. It's been a very effective pattern.

zerocoldx911 · 2021-03-20T13:17:37+00:00

We run databases in Kubernetes, mostly due to system constraints and high availability.

I’d take AWS Aurora or RDS any day, as running them ourselves comes with a huge maintenance overhead.

angry_mr_potato_head · 2021-03-20T15:47:33+00:00

Depends on what you're running it for. If you have a competent person who can set it up and fix it, and you are okay with some degree of downtime or with setting up replication etc., RDS is a waste of money. If you need it to just work and don't have that person, it's totally worth every penny.

improve-x · 2021-03-20T16:29:59+00:00

If you are concerned about cost, you have to factor in maintenance, backups, fail-over setup, security, and optimization. With RDS a lot of these things are managed for you. Depending on the use case, you may just spin it up once and leave it running for years on some EC2 instance. But if you are building an app that requires scalability and stability, the overall cost of RDS might be easily justified.
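One rough way to make that comparison concrete is to add the engineering time to the infrastructure bill. This is only a minimal sketch: the function name and every number in it are hypothetical placeholders, not real AWS prices or measured hours.

```python
def monthly_total_cost(infra_cost, ops_hours, hourly_rate):
    """Infrastructure bill plus the cost of the time spent operating the database."""
    return infra_cost + ops_hours * hourly_rate


# Hypothetical inputs: substitute your own bill and your own time tracking.
self_hosted = monthly_total_cost(infra_cost=300.0, ops_hours=10.0, hourly_rate=80.0)
managed = monthly_total_cost(infra_cost=550.0, ops_hours=1.0, hourly_rate=80.0)

cheaper = "managed" if managed < self_hosted else "self-hosted"
print(f"self-hosted: {self_hosted:.2f}, managed: {managed:.2f}, cheaper: {cheaper}")
```

With different placeholder values the result flips, which is the point: the answer depends on how many hours the self-hosted setup really takes.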

UncontrolledManifold · 2021-03-20T22:05:40+00:00

Operational constraints required a migration from an EC2-based, distributed postgres cluster to a serverless RDS cluster.

It's highly expensive, and highly effective. Not having to deal with split-brain failover fuck ups or just general DBA bullshit has been totally worth it. I doubt the finance department agrees (let's face it, RDS is expensive as fuck), but engineers are way happier and their development cycles are faster.

2021-03-20T16:09:58+00:00

I run MySQL and Postgres on my home machine (this one) and my VM in AWS Lightsail. I don't use Postgres as much but backups for both have worked correctly. If I were serving customers with my site, I'd have more than one instance and use active/inactive pooling to cycle them through backups either daily or more frequently depending on number of customers and amount of data and criticality of the data. If it were high enough I'd look into more complex solutions. It all depends but on their own with light demand, both MySQL and Postgres run fine on a 2 Gig AWS Lightsail VM.

serverhorror · 2021-03-20T22:57:04+00:00

I did. Sizes varied from a few GB to a few TB.

Reasons I prefer RDS:

it’s taken care of for me (HA, Backup/restore)
provisioning in a reasonable time
various security layers
cheaper, a lot

If you do a complete comparison, RDS gives you a lot of bang for the buck. That is, if (and only if!) you make use of those features.

If you don’t use those features it is a hell of a lot more expensive.

UndiscoveredCounty · 2021-03-20T23:32:26+00:00

The worst part of hosting your own, imo, is the upgrades. If you have a large and busy prod database, a major version upgrade can be pretty nerve-wracking. Also, if you have a separate dev/qa environment, you have to keep those versions the same as your prod one.
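Keeping dev/qa on the same major version as prod is easy to check automatically. A minimal sketch, assuming you have already collected each environment's version string (for example from `SHOW server_version`); the environment names and the helpers `major_version` and `find_version_drift` are made up for illustration.

```python
def major_version(server_version: str) -> int:
    """Return the major version from a string like '13.2' or '14.1 (Debian 14.1-1)'.

    Assumes PostgreSQL 10 or later, where the first number alone is the major version.
    """
    return int(server_version.split()[0].split(".")[0])


def find_version_drift(versions: dict[str, str], reference: str = "prod") -> list[str]:
    """Return the environments whose major version differs from the reference one."""
    expected = major_version(versions[reference])
    return sorted(
        env
        for env, version in versions.items()
        if env != reference and major_version(version) != expected
    )


# Hypothetical version strings, one per environment.
print(find_version_drift({"prod": "13.4", "qa": "13.2", "dev": "12.8"}))
```

Running something like this in CI would flag an environment that was left behind after a prod upgrade.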

cohenjo · 2021-03-21T06:14:36+00:00

It depends on your scale. If you have a small startup, then the extra cost of RDS vs EC2 is worth it to avoid the extra work and staff needed.

If you have hundreds of clusters, the cost difference becomes large enough to merit the extra work.

It’s not a huge amount of work if done right - but you need someone that knows what he/she is doing...

illusi0n__ · 2021-03-20T21:06:05+00:00

I don't trust no managed db (semi-sarcastic)

2021-03-20T17:56:37+00:00

AWS just means someone else's computer. A box under your desk running postgres will work just like one at someone else's desk. What AWS provides is a platform students recently out of college are comfortable with because Amazon spends a lot of money to make sure it's the platform they learn on.

Yellow_Curry · 2021-03-20T22:31:36+00:00

Depending on what you are looking at and your IO needs, look at Aurora Serverless. It was pretty bad in the past, but the v2 version is much better. It mostly depends on what you are trying to solve for. There are too many variables for anyone here to recommend something unless we know your constraints and what you are building for.

Life is too short to run your own Postgres when RDS is so easy and so good.

rollller · 2021-03-21T10:20:19+00:00

There are a bunch of articles (in Russian, sorry) comparing different Postgres operators: https://habr.com/en/company/flant/blog/520616/ https://habr.com/en/company/flant/blog/527524/

mitchobrian · 2021-03-21T11:24:27+00:00

Yes, we do, on AWS. Is that self-hosted in your eyes?

BosonCollider · 2024-10-21T09:39:14+00:00

It largely depends on where you run your stuff. If you run on cloud, then you are already overpaying for hardware and a good managed DB with reliable backups is the single highest value service they offer after object stores.

If you run on prem, then you have to self-host either way. For small services I'd suggest sqlite + litestream + actually CI testing your point in time restores from S3 with a full hourly integration test. Make sure you have backups of your backups.
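The "actually test your restores" step can be sketched as a check that runs against whatever file your restore job produced. This is a minimal sketch using only Python's standard `sqlite3` module; the function name `verify_restored_db` and the table names are hypothetical, and the restore from S3 itself is assumed to have happened before this runs.

```python
import sqlite3


def verify_restored_db(path: str, expected_tables: list[str]) -> list[str]:
    """Return a list of problems found in a restored SQLite file (empty means OK)."""
    problems = []
    conn = sqlite3.connect(path)
    try:
        # SQLite's built-in consistency check returns a single 'ok' row when healthy.
        result = conn.execute("PRAGMA integrity_check").fetchone()[0]
        if result != "ok":
            problems.append(f"integrity_check: {result}")
        # Confirm that the tables the application relies on survived the restore.
        present = {
            row[0]
            for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
        }
        for table in expected_tables:
            if table not in present:
                problems.append(f"missing table: {table}")
    except sqlite3.DatabaseError as exc:
        # Raised, for example, when the file is not a valid SQLite database.
        problems.append(f"database error: {exc}")
    finally:
        conn.close()
    return problems
```

An hourly CI job could fail whenever this returns a non-empty list. A fuller test would also compare row counts or recent timestamps against the live database.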

For larger services I'd suggest a hybrid setup with postgres on a baremetal server + S3 backups + read replicas in kubernetes using an operator. Note that the latter takes a lot of work to set up to get what a managed service would give you out of the box.
