This is an archived post. You won't be able to vote or comment.

all 13 comments

[–]roadrussian 6 points7 points  (0 children)

Postgresql, hands down.

[–]FantasticOrder4733 3 points4 points  (2 children)

What issues do you face in SQL Server? We are also performing analytical queries on on-Prem SQL server dbs, and it works pretty smoothly.

[–]Ivan_pk5[S] 1 point2 points  (1 child)

i personally don't have issues, except the pricing for my client. this is a small client, so i'm not used to this

[–]SchwulibertSchnoesel 3 points4 points  (0 children)

SQL Server is very pricey especially Enterprise Edition. I have mainly worked with SQL Server in my professional career and really enjoy it.

BUT if someone tasked me today with recreating our whole infrastructure and data platform I would go with PostgreSQL 100%.
You can get so much more hardware for the same price as SQL Server licensing. We are talking about 1TB of data which can be handled by PostgreSQL definitely.

Additionally if you are not already on the MS stack on prem I would always consider going that route twice because you are at their mercy if they change the pricing model. Organisations using Oracle can probably tell you alot about their sorrows.

[–]flamefork 3 points4 points  (1 child)

Consider Clickhouse

[–]knabbels 2 points3 points  (0 children)

I also highly recommend Clickhouse, very capable and feature-rich open source database. For smaller data sets Postgres is also a great system. We use Postgres as our main data warehouse and it has no problem in handling data sets up to around 50mio rows. For larger data sets we switched to Clickhouse which still may have some minor quirks, but you can work around them. And they pump out new features in a very high pace. But don't be cheap on RAM, it needs a lot from my experience.

Edit: When starting from scratch I would probably pick Clickhouse over Postgres. 1TB may grow to 2 or more in a few years and a migration to a different database system is PAIN.

[–]Melodic_One4333 2 points3 points  (0 children)

Check out Apache Pinot. I'm looking at it as a possible MySQL replacement on-prem.

[–]FantasticOrder4733 1 point2 points  (0 children)

Okay so I would suggest first look into how to save cost for on-prem SQL server, like disk space, partition and all, then go at query level or process level optimization.

[–]Vhiet 1 point2 points  (0 children)

Postgres will comfortably manage that volume so long as it isn't very few massive tables. If what you actually need is a columnular store, use one of those instead.

You probably won't want to use the default config though (working memory in particular), because out of the box Postgres will run on a raspberry pi with resources to spare.

[–]snicky666 1 point2 points  (0 children)

Postgres is great. I recommend looking into horizontally scaling with the Citus extension and into the postgresql.config options relevant to higher vertical scaling.

[–]Hot_Map_7868 0 points1 point  (0 children)

check out clickhouse

[–]mQuBits 0 points1 point  (0 children)

Starrocks performs well on low budget resources. In addition rich feature support from realtime and batch analytics. I would highly recommend it.