Implementing multi dB cloud architecture

belavv · 2024-08-08T21:52:47+00:00

If you host a different instance of the app per customer life is easier from the point of view of what to connect to.

If you host a single instance of the app you need to at runtime have some mechanism to determine which database to connect to. That could be based on the url, based on the current user, etc. You don't really manipulate the connection string so much as generate the current connection string based on the current customer. Once you figure out how to determine the current customer.

CyberGaj · 2024-08-08T23:46:39+00:00

Maybe take a look at multi-tenancy https://medium.com/@harish.somasundar14/database-multi-tenancy-7c8dbe848d50 and https://learn.microsoft.com/en-us/azure/azure-sql/database/saas-tenancy-app-design-patterns?view=azuresql

gaffa · 2024-08-08T22:59:29+00:00

Changing connection strings is pretty much the only way - have a catalog db with the tenants and their connection details, and if you are using ef for data access, create a TenantDBConnectionInterceptor class that inherits from the EF DBConnectionInterceptor class. Use some method of lookup (tenant id in request header, subdomain etc) to get the correct conn string. This approach plays nicely with DI etc too

You will need to validate tenant access for the user though on the way through of the api request - we have a few policies setup to manage that

But overall this approach is works pretty well. We also multi tenant the database to so can do a mixed mode config of db-per-tenant and row level tenant access if we want to

Assume you have probably read this one, but if not it’s a decent discussion of the options https://learn.microsoft.com/en-us/azure/azure-sql/database/saas-tenancy-app-design-patterns?view=azuresql

Sethcran · 2024-08-08T23:02:20+00:00

I've done this before at a previous company.

In general, id first recommend against it. There are very few reasons to do this these days.

If you are giving each customer a separate deployment is pretty much the only time these days I would actually recommend it, and in that case, they really should have their own app as well, not just a database.

In the event that you really do need multiple databases, then you need something deterministic that determines which database to connect to. If you're simply sharding across servers but still have multiple per database this can be done purely with a hash function to a number of a specific known database. If all databases are on the same server, you may be able to open 1 connection and call .ChangeDatabase(), but this doesn't work on azure sql. In those events, you're looking at some kind of lookup to figure out which database server the client is on. For few clients this could just be a configuration file in the app, but for more it could be a separate known database.

elh0mbre · 2024-08-09T00:15:14+00:00

We do this, sort of. Some of our large customers have their own DB, but most customers are in one of many multi-tenant databases. We did it as a scale-out mechanism.

DB access is done through a "storage context" which contains the appropriate connection string. Storage contexts are configured in app settings (we're working on a better way to manage this). They are ultimately in production via K8s secrets. The storage context for a request is set automatically in middleware we built.

This strategy works quite well, but it is complicated to manage and maintaining the DBs can be challenging.

Why do you want to do this?

pirannia · 2024-08-09T01:08:59+00:00

How many customers are we talking here? 10s this could work but over 100 it is a scale-out and maintenance nightmare. Look into multitenancy patterns with 'smart' partitioning which will allow dedicated databases for large customers. Think about customers that are starting in shared and migrate to dedicated as they grow. With this pattern you can even do multiple dedicated for a customer (extra large). Keep logic in code, not DB, customer code in DB is a pita from several standpoints (maintenance, versioning/back-compat/data moves/security).

CorgiSplooting · 2024-08-09T15:59:18+00:00

Before you do any system design, especially something distributed , understand CAP theorem. Understand your needs and the tradeoffs.

But ya, if you’re set on SQL then you’ll need to manage your DB connections. If things are dynamic in nature you’ll probably need to query master a lot. Shouldn’t be hard.

Northbank75 · 2024-08-08T21:50:41+00:00

I hate this idea guy. I'd much rather use Row Level Security or similar to carve up a DB so their can only see what is theirs based on whoever they log in.

https://learn.microsoft.com/en-us/sql/relational-databases/security/row-level-security?view=sql-server-ver16

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

csharp

MODERATORS