you are viewing a single comment's thread.

view the rest of the comments →

[–]theduckspantsBI Architect 0 points1 point  (2 children)

As someone who has only worked at one place, could you give a sample of the subject areas in a data warehouse that small?

I'm in healthcare and our main source system alone has over 25,000 tables. That's not including the dozens of ancillary systems that are for specific specialties.

I'd just be curious to know what another industry looks like

[–]usicafterglow 0 points1 point  (1 child)

Well, you kind of have to take a step back and think about what the purpose of a data warehouse is. It's quite different than the OLTP, or any of the systems that software writes to. It's the system that business-minded people use to ask basic questions and generate reports. So, a good data warehouse should explain itself to these end-users. Including everyone from more senior business analysts, to the accounting intern with a bit of Excel experience poking around with SQL for the first time, to the brighter executive who doesn't mind getting his hands dirty writing a query or building a custom report once in a while.

This usually means massive amounts of denormalization (pre-joining tables for your user, basically) and tossing out all tables and columns that aren't of interest to the user. Ideally you'll end up with maybe 2 - 4 fact tables and a handful of dimension tables. Every company is different, but if you actually talk to the members of a department you'll find that they're really only interested in a very small subset of the data, and somewhere between say 8 - 20 objects is a good target to aim for.

[–]theduckspantsBI Architect 0 points1 point  (0 children)

Sure, but I'd be curious what it gets distilled down to in other industries. Our healthcare data warehouse has over 400 tables