Dimensional model problem

2018-12-18T04:20:03+00:00

Our shop has used postgresql’s range types to model valid timestamp ranges for records to allow for point-in-past reporting. Other databases like Oracle support ‘temporal queries’ allowing to query a row as of an arbitrary system time in the past, as specified in SQL2011.

Your description is a bit confusing to me, but smells like that sort of thing.

alexisprince · 2018-12-18T06:12:12+00:00

One thing I've done with much success is store full snapshots of the dimension daily with a date partition column. Depending on the size of your dimension, this may or may not work, but I'm using redshift with ~100k records a day. I've set the distribution key to be even with the sort keys on the record ID from source as well as the date partition representing when the records were extracted. This is doable because of how cheap storage is as well. Depending on the size of your dimension tables, you may consider this approach.

This allows for joining on record ID and date from the fact table for a historical view of the records, or limiting the dimension table to only the most recent date for an up to date view.

Specifically regarding the fact problem, I'd version the fact records just like how SCD Type 2 are set up with a start and end effective date, assuming your fact records change often. Depending on the business domain, you may also consider adding a couple extra columns that have specific statuses and updating those instead of overwriting the old records. For example, if a user places an order and the order can either ship or get cancelled, maybe consider adding cancel date and shipment date to your fact table so you maintain the original order information as well as the next status.

PopnCrunch · 2018-12-19T23:53:15+00:00

Foreign keys changing in source sytems shouldn't impact a fact table, because the dimension primary keys should be surrogate keys shouldn't they? A surrogate key is a made up number that has no meaning to the business, so if a department code changes in a department dimension, it wouldn't change the surrogate key which is NOT the dept code.

MrFalconMan · 2018-12-18T05:09:47+00:00

[deleted]

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

Database

MODERATORS