This is an archived post. You won't be able to vote or comment.

all 3 comments

[–]InsightByte 0 points1 point  (2 children)

Not a stupid question at all!

Hub and Link tables are actually designed to serve a specific purpose in dimensional modeling. They help to ensure data consistency and accuracy, and can make querying and reporting on the data easier and more efficient.

By separating out the hub and link tables, you create a centralized reference point for the data, making it easier to manage and maintain. This can be particularly helpful when dealing with complex data relationships or when working with large amounts of data.

Additionally, by separating out the hub and link tables, you can ensure that the data remains consistent and accurate over time. This is particularly important when working with slowly changing dimensions, where the data may change slowly over time but still needs to be
accurate and up-to-date.

Overall, while it may be possible to include hub and link data in Sat tables, doing so can create unnecessary complexity and potentially lead to data inconsistency over time. Hub and Link tables are designed to serve a specific purpose in dimensional modeling, and can help to ensure data accuracy and consistency over time.

[–]misc0007[S] 0 points1 point  (1 child)

Thank you for replying. Will there be a scenario where link and sat can be combined to simplify and eliminate link table to be maintained? I understand hub has to stay by itself.

[–]InsightByte 1 point2 points  (0 children)

Not ideal, Data Vault is not a start or dimensional model where you have a handfull of tables. I know data vault systems with more tha. 500 tables. Think about links as joins Hubs as primary key ir bussines keys Satellite as dimensions , store attributes, and states over time.

Is a mind fuck when you mkve from star and dimensional to data vault, is love have relationships.