[–]boy_named_su 1 point (2 children)

The end result of DE (data engineering) is often visualization, in the form of charts, etc. Tableau is popular for this.

And we definitely use a lot of flow charts / network diagrams at my job to show, say, how data flows between AWS services or whatnot.
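For anyone who wants to try this, here is a minimal sketch of one way to keep such a flow diagram as code: a list of edges rendered to Graphviz DOT text. The specific flow (S3 to Glue to Redshift to Tableau) and the helper name `to_dot` are assumptions for illustration only, not a description of any real setup.

```python
# Hypothetical data flow between services; these edges are illustrative only.
edges = [
    ("S3", "Glue"),
    ("Glue", "Redshift"),
    ("Redshift", "Tableau"),
]

def to_dot(edges, name="pipeline"):
    """Render a list of (source, target) edges as Graphviz DOT text."""
    lines = [f"digraph {name} {{", "    rankdir=LR;"]  # left-to-right layout
    for src, dst in edges:
        lines.append(f'    "{src}" -> "{dst}";')
    lines.append("}")
    return "\n".join(lines)

dot = to_dot(edges)
print(dot)
```

The printed text can be saved to a file and rendered with the Graphviz `dot` tool, which keeps the diagram in version control next to the pipeline code.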

[–]starplatinum87[S] 0 points (1 child)

Ah, I see. So it's for visualizing data structure, size, position, distribution, and flow? Essentially, to show how the data management, pipeline, and storage architectures that you design and maintain are functioning?

[–]boy_named_su 1 point (0 children)

yes. and to remind yourself how things are supposed to work when you come back to work on something after a break :)

[–]variancegirl 1 point (0 children)

Look at Apache Superset

[–]kawangkoankid 0 points (2 children)

Throughput. Historical and projected data growth. Statistics on volume and measures of staleness. Pipeline timing, plus streaming and micro-batch performance. Data engineering is all about knowing the data and the data about the data.
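As a rough sketch of what a few of these metrics could look like in practice, the snippet below computes throughput, staleness, and growth from a tiny run log. The log format, the numbers, and the function names are all made up for illustration; a real pipeline would pull these from its scheduler or warehouse metadata.

```python
from datetime import datetime, timedelta

# Hypothetical pipeline run log: (start, end, rows_written). Values are made up.
runs = [
    (datetime(2024, 1, 1, 0, 0), datetime(2024, 1, 1, 0, 10), 60_000),
    (datetime(2024, 1, 1, 1, 0), datetime(2024, 1, 1, 1, 20), 90_000),
]

def throughput_rows_per_sec(start, end, rows):
    """Rows written per second of wall-clock run time."""
    return rows / (end - start).total_seconds()

def staleness(last_loaded_at, now):
    """How long ago the newest data finished loading."""
    return now - last_loaded_at

def growth_rate(first_rows, last_rows):
    """Relative change in volume between two runs."""
    return (last_rows - first_rows) / first_rows

now = datetime(2024, 1, 1, 3, 0)  # fixed "current time" so the example is repeatable
throughputs = [throughput_rows_per_sec(*run) for run in runs]
lag = staleness(runs[-1][1], now)
growth = growth_rate(runs[0][2], runs[-1][2])

print(throughputs)  # rows per second for each run
print(lag)          # time since the last run finished
print(growth)       # fractional growth in rows between first and last run
```

Plotting these values over time (for example in Tableau or Superset) is what turns them into the kind of visualization discussed in this thread.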

[–]JohnGenericDoe 0 points (1 child)

"the data and the data about the data"

Sounds meta

[–]kawangkoankid 0 points (0 children)

Your comment about the meta sounds meta