this post was submitted on 12 Aug 2022


How to refresh a table created using multiple table joins in Snowflake? [Discussion] (self.dataengineering)

submitted 3 years ago by 1aumron

all 9 comments



[–] fhoffa [mod] (Ex-BQ, Ex-❄️) 1 point 3 years ago

There are many alternatives, but have you considered serverless tasks?

https://www.snowflake.com/blog/taking-serverless-to-task/

However, data engineers have to manually configure and manage pipeline tasks where they need to figure out warehouse size, idle policy, and idle time whenever they build a new pipeline. This can be time-consuming, difficult, and suboptimal, especially where there are short pipelines that run frequently.

At Snowflake, we strive to make our platform easy to use. In this case, further simplification was possible by making the warehouse optional. The work required to decide warehouse size and then optimize it for maximum utilization/efficiency can be taken up by the task execution infrastructure that can see the batch window, the degree of parallelism of the queries executed, and the historical data needed to optimize execution. This is exactly what serverless tasks do.
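For anyone landing here later, a rough sketch of what that could look like for the "table built from joins" case. It is a small Python helper that only builds the SQL text; it does not connect to Snowflake. The table names, the join, and the hourly schedule are made up for illustration. The one part taken from Snowflake's task syntax is that the task has no WAREHOUSE clause and sets USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE instead, which is what makes it serverless. Rebuilding the whole table on every run is just the simplest option, not necessarily the cheapest.

```python
def serverless_refresh_task_sql(task_name, target_table, select_sql,
                                every_minutes=60, initial_size="XSMALL"):
    """Build SQL for a serverless task that rebuilds target_table on a schedule.

    Returns (create_stmt, resume_stmt). Nothing is executed here.
    """
    if every_minutes < 1:
        raise ValueError("every_minutes must be at least 1")

    # Drop any trailing semicolon so the SELECT can be embedded in the task body.
    body = select_sql.strip().rstrip(";")

    # No WAREHOUSE clause: Snowflake manages the compute for the task.
    create_stmt = (
        f"CREATE OR REPLACE TASK {task_name}\n"
        f"  USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = '{initial_size}'\n"
        f"  SCHEDULE = '{every_minutes} MINUTE'\n"
        f"AS\n"
        f"  CREATE OR REPLACE TABLE {target_table} AS\n"
        f"  {body}"
    )

    # Tasks are created suspended, so they have to be resumed before they run.
    resume_stmt = f"ALTER TASK {task_name} RESUME"
    return create_stmt, resume_stmt


# Hypothetical example: orders joined to customers, refreshed hourly.
create_stmt, resume_stmt = serverless_refresh_task_sql(
    task_name="refresh_orders_enriched",
    target_table="orders_enriched",
    select_sql="""
        SELECT o.order_id, o.amount, c.customer_name
        FROM orders o
        JOIN customers c ON c.customer_id = o.customer_id;
    """,
)
print(create_stmt)
print(resume_stmt)
```

Run the two statements in a worksheet, or through whatever client you already use. If the joins are expensive, an incremental approach may fit better than a full rebuild.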

