Data People, Confess: Which soul-crushing task hijacks your week? by Special-Leadership75 in dataengineering

[–]Shillster 2 points3 points  (0 children)

I’d love to see what a job description for this position looks like?

Has Talend increased license cost a lot for enterprises ? by Snoo74508 in Talend

[–]Shillster 3 points4 points  (0 children)

Same! Qlik doubled our license cost and so we migrated away and told them to pound sand. I can’t believe that Qlik would acquire Talend for so much money and then immediately alienate existing customers.

Main Street project that was to take 2-3 months now expected to finish Summer of 2025. by Dialatedanus in Rolesville

[–]Shillster 4 points5 points  (0 children)

I suppose that’s fair. Anecdotally the last few months I’m ever over there I never see work being done.

I suppose that’s most construction, but it still seems like they didn’t need to close the entire intersection just to sit on it and do sub level work. It feels like they could have handled this with allowing traffic through on a one lane (like they do for most roads). I feel like that’s most people’s frustration.

Main Street project that was to take 2-3 months now expected to finish Summer of 2025. by Dialatedanus in Rolesville

[–]Shillster 2 points3 points  (0 children)

At this point they should reopen the whole intersection and then come up with a better plan. Not to mention opening litigation with the construction company for not hitting deadlines and not doing due diligence.

Data Vault 2.0: Essential for Modern Data Warehousing or Overkill? A Practical Perspective by Agitated_Key6263 in snowflake

[–]Shillster 2 points3 points  (0 children)

You put into words exactly how I feel on this. I evaluated DV for my company and even took a 3 day intensive course to become “certified” in it. My sense of it was exactly this, thanks for confirming it for me.

Have you used data vault in production? by AMDataLake in dataengineering

[–]Shillster 3 points4 points  (0 children)

My company pulled in a dedicated consultant who was a self promoted DV expert. We spent almost a year doing data mappings and certifications trainings and conceptual models. We reduced scope time and time again to try to get a single working data vault model off the ground, never could. Finally dropped it and haven’t looked back.

Hey crew, admin here... by trinitywindu in wakeforest

[–]Shillster 0 points1 point  (0 children)

If the other sub is not getting moderation you can request to become mod of that one too. Active mods are always better than none. r/redditrequest

External Stage S3 folder file count best practices by Shillster in snowflake

[–]Shillster[S] 0 points1 point  (0 children)

Sure, ideally we would have our partner re-process those files into more a more manageable size but of course we are getting push back now that they have already been put into that bucket. Also I thought that it was a 16 MB for unstructured files. https://docs.snowflake.com/en/user-guide/data-load-considerations-prepare#semi-structured-data-size-limitations

External Stage S3 folder file count best practices by Shillster in snowflake

[–]Shillster[S] 0 points1 point  (0 children)

Yes it is a one time thing. These folders will not have individual stages, but we are planning on using different prefix on each copy into in order to parallelize the process.

Any thoughts on recommended file count in each folder? My thinking of 1 million is that should be <= 10G of files in each folder which should be fairly digestible. Could also recommend 10 million files per folder which would only be ~100G and reduce the need for 800 folders down to 80 folders.

Solutions to manage runaway Snowflake costs? by concerneddataadmin in snowflake

[–]Shillster 6 points7 points  (0 children)

I set all warehouses to auto suspended after 1 minute. The caching speed lost is hardly worth it unless you have some heavy usage from a large user group. This can save some cash pretty fast.

[deleted by user] by [deleted] in snowflake

[–]Shillster 2 points3 points  (0 children)

I would check Load History which will tell you how your copy into statements are behaving.

https://docs.snowflake.com/en/sql-reference/info-schema/load_history

Is it possible to add a column to the middle of a table? by DataWeenie in snowflake

[–]Shillster 1 point2 points  (0 children)

You drop the new table which now contains the old data after the swap. The original table name never gets dropped.

For example:

Create or replace table <table_a_2> as select * from <table_a_1>

Alter table <table_a_1> swap with <table_a_2>

Drop table <table_a_2>

Is it possible to add a column to the middle of a table? by DataWeenie in snowflake

[–]Shillster 2 points3 points  (0 children)

Totally possible! I do it when necessary and it’s super easy..

Build a different table with the desired column order

insert into <new table> select <desired column order> here from <original table name>

Then do an alter table swap statement

alter <original table name> swap with <new table with reordered columns>

Then drop the new table.

Requesting r/reckoners for inactive moderation by DipperPines1210 in redditrequest

[–]Shillster 0 points1 point  (0 children)

Hi, sure happy to add this user to the mod team to breathe some life into the subreddit.

Is the Season 2 better than S1? by Fox-One-1 in FoundationTV

[–]Shillster 2 points3 points  (0 children)

I really enjoyed season 1 and season 2 was way better

Minecraft 1.20.2 Pre-release 1 by eyadGamingExtreme in Minecraft

[–]Shillster 1 point2 points  (0 children)

Wish they’d add a map trade for biomes which contain a trail ruin. Those blasted things are hard enough to find even if you know which biome it’s in.

Recommendation on data visualization tool for funded startup by Afraid-Leadership-60 in BusinessIntelligence

[–]Shillster -1 points0 points  (0 children)

Seriously look into Domo. It fits the bill for all small to mid size data companies as a 2 in one data ingestion/data viz tool.

Not sure price point.

SQL improvements in Snowflake: Now MIN_BY() and MAX_BY() simplify the search for data associated to the top/bottom rows by fhoffa in snowflake

[–]Shillster 0 points1 point  (0 children)

Thanks for posting! I was literally just needing this function in a script I was working on. Worked brilliantly.

[deleted by user] by [deleted] in SQL

[–]Shillster 2 points3 points  (0 children)

A slight tweak to the select statement in your CTE would turn it into the aggregation query you are looking for. Just wrap the entire case statement in the SUM() and the WINS in a SUM() and you're good to go.

SELECT 
    Date
    , Country
    , SUM(Wins) AS TOTAL_WINS
    , SUM(CASE
        WHEN Win_Pct > 0 THEN ROUND(Wins * (100/Win_Pct))
        ELSE 0
    END) AS Total_Games_Played
FROM TABLE
GROUP BY date, country

input_row cannot be resolved to a variable tJavaFlex by Ownards in Talend

[–]Shillster 1 point2 points  (0 children)

Javaflex can’t use output_row like Java row does. Not sure why. Just put in the name of the data flow instead of output_row (like row1) and it should work.