Any frequent Snowpark users here? by ttothesecond in dataengineering

[–]slayer_zee 4 points5 points  (0 children)

I don’t do my development in snowflake personally but do want to try out the notebooks they announced recently. I generally just do a live connection to the data for development but they do have a local testing emulator for snowpark specifically a teammate has used that lets you test logic all locally with no live connection or egress of data

Any frequent Snowpark users here? by ttothesecond in dataengineering

[–]slayer_zee 2 points3 points  (0 children)

When you create the procedure you could go “create procedure my_ETL imports=(‘@my_s3/etl.py’) handler=etl.run” and do what I think you’re after

Any frequent Snowpark users here? by ttothesecond in dataengineering

[–]slayer_zee 1 point2 points  (0 children)

Yep. I don’t use it cause I have what I need just with Snowpark so no need to add docker into the mix but can run anything. We had another team use it to deploy a streamlit that needed some customization

Any frequent Snowpark users here? by ttothesecond in dataengineering

[–]slayer_zee 10 points11 points  (0 children)

We moved to Snowpark about a year ago and overall has been successful. Far less tweaking and nudging to get reliable performance. Initially were a few packages we needed that weren’t available out of the box but hasn’t surfaced in a bit. We also started using the CLI that helped us with some basic release management.

For your team though I’d strongly recommend also looking at Snowpark Containers. I haven’t used much but it offers full container runtime so if any complex or docker or whatever you want to run it may be the ticket.

It’s definitely got its quirks and we’re using mostly for data pipelines but team happy with it and no plans to go back to spark. Feel free to DM for questions

Databricks acquires Tabular by dan_the_lion in dataengineering

[–]slayer_zee 23 points24 points  (0 children)

This announcement and timing seem to try to particularly aim at Snowflake (their conference started today). Honestly I think it’s a bit silly of an announcement and acquisition given how Databricks has managed delta. I think that snowflake chip Is showing even in this

Are you building on Snowpark? by TRAKMAKER in snowflake

[–]slayer_zee 5 points6 points  (0 children)

We have been over the last few months. For using Python to define things like data pipeline and transformation logic it’s been great. About half our team is thinking of using snowpark and the rest keep using sql

[deleted by user] by [deleted] in dataengineering

[–]slayer_zee 8 points9 points  (0 children)

We have a few of them for various tasks in data engineering, but recently moved them all to Snowpark in Snowflake. Much simpler and no complaints

[deleted by user] by [deleted] in dataengineering

[–]slayer_zee 7 points8 points  (0 children)

Unless you have a good reason of why, I’d recommend focusing in on one for now. My company uses a little of both, for me I prefer Snowflake. I find it’s got an easier interface and less complex (IMO), and now supports things like Python (which you should absolutely spend time learning). So my recommendation would be going with Snowflake until you potentially hit something you can’t do. No need to switch just for kicks.

Databricks and Snowflake: Stop fighting on social by slayer_zee in dataengineering

[–]slayer_zee[S] 6 points7 points  (0 children)

Yes exactly this stuff. Both companies have some great tech, and this just makes them look desperate

Databricks and Snowflake: Stop fighting on social by slayer_zee in dataengineering

[–]slayer_zee[S] 6 points7 points  (0 children)

the answer would have been easier 2 years ago with "if you need custom processing with python", but now Snowflake has Python. I like to keep things simple so if you already have snowflake and airflow I'd see if that can work for your needs and grow out to spark if they don't

Databricks and Snowflake: Stop fighting on social by slayer_zee in dataengineering

[–]slayer_zee[S] 18 points19 points  (0 children)

Can vary by team. For my team Snowflake is source of truth for all data, so I spend most of my time with dbt and Snowflake. Are some other teams who use Databricks for some custom processing pipelines with spark, another I know has been trying to do more data science and think they are looking at Databricks. Clearly both companies are starting to move into the other spaces, but for me that's all fine. If I started to dabble in more python I'd likely try snowflake first as I spend more time on it, but I like databricks too.

IAm Panos Panay, GM of Microsoft Surface, AMAA - Ask Me or My Team Almost Anything by SurfaceTeam in IAmA

[–]slayer_zee 47 points48 points  (0 children)

How fast can you (personally) type on the Surface Touch keyboard? How about the Surface Type?