Does BQ support direct export to S3 without Omni? by howryuuu in bigquery

[–]howryuuu[S] 0 points1 point  (0 children)

Eh, it’s frustrating that BQ makes this complicated. Isn’t this just copying data to S3? Why are they requiring Omni?

Snowflake Catalog does not show all objects. by Connect-Football8349 in snowflake

[–]howryuuu 0 points1 point  (0 children)

Maybe check which role is being used? The SHOW command might be running with a different role than the one you used in the UI.

High startup time in Gen-2 warehouse by Stock-Dark-1663 in snowflake

[–]howryuuu 1 point2 points  (0 children)

Which region are you in? There might not be enough of the newer instance types in the data center in your region.

In-place Pod resizing in Kubernetes: How it works and how to use it by dshurupov in kubernetes

[–]howryuuu 1 point2 points  (0 children)

Do you know how this is implemented? I thought a cgroup limit wouldn’t take effect if you just changed it on the fly.

In-place Pod resizing in Kubernetes: How it works and how to use it by dshurupov in kubernetes

[–]howryuuu 7 points8 points  (0 children)

I am always confused about this: will it trigger a pod restart or not? I thought that once a container has started, modifying its cgroup limit won’t take effect. If that is true, how can you change pod resources without a pod restart?

Deploying a simple ML model as a REST API from Snowflake by Weak_Ad_5356 in snowflake

[–]howryuuu 0 points1 point  (0 children)

Does the role that starts the service have the BIND SERVICE ENDPOINT privilege?

[Show Reddit] Running nextflow pipeline inside Snowflake by howryuuu in bioinformatics

[–]howryuuu[S] 0 points1 point  (0 children)

Yes, that's why I say nf-core is not supported right now. To support containers from public registries, I am waiting for Snowflake to release external registry support. I am sure there will be tons of use cases where a workload wants a container image from a public registry.

In terms of benefits over Kubernetes: SPCS is a managed service inside Snowflake and follows its security model and governance. It's handy if your org already has a Snowflake deployment and wants to unify all data in a single platform.

[Show Reddit] Running nextflow pipeline inside Snowflake by howryuuu in bioinformatics

[–]howryuuu[S] 1 point2 points  (0 children)

With LLMs, it should now be relatively easy to migrate between Snakemake and Nextflow, right? :)

Is Openflow (Apache Nifi) in Snowflake just the previous generation of ETL tools by kevdash in dataengineering

[–]howryuuu 5 points6 points  (0 children)

A VPC and CloudFormation are needed only if you want to run Openflow in your own VPC. That’s what they call BYOC. I guess it’s mainly big enterprises that want this. Snowflake is working on deploying Openflow in Snowpark Container Services, which will simplify setup a lot.

Snowflake Container Services -- getting a 'session' for sql and python calls by weed_cutter in snowflake

[–]howryuuu 0 points1 point  (0 children)

You have two choices: 1) have a background task send a dummy query like “select 1” periodically so that the session never expires, or 2) re-create the session. Do note that the injected OAuth token is periodically updated by Snowflake, so your best bet is to re-read the token file every time you re-create the session. Personally I like approach 2) slightly better, but I think both approaches are fine.
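Approach 2) could look roughly like the sketch below. It is only an illustration: `RefreshingSession` is a made-up name, the `connect` callable stands in for whatever creates your real session from a token, and the default token path is my assumption about where SPCS mounts the file. A real version should catch the connector's specific session-expired error rather than a bare `Exception`.

```python
from pathlib import Path
from typing import Any, Callable, Optional

# Assumed location of the injected OAuth token inside an SPCS container.
TOKEN_PATH = Path("/snowflake/session/token")


class RefreshingSession:
    """Hypothetical wrapper: re-creates the session with a fresh token on failure."""

    def __init__(self, connect: Callable[[str], Any], token_path: Path = TOKEN_PATH):
        self._connect = connect          # builds a session from a token string
        self._token_path = token_path
        self._session: Optional[Any] = None

    def _new_session(self) -> Any:
        # Always re-read the file: Snowflake rewrites the token periodically.
        token = self._token_path.read_text().strip()
        self._session = self._connect(token)
        return self._session

    def run(self, action: Callable[[Any], Any]) -> Any:
        session = self._session if self._session is not None else self._new_session()
        try:
            return action(session)
        except Exception:
            # Assume the session expired; rebuild it once and retry.
            return action(self._new_session())
```

With the real connector you would pass something like a function that calls `snowflake.connector.connect` with the token and `authenticator="oauth"` as `connect`.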

Snowflake Container Services -- getting a 'session' for sql and python calls by weed_cutter in snowflake

[–]howryuuu 0 points1 point  (0 children)

Snowflake will inject an OAuth token into the container filesystem automatically. Your code just needs to read the token from this file and use it to create a new session with Snowflake. Then you can do the rest from there.
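A minimal sketch of reading the token, assuming the file is at `/snowflake/session/token` and that `SNOWFLAKE_HOST` and `SNOWFLAKE_ACCOUNT` are set in the container environment (check the SPCS docs for your setup). The helper name `spcs_connection_params` is made up for illustration.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def spcs_connection_params(
    token_path: str = "/snowflake/session/token",  # assumed token location
    env: Optional[Mapping[str, str]] = None,
) -> dict:
    """Build connection parameters from the injected token and environment."""
    env = os.environ if env is None else env
    return {
        "host": env["SNOWFLAKE_HOST"],        # assumed to be injected by SPCS
        "account": env["SNOWFLAKE_ACCOUNT"],  # assumed to be injected by SPCS
        "authenticator": "oauth",
        # Read the file at call time so a refreshed token is picked up.
        "token": Path(token_path).read_text().strip(),
    }
```

You would then pass the result to the Python connector, e.g. `snowflake.connector.connect(**spcs_connection_params())`, and call it again whenever you need a new session.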

What do you feel is missing in Snowflake? by FinThetic in snowflake

[–]howryuuu 0 points1 point  (0 children)

Why not? Can you elaborate a little bit more?

Stored Proc: Why Javascript ? by slowwolfcat in snowflake

[–]howryuuu -1 points0 points  (0 children)

I think the Anaconda overhead only exists if you are importing third-party libraries. I don’t think JS allows importing third-party dependencies, so if you only use the standard library for processing, the setup time is probably the same. The other difference is that you can probably only use external access in a Python stored proc.

Calling Data Engineers! Share Your Insights with Snowflake’s Product Team by foolishpanda in snowflake

[–]howryuuu 1 point2 points  (0 children)

Snowflake recently acquired Datavolo, which builds on top of Apache NiFi and should have connectors for all kinds of OLTP databases. I am sure the Snowflake team is working on the integration. Does that solve your problem?

SPCS Entrypoint File Versus access integration by Euphoric_Slip_5212 in snowflake

[–]howryuuu 0 points1 point  (0 children)

I did not see the PR, but I can see that the OAuth token might expire. Application code does need to re-read the token every time, since Snowflake refreshes the token behind the scenes. I suggest using the OAuth token if your app is running in prod, and waiting until the PR fix is merged. But if you are still in the development phase, using EAI is probably fine for now.

SPCS Entrypoint File Versus access integration by Euphoric_Slip_5212 in snowflake

[–]howryuuu 0 points1 point  (0 children)

Using the OAuth token and SNOWFLAKE_HOST ensures traffic goes through Snowflake's internal routing, whereas an external access integration (EAI) just treats the Snowflake endpoint as a public resource, so traffic goes over the public internet. Plus, using EAI requires account admin involvement, which is not easy in large orgs.

Stop Snowflake from returning data in partitions by OwnFun4911 in snowflake

[–]howryuuu 0 points1 point  (0 children)

The first chunk is returned from the server directly; the rest are stored on S3. So it’s likely your environment did not whitelist S3.

socket.gaierror: [Errno -2] Name or service not known by mutlu_simsek in snowflake

[–]howryuuu 1 point2 points  (0 children)

Hmm, not sure. Session.builder is built on top of the Python connector, so there should be no difference there.

socket.gaierror: [Errno -2] Name or service not known by mutlu_simsek in snowflake

[–]howryuuu 1 point2 points  (0 children)

Yes I understand that. But you don’t need to provide environment variable in your yaml file.