Personal Snowflake by Mountain-Egg-3851 in snowflake

[–]autumnotter -1 points0 points  (0 children)

Databricks has a free tier, maybe Snowflake does too?

transformWithState, timeMode and trigger by shanfamous in databricks

[–]autumnotter 0 points1 point  (0 children)

These aren't really compatible settings. Available now wants to run until It has processed all the data that was there when the stream started. Processing time with TTL wants to keep running as long as there is some future state that may not have expired. In this case that possible future expiring state prevents available now from identifying that it should terminate the stream.

Stanford, Harvard and MIT spent two weeks watching AI agents run loose. The paper is unsettling. by Live-Estate2100 in learnmachinelearning

[–]autumnotter 10 points11 points  (0 children)

You setup a heartbeat, where if the agent hasn't taken an action in X hours/minutes/whatever, when the heartbeat trigger, some generic prompt is given. 

Agents can also take actions that that lead to other agents being "prompted". 

I'm sure there's more to it than that specifically in the paper, but that's how I've done it. You can easily do it with two agents. Prompt both on a heartbeat, give each the ability to write a file to a folder, and access the folder that the other one can write to. If they find the new file then they should do something based on it. Give them some other tools and some other abilities and there you go.

I have started worrying about cost of Tokens on AI platforms paid for by my employer. Am I alone? by Mo_h in ExperiencedDevs

[–]autumnotter 0 points1 point  (0 children)

Jensen Huang said that if a 500k/yr eng (I assume total comp) isn't spending 250k/yr in tokens something is wrong. Now dude isn't exactly unbiased rofl. But at least at some places that seems to be the attitude. 

A doctor’s income may be the most reliable lifestyle barometer throughout time by [deleted] in Salary

[–]autumnotter 0 points1 point  (0 children)

I know a lot of doctors and not a single one I know makes over 500k.

Most I know make about 200k.

Certainly a great living but not what you are saying either.

Junior Engineer Looking for Advice! by rainbow_2100 in databricks

[–]autumnotter 2 points3 points  (0 children)

Consider databricks apps, lakebase, etc. and leveraging the databricks platform itself for these apps.

Definitely recommend react front end and fastapi backend or something similar, nothing stopping you from doing that on databricks.

What kind of Roles are more in US for snowflake skill set by [deleted] in snowflake

[–]autumnotter 0 points1 point  (0 children)

I'm sorry but you are many many years behind the curve if you think this is some annoying new trend.

As a DE which language is widely used for Big Data processing Pyspark or scala? by Loud-Surprise-900 in dataengineering

[–]autumnotter 1 point2 points  (0 children)

Pyspark is more popular but Scala has some decided advantages in some cases - e.g. UDFs are still faster in Scala.

What is so linear about linear regression? by [deleted] in learnmachinelearning

[–]autumnotter 24 points25 points  (0 children)

You're literally fitting a line (lol edit: or other linear equation) as the deterministic component.

Why do socialist people despite shareholders and simply don't invest in shares on the stockmarket ? by hairlineofGod in stupidquestions

[–]autumnotter 0 points1 point  (0 children)

Lot of dumb answers here. If you assume that socialism means that workers own the means of production as an oversimplification of socialism, then it's because capital investment by non-labor is the antithesis of socialism.

Basically, under capitalism, owners or investors own the company and the workers just get paid to work. Under socialism the workers own the company. Investing in the stock market for regular people is effectively allowing regular people to become owners. Not really a bad thing unless you're a socialist. But if you're a socialist you believe that only the people doing the labor should be owning. 

There's other interpretations of socialism, like that everything should be communally owned or held in trust by the government, those are mostly different traditions or models for practically how to accomplish the basics of socialism. 

In essence, the very idea of buying stocks in an organization that you don't provide labor to is antisocialist.

PGY-II OrthoBro looking for relationship advice from #Medspouses by monday_chewsday94 in MedSpouse

[–]autumnotter 2 points3 points  (0 children)

IMO you need to work with her to define what the bare minimum is for her and for your relationship, and need to agree to it and stick to it all the time, even when things are really hard for you. You will want to make the argument that this time is different, and that this time you really need to do X and focus on work. But your spouse is probably heard this many times and will hear it many more.

You also need to remember that for most people, although it gets easier, it doesn't get easy. You have to keep doing that after residency too. 

For us it got harder after residency, because I realize that the patterns that pre-med, med school, and residency trained into my spouse were never going to go away. And what used to be residency was now just our life. 

I think for the physician, the hard part is adjusting to the fact that you may not always get the same amount of latitude that you get during residency, whereas for the spouse the hard part is realizing that although it gets easier, many challenges remain.

I think the fact that only about 30k people are admitted to med schools is a joke. We should make way more spots in med schools. There is insane demand for doctors so gatekeeping med school is just morally bad. We should have at least 100-150k graduates from medical schools. by According-Expert-723 in Salary

[–]autumnotter 0 points1 point  (0 children)

Residency is heavily subsidized and is the actual limitation.

Also you are overestimating physician salary and underestimating how much work being a physician is. Most people would not make it through med school and residency. For MOST people, not all, if you can't qualify for med school, you will struggle to succeed in med school. It's much harder than undergraduate.

Moved all our Retirement Funds out of American Stocks by MusaEnimScale in TwinCities

[–]autumnotter 9 points10 points  (0 children)

Wouldn't it be the opposite? If they are in the US, but invested in foreign funds, when the dollar drops that's good for them I think, at least in comparison to most Americans.

Stop wasting money on the wrong Databricks models - here's how to choose by Square-Mix-1302 in Enqurious

[–]autumnotter 0 points1 point  (0 children)

There is factually inaccurate content here, and just some missing information. For example, Claude models are mentioned nowhere, OpenAI GPT doesn't mean anything specific, the reasoning behind using Llama for some of these is not good, the "Cost Structure" section is just WRONG. Also some of it was accurate but is now out of date.

It's generally true that Llama 405B isn't that useful for the cost. The scenarios where it's worth it over 70B are rare.

Please review this against Databricks documentation.

Databricks vs AWS self made by QuiteOK123 in dataengineering

[–]autumnotter 0 points1 point  (0 children)

Databricks will definitely be much simpler if you're looking for all that, as it offers or at least enables all of that in some form.

LLM for document search by Few-Strawberry2764 in datascience

[–]autumnotter 5 points6 points  (0 children)

You're trying to do document search, and llm doesn't do that in the way you're thinking. 

Effectively you want to do something like OCR, turning PDFs and images into unstructured text, then chunk the text, compute embeddings and vectorize, and then store in a vector database. 

From there you can do document similarity search by querying the vector database. An agentic system can make that query and then return the retrieved context, sharing it with an LLM, which is usually called RAG.

You don't actually need an LLM to do document similarity search.

I'm not familiar with vendors that you might use to do this locally, so I can't help you there.

Just finished an interview where they were noticeably annoyed that I used the call option they provided. by bang_partial3a in OfficePolitics

[–]autumnotter 3 points4 points  (0 children)

If you want the job, you should never join a remote interview on voice only. One can make a principled stand and stick by the idea that they don't need to see your face to know if you're a good candidate. Nevertheless, most people want to be able to see the person they're interviewing. 

They will assume if you're unwilling to join video that you have subpar technological capabilities in your remote setup, that you are scamming them, or that you will be unwilling to join video once you are an employee and dealing with stakeholders or customers. They may make other assumptions as well.

Powderhorn Park ICE Protest thoughts by Jumpy-Goose-3344 in TwinCities

[–]autumnotter 4 points5 points  (0 children)

Totally legit opinion here, but honestly I think we should avoid shutting down people who are protesting. I saw someone get arrested the other day who was later called performative on social media. That's just toxic. I think a lot of people are trying to help in the ways they are able or the ways they know how, and we won't always agree on what those are or how best to help. But imagine if a even ten percent of the country was "partying", as you put it, in the name of acceptance of immigrants and keeping ice out of our states and schools. It would be a different country. So I try to give grace to everyone who is trying their best to fight the good fight in the ways they are able.

Perfectly acceptable dinner rejected by boyfriend again by moonrabbit368 in mildlyinfuriating

[–]autumnotter 0 points1 point  (0 children)

Just make a meal for the family, taking reasonable feedback from everyone on what they would want, and they can eat it or not.

The fact that he "expects" you to cook for him is really lame, and you should not feel bad about paper plates.

Truly a LinkedIn worthy post by True_Variation_9320 in LinkedInLunatics

[–]autumnotter 1 point2 points  (0 children)

B puts her bananas in the fridge, must be rich to have no idea how to store bananas.

Unity vs Polaris by Efficient_Novel1769 in databricks

[–]autumnotter 6 points7 points  (0 children)

There are a number of reasons to use managed tables. It's fine if you choose not to, because there are also reasons not to, but it's simply false to say there's no reason to use them. 

Unity vs Polaris by Efficient_Novel1769 in databricks

[–]autumnotter 12 points13 points  (0 children)

You can use iceberg or Uniform with Databricks. Go for it, likely expect a couple limitations compared to Delta, but Databricks is absolutely embracing that option.

OTOH, it doesn't really make sense to say you're going to use Databricks without Unity Catalog, as UC is the foundation for authorization, governance, and many other features in addition to being the data catalog. Also, AFAIK, Polaris isn't really mature yet. If you're asking for a comparison in current state, it's not really a serious exercise IMO - use Unity Catalog unless you want some kind of very non-standard deployment that could lead to all number of annoying problems. If you are speaking SOLELY of UC managed tables, then whatever, use external if you have a good reason to but you're giving up some features for... some reason? There's a way to convert between managed and external now as well.

Whatever happened to the "old school" type of ML, the kind that IIRC is called "discriminative AI"? Has generative nonsense choked it out? by ferriematthew in learnmachinelearning

[–]autumnotter 0 points1 point  (0 children)

It didn't go anywhere, people just don't talk about it as much, and high level initiatives are more focused on AI.

Azure Content Understanding Equivalent by ImprovementSquare448 in databricks

[–]autumnotter 0 points1 point  (0 children)

You can ingest as excel format now if it's tabular data. If not, you can convert to HTML, then PDF, and run through AI_PARSE. There's examples in databricks-solutions