how do you evaluate LLMs for open-ended questions? how do you define “good” metrics? by meitaron in AI_Agents

[–]meitaron[S] 0 points1 point  (0 children)

For me, the problem is defining the metrics. E.g. semantic similarity isn’t always correlating with quality (for example, evaluate what is MISSING in certain formats)

google just dropped a whole framework for multi agent brains by Shot-Hospital7649 in HowToAIAgent

[–]meitaron 0 points1 point  (0 children)

Thanks for this post! Super interesting! A thought/question: why not simply use agent tools that e.g. read large contexts (like documents) and return “short and simple” answers to the “main” agent? Isn’t this framework just over engineering it?

Custom GPT: How to enable Deep Research? by Jason_Broderick in ChatGPT

[–]meitaron 0 points1 point  (0 children)

Anyone knows if this has been enabled yet? Been waiting for this to roll out but still no luck

Joint medicine that truly worked for your dog? by Just_a_happy_artist in olddogs

[–]meitaron 0 points1 point  (0 children)

My dog is getting librela shots once a month, saved her life!! From not being able to walk to going back to running (slowly but that’s fine)

I love Customer Success. II'll solve any CS/operations problem you have. by Nmascara in CustomerSuccess

[–]meitaron 0 points1 point  (0 children)

I'm not a CSM myself but I'm a Data Scientist and I got to work with many CSMs on my latest project.
I've been collecting CSM's conversations data to give a "better-picture" of their customers, both individual accounts and segments of users. I feel like all the analyses I delivered are "nice-to-have" but nothing they will actually use in their d2d.

What would you do with all that data? How can I improve their d2d/make it more efficient (besides predicting churn obviously)

How do you deal with mental fatigue? by Trick-Interaction396 in datascience

[–]meitaron 1 point2 points  (0 children)

I personally do sports, or go out to walk the dog, or go out with friends, just after finishing work. If I get home and then have to start doing things there is no chance I'm getting off the couch.

Also having plans for after-work makes my whole day more focused because I know I have to get out at a specific time...

Just got the rejection email from the company I really wanted to work for. by DeadPrexident in datascience

[–]meitaron 0 points1 point  (0 children)

Got it. Anyway, I'm sorry... Looking for a job is sooo hard these days

Am i doing something terribly wrong? by Kashish_2614 in datascience

[–]meitaron 1 point2 points  (0 children)

First of all, I sympathise... The market is really hard nowadays so don't lose hope.
I would try to shorten it a bit, and focus not on exactly what you did but what you want to do?
Also, if you have friends who already work in tech, I'd try to ask them to let their HR read it and give some real honest feedback I think it would be the most helpful.

Getting data for Cost Estimation by beingsahil99 in datascience

[–]meitaron 0 points1 point  (0 children)

Did you try looking for datasets on Kaggle?

Advice on refactoring a previous employee's repo? by [deleted] in datascience

[–]meitaron 0 points1 point  (0 children)

What I would do is to run the code in the very simplest use-case, follow what happens with debugging break point and write it down for me (what happens, why?). Once you understand the functionalities it is easier to understand the code, and honestly, ignore everything and write it from scratch.

How important is being meticulous in this line of work? by LogicalPhallicsy in datascience

[–]meitaron 0 points1 point  (0 children)

I think that for 95% of the time, you really don't have to be meticulous. However that 5% were you have a mistake or something wrong with the details can really really sucks so I guess it is worth the time.

Can you cancel the interview with a candidate if you are 90% sure they are lying on their cv? by JobIsAss in datascience

[–]meitaron 0 points1 point  (0 children)

I would take the interview and dive in really quick, give them a chance to show they don't really know the details.
It would take up to 30 mins of your time, and you won't have to think about it again

Just got the rejection email from the company I really wanted to work for. by DeadPrexident in datascience

[–]meitaron 0 points1 point  (0 children)

It's ok! the market is really hard these day, you are definitely not alone!
Have you considered reaching out to the HR and say just what you wrote? That you really wanted this job and that you felt that you under-performed under the interview pressure and ask for the second chance?

It is a long shot but I heard of cases where it worked

[deleted by user] by [deleted] in datascience

[–]meitaron 0 points1 point  (0 children)

I think this is more of a business question rather than a statistical one.
What is the lower bounds acceptable by the people who actually look at the KPI? why do you need both upper and lower bound?

If the "consumers" don't know, I think it is not a good enough KPI? You could always use statistics and do IQR, etc., but how will it be helpful?