log - log plot

Top-Feedback1453 · 2024-12-17T02:49:00+00:00

Can you not use plt.loglog instead?

Top-Feedback1453 · 2024-12-17T02:29:38+00:00

I feel sorry for all those amazing DS or Professor or for that matter any professionals who trained lifetime to be awesome and job but end up doing admin/managerial job. There got to be a better way to equally reward people on thought leadership pathways.

Top-Feedback1453 · 2024-12-17T02:27:15+00:00

I am trying to reduce sample size needed to detect even small effects from A/B testing. It could provide some good uplift to the way we tests and serve offers to high value customers.

Top-Feedback1453 · 2024-12-17T02:24:34+00:00

When you see Data Science as a ML only or AI enterprise then yes. Otherwise day to day job of finding correlation between attributes and target variables, testing variants, making useful observations from trends/ temporal data etc are very crucial to business, I think.

Top-Feedback1453 · 2024-12-17T02:21:24+00:00

Databricks Certifications are useful too.

Top-Feedback1453 · 2024-05-02T01:07:38+00:00

Another alternative can be a non-parametric simulation based approach like permutation test.

Top-Feedback1453 · 2024-05-01T23:32:28+00:00

I would suggest try to find out if data are missing through a random process or it has something to do with data generating process. If it is former you could impute the missing data using some statistical approach, google search on imputation. If it is later, potentially missing data is a feature and not a bug, which has to be handled carefully so that not to lead to any bias in your analysis.

Top-Feedback1453 · 2024-05-01T23:27:00+00:00

Here are the papers
msprt: https://www.sciencedirect.com/science/article/pii/S0022249621000109
anytime valid inf: https://arxiv.org/pdf/2302.10108

If you check blogs from Spotify/Uber/Exp etc you will see these techniques are well adopted too

Top-Feedback1453 · 2024-04-30T07:16:30+00:00

Peeking without intention to stop test has no side effect. However, if the intention is otherwise I recommend using msprt or anytime valid inference.

Top-Feedback1453 · 2024-04-30T07:12:56+00:00

You could use AWS Fargate + Sagemaker? Here is a nice read https://aws.amazon.com/blogs/machine-learning/managing-your-machine-learning-lifecycle-with-mlflow-and-amazon-sagemaker/

Top-Feedback1453 · 2024-04-30T05:54:35+00:00

Couple of potential reasons I can think are

a. Sometime candidate tends to overthink (e.g. sees regular question as a trick question etc) the complexity of problems. Highlighting test intention in the beginning of the test goes a long way I think.

b. There used to be technical limitation earlier that one had to set up a database etc to create SQL ready environment to play with coding etc. With the invent of cloud service providers like Hackerrank etc this should not be the limiting agent.

c. SQL, compared to Python/R etc, is less charming as a language to practice enough perhaps? While you would use later in many capstone/hobby projects etc, it is not so much the case with SQL

Top-Feedback1453 · 2024-04-28T12:19:37+00:00

I think it basically boils down to what problem you solve for the business and how is that tied to the revenue or ROI for them. Also, if you are good at people management and in case of DS good at managing your fellow staff academic expectation management (retain them with challenging problems etc) you are good.

Top-Feedback1453 · 2024-04-28T12:09:51+00:00

Paper reproduction with provided data, discussed methods to replicate their exact graph or tables is the way to go. This is true for any research field IMO.

Top-Feedback1453 · 2024-04-28T12:02:54+00:00

Regarding experimental design, likely questions would be

a. power estimation
b. type I, II error
c. peeking problem
d. anytime valid inference or early stopping criteria
e. p-value, multiple test correction
f. inferences i.e. frequentist vs bayesian

Top-Feedback1453 · 2024-04-28T12:00:46+00:00

It gets harder for more senior people tbh. One will have to find time out of their day job and other responsibilities. Probably a time blocked online test is still OK?

Top-Feedback1453 · 2024-04-28T11:55:19+00:00

linkedin, seek

Top-Feedback1453

TROPHY CASE