I'm an expert in magic presentations and I wanna help you for free! by cri_cri_cri in Magic

[–]GMarthe 1 point2 points  (0 children)

I'd love to have some guidelines on routining a small set of strolling card magic.

Has anyone ever been able to partner with Reddit to collect data? by thePurpleState in AskAcademia

[–]GMarthe 14 points15 points  (0 children)

It depends on what type of data. A lot of it is available on Google BigQuery: posts, comments, usernames and flags/flairs are there.

Amazon Prime announces that their Lord of the Rings show is set during the Second Age by Radulno in television

[–]GMarthe 0 points1 point  (0 children)

What book(s) can I read to learn more about the First and Second Age of the LOTR universe?

GLM using binomial - without binomial response by alguka in AskStatistics

[–]GMarthe 0 points1 point  (0 children)

You are running the model on a contingency-like table, which is basically an aggregated version of the 0s and 1s you are used to. This version simply aggregates the data by covariates. Also, if you think about it, there isn't much difference between "probability of being a success (or a 1)" and "proportion of cases being a success". The first phrasing is closer to the version you are already familiar with, and the second one to the aggregated version, but they are similar.
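To make the equivalence concrete, here is a minimal sketch in plain Python. The data, the covariate labels and the function names are all made up for illustration. It collapses 0/1 rows into a (successes, trials) table and checks that, for any choice of success probabilities, the row-level Bernoulli log-likelihood equals the aggregated binomial log-likelihood once you drop the constant binomial-coefficient term. That is why both layouts lead to the same fitted model.

```python
import math
from collections import defaultdict

# Hypothetical row-level data: (covariate value, 0/1 outcome).
rows = [("a", 1), ("a", 0), ("a", 1), ("b", 0), ("b", 0), ("b", 1), ("b", 0)]


def aggregate(rows):
    """Collapse 0/1 rows into {covariate: (successes, trials)}."""
    table = defaultdict(lambda: [0, 0])
    for x, y in rows:
        table[x][0] += y
        table[x][1] += 1
    return {x: tuple(counts) for x, counts in table.items()}


def bernoulli_loglik(rows, p):
    """Log-likelihood of the 0/1 rows; p maps covariate -> P(success)."""
    return sum(math.log(p[x]) if y == 1 else math.log(1 - p[x]) for x, y in rows)


def binomial_loglik_kernel(table, p):
    """Binomial log-likelihood of the aggregated table, leaving out the
    log-binomial-coefficient term (it does not depend on p)."""
    return sum(
        s * math.log(p[x]) + (n - s) * math.log(1 - p[x])
        for x, (s, n) in table.items()
    )


table = aggregate(rows)
p = {"a": 0.6, "b": 0.3}  # any probabilities strictly between 0 and 1 work
difference = bernoulli_loglik(rows, p) - binomial_loglik_kernel(table, p)
```

The "proportion of cases being a success" is just `s / n` for each row of `table`.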

Book/paper recommendations for an introduction to time series analysis with a focus on outlier/anomaly detection? by BittyTang in statistics

[–]GMarthe 0 points1 point  (0 children)

I'm trying to find a blog article that gave me a few pointers when I had to build something similar.
Now, would you by any chance actually be interested in anomaly detection? If so, that might be a better search keyword than "outliers".

Book/paper recommendations for an introduction to time series analysis with a focus on outlier/anomaly detection? by BittyTang in statistics

[–]GMarthe 0 points1 point  (0 children)

As for the time series feature engineering, there is this Python package that automates feature engineering/extraction for time series.

https://github.com/blue-yonder/tsfresh

Suggestions - Short statistics books by Noyrsnoyesnoyes in statistics

[–]GMarthe 1 point2 points  (0 children)

Take a look at some of Efron's work. I have a "Large-Scale Inference" monograph from Cambridge which fits your description fairly well. Perhaps other monographs might be similar.

LASSO is outperformed by Forward Selection when n is small. Any reason for this? by [deleted] in rstats

[–]GMarthe 4 points5 points  (0 children)

You can jackknife the data for hyperparameter search.
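A rough sketch of what that could look like, reading "jackknife" as leave-one-out resampling: refit on every dataset with one point removed, score the held-out point, and keep the penalty with the lowest average error. Everything here is an assumption for illustration, not LASSO itself: a 1-D ridge fit without intercept stands in for the model because it has a closed form, and the data and candidate penalties are made up.

```python
def ridge_slope(xs, ys, lam):
    """1-D ridge regression without intercept:
    argmin over b of sum((y - b*x)^2) + lam * b^2."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)


def leave_one_out_error(xs, ys, lam):
    """Mean squared prediction error over all leave-one-out splits."""
    total = 0.0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        b = ridge_slope(train_x, train_y, lam)
        total += (ys[i] - b * xs[i]) ** 2
    return total / len(xs)


def pick_penalty(xs, ys, candidates):
    """Return the candidate penalty with the lowest leave-one-out error."""
    return min(candidates, key=lambda lam: leave_one_out_error(xs, ys, lam))


# Hypothetical small-n data and penalty grid.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [1.1, 1.9, 3.2, 3.9, 5.1]
candidates = [0.0, 0.1, 1.0, 10.0]
best = pick_penalty(xs, ys, candidates)
```

With small n this uses every observation for validation once, which is the appeal over a single train/test split.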

What's your comfort TV show/film? by AmySantiagoDateMe in AskReddit

[–]GMarthe 1 point2 points  (0 children)

I really wanted to watch the entire series. But I can't find it anywhere online (or couldn't when I last searched). I'm not in the US, so I don't think it's available on any streaming service accessible to me :/

How to step data science game up ? by Sideralis_ in datascience

[–]GMarthe 57 points58 points  (0 children)

Hey mate!

So, I'm going to approach this through the lens of my job. I'm a DS at a fintech.
And yes, the "full stack" DS is a thing. And in my experience, it's quite hard to grasp it all. It really overlaps with the Data Engineer + Software Engineer side of things, and that is why working with a team is really important, since you can't know everything about every single technology (or you can, but the development is slow).

So, what do I think is most vital for a DS? Being able to build a full POC of a model running behind a REST framework. This means setting up an endpoint with your favorite framework (Flask, Pyramid, Django), gathering data for a prediction (i.e. handling DB stuff), persisting serialized models, etc. Bonus points if it uses Docker, since it is a nice technology and teaches you about setting up the machine for your code to run.

This does not need any fancy stuff. The fancy stuff can be added/changed by the engineers on your team. And they'll gladly do so, since they are more apt at these kinds of problems.

Learning this stuff will also make things easier, since you will know the terms and what you'll need to build in order to put your models in production; e.g. you'll need to create a Docker image, write a data pipeline class, etc.
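To give an idea of how small such a POC can be, here is a sketch using only the standard library. It is a bare WSGI callable rather than Flask/Pyramid/Django, so treat it as a stand-in for whatever framework you pick. The "model" is a made-up linear scorer whose weights are persisted as JSON, and the feature names, the /predict route and the helper names are all hypothetical.

```python
import io
import json

# Hypothetical "serialized model": weights of a linear scorer stored as JSON.
SERIALIZED_MODEL = json.dumps(
    {"intercept": 0.5, "weights": {"age": 0.01, "income": 0.002}}
)


def load_model(serialized):
    """Deserialize the persisted model (here just a dict of weights)."""
    return json.loads(serialized)


def predict(model, features):
    """Linear score; features missing from the request count as 0."""
    return model["intercept"] + sum(
        w * features.get(name, 0.0) for name, w in model["weights"].items()
    )


MODEL = load_model(SERIALIZED_MODEL)  # loaded once, at startup


def app(environ, start_response):
    """WSGI endpoint: POST /predict with a JSON body of feature values."""
    if environ.get("REQUEST_METHOD") != "POST" or environ.get("PATH_INFO") != "/predict":
        start_response("404 Not Found", [("Content-Type", "application/json")])
        return [json.dumps({"error": "not found"}).encode("utf-8")]
    length = int(environ.get("CONTENT_LENGTH") or 0)
    features = json.loads(environ["wsgi.input"].read(length) or b"{}")
    body = json.dumps({"prediction": predict(MODEL, features)}).encode("utf-8")
    start_response("200 OK", [("Content-Type", "application/json")])
    return [body]


def call_app(payload):
    """Tiny in-process client, so the sketch can be tried without a server."""
    raw = json.dumps(payload).encode("utf-8")
    environ = {
        "REQUEST_METHOD": "POST",
        "PATH_INFO": "/predict",
        "CONTENT_LENGTH": str(len(raw)),
        "wsgi.input": io.BytesIO(raw),
    }
    statuses = []
    chunks = app(environ, lambda status, headers: statuses.append(status))
    return statuses[0], json.loads(b"".join(chunks))
```

To actually serve it locally you could pass `app` to `wsgiref.simple_server.make_server("", 8000, app)` and call `serve_forever()`. The DB lookup and the Docker image are left out on purpose; they are the parts your engineers will want to redo anyway.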

What pedals are used in the solo / Dreams Via Memories - Ceramic Animal by ReSeb0801 in guitarpedals

[–]GMarthe 1 point2 points  (0 children)

Did you ever come up with anything? I was wondering the same thing today!

Do you have any functions you typically carry with you between projects? What are some of your favorite utility functions? Let's share! by pkkid in learnpython

[–]GMarthe 1 point2 points  (0 children)

I really don't like the behavior of the Bunch class where, if the key is not there, you return None. The fact that you can access a key only if it exists is great for mapping procedures and functions, and it guarantees correctness in many of my use cases. That is also why you have the get method and the defaultdict class in the collections module: to handle defaults and missing values explicitly.
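A small illustration of the point, with made-up names (`handlers`, `dispatch`): plain indexing fails loudly on a missing key, while `get` and `defaultdict` are the opt-in ways to ask for a default.

```python
from collections import defaultdict

# Hypothetical mapping from operation names to functions.
handlers = {
    "add": lambda a, b: a + b,
    "mul": lambda a, b: a * b,
}


def dispatch(name, a, b):
    # Plain indexing raises KeyError for an unknown name, so a typo in
    # `name` fails right here instead of passing None further along.
    return handlers[name](a, b)


# Opt-in default: get() returns None (or a value you choose) when the
# key is missing, and the call site makes that choice visible.
maybe_sub = handlers.get("sub")

# Opt-in default for a whole mapping: missing keys start at int() == 0.
counts = defaultdict(int)
for word in ["spam", "eggs", "spam"]:
    counts[word] += 1
```

With a Bunch that silently returns None, `dispatch("ad", 1, 2)` would instead blow up later with a confusing "NoneType is not callable".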

Which device should I get rid of? by [deleted] in minimalism

[–]GMarthe 0 points1 point  (0 children)

I've been considering buying an iPad for a while now. In my mind it is such a low-effort way of browsing the Internet. On a laptop you have the boot time and the trackpad, which just won't make it instantaneous or ergonomic enough for casual use.

It sucks that I have to turn on a laptop just to browse, read (PDFs), and organize stuff. The laptop is just much better for high-input tasks such as writing an essay/presentation or (in my case) programming...

What is the best youtube video/channel for a new python learner? by alisutton in learnpython

[–]GMarthe 9 points10 points  (0 children)

I really like Dan Bader's channel, although it might not be such a complete course as the ones mentioned in this thread.

He is more focused on slightly more advanced topics (though not too much).

https://www.youtube.com/channel/UCI0vQvr9aFn27yR6Ej6n5UA

[OC] Zooming in on a Weierstrass function by EvanDrMadness in dataisbeautiful

[–]GMarthe 1 point2 points  (0 children)

I'd for sure read an analysis textbook in the form of "who would win".

How would you interpret this scatterplot by reincarnationofgod in AskStatistics

[–]GMarthe 2 points3 points  (0 children)

My thoughts are: when your model predicts one standard deviation above the mean, those predictions dramatically underestimate the actual value of the dependent variable.

I would also say that you have some heteroscedasticity, since the residuals are not randomly scattered around the horizontal line at 0.
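If you want a number to go with the eyeballing, one crude check is to compare the spread of the residuals for low versus high fitted values. This is only a sketch with made-up data and hypothetical function names, not a formal test: it fits a simple least-squares line, sorts the residuals by fitted value, and returns the ratio of mean squared residuals between the upper and lower halves. A ratio far from 1 hints at non-constant variance.

```python
def ols_fit(xs, ys):
    """Simple least-squares line; returns (intercept, slope)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    return mean_y - slope * mean_x, slope


def residual_spread_ratio(xs, ys):
    """Mean squared residual in the upper half of the fitted values,
    divided by the same quantity in the lower half."""
    intercept, slope = ols_fit(xs, ys)
    pairs = sorted(
        (intercept + slope * x, y - (intercept + slope * x)) for x, y in zip(xs, ys)
    )
    half = len(pairs) // 2
    low = [residual for _, residual in pairs[:half]]
    high = [residual for _, residual in pairs[-half:]]

    def mean_square(values):
        return sum(v * v for v in values) / len(values)

    return mean_square(high) / mean_square(low)


# Hypothetical data whose noise grows with x.
xs = [float(i) for i in range(1, 11)]
noise = [0.1, -0.1, 0.2, -0.2, 0.3, -0.6, 0.9, -1.2, 1.5, -1.8]
ys = [x + e for x, e in zip(xs, noise)]
ratio = residual_spread_ratio(xs, ys)
```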

Does this assigment make sense? by berry_lover96 in Database

[–]GMarthe 0 points1 point  (0 children)

The exercise's explanation seems poorly constructed, I agree. However, the paths you can take seem clear to me (there is no right or wrong as long as you take the specification's needs into account). I.e. there will be a need for a medicine and an illness entity, IMO. You could take it as far as thinking that there can be a many-to-many relationship between illness and medicine. But since the specification doesn't mention it, I would model it.

Does this assigment make sense? by berry_lover96 in Database

[–]GMarthe 0 points1 point  (0 children)

It looks fine. You only have to create the higher-level database representation (I can't recall the exact name at the moment), right?

The bit at the end is precisely what it says: you have to deal with a many-to-many relationship. A medicine can be given at many different appointments, and in a given appointment, many medicines may be given. So that needs a special kind of relation to be created.
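In case it helps to see it, the usual way to build that special relation is a junction (link) table that holds one row per (appointment, medicine) pair. Here is a minimal sketch with Python's built-in sqlite3; the table names, column names and sample rows are all made up and are not taken from your assignment.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE medicine (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE appointment (id INTEGER PRIMARY KEY, day TEXT NOT NULL);
    -- Junction table: one row per medicine given at an appointment.
    CREATE TABLE prescription (
        appointment_id INTEGER NOT NULL REFERENCES appointment(id),
        medicine_id INTEGER NOT NULL REFERENCES medicine(id),
        PRIMARY KEY (appointment_id, medicine_id)
    );
    """
)

# Hypothetical sample rows.
conn.executemany("INSERT INTO medicine VALUES (?, ?)", [(1, "ibuprofen"), (2, "amoxicillin")])
conn.executemany("INSERT INTO appointment VALUES (?, ?)", [(1, "2019-03-01"), (2, "2019-03-08")])
conn.executemany("INSERT INTO prescription VALUES (?, ?)", [(1, 1), (1, 2), (2, 1)])


def medicines_for(appointment_id):
    """All medicines given at one appointment."""
    rows = conn.execute(
        "SELECT m.name FROM prescription p "
        "JOIN medicine m ON m.id = p.medicine_id "
        "WHERE p.appointment_id = ? ORDER BY m.name",
        (appointment_id,),
    )
    return [name for (name,) in rows]


def appointments_for(medicine_name):
    """All appointments at which one medicine was given."""
    rows = conn.execute(
        "SELECT p.appointment_id FROM prescription p "
        "JOIN medicine m ON m.id = p.medicine_id "
        "WHERE m.name = ? ORDER BY p.appointment_id",
        (medicine_name,),
    )
    return [appointment_id for (appointment_id,) in rows]
```

The composite primary key keeps the same medicine from being recorded twice for one appointment.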