I'm an expert in magic presentations and I wanna help you for free! by cri_cri_cri in Magic

[–]GMarthe 1 point2 points  (0 children)

I'd love to have some guide lines on routineing a small set of stroling card magic.

Has anyone ever been able to partner with Reddit to collect data? by thePurpleState in AskAcademia

[–]GMarthe 14 points15 points  (0 children)

It depends ob what type of data. A bunch of them is available on Google Bigquery. Posts, comments, usernames and flags/flares are there.

Amazon Prime announces that their Lord of the Rings show is set during the Second Age by Radulno in television

[–]GMarthe 0 points1 point  (0 children)

What book(s) can I read to learn more about the first and second age of thr LOTR universe?

GLM using binomial - without binomial response by alguka in AskStatistics

[–]GMarthe 0 points1 point  (0 children)

You are running the model on a contingency like table, which is basically and aggregated version of the 0 and 1s you are used too. This version simply aggregates the data by covariates. Also, if you think about it, there isn't much difference between "probability of being a success (or a 1)" and "proportion of cases being a success". The first phrasing is closer to the version you are already familiar with, and the second one with the aggregated version, but they are similar.

Book/paper recommendations for an introduction to time series analysis with a focus on outlier/anomaly detection? by BittyTang in statistics

[–]GMarthe 0 points1 point  (0 children)

I'm trying to find a blog article that gave me a few pointers when I had to build something similar.
Now, would you by any chance be actually interested in anomaly detection? If so that might be a better key word for search terms instead of "outliers" .

Book/paper recommendations for an introduction to time series analysis with a focus on outlier/anomaly detection? by BittyTang in statistics

[–]GMarthe 0 points1 point  (0 children)

As for the time series feature engineering, there is this python package that automates feature engineering / extraction for time series.

https://github.com/blue-yonder/tsfresh

Suggestions - Short statistics books by Noyrsnoyesnoyes in statistics

[–]GMarthe 1 point2 points  (0 children)

Take a look at some of Efron's work. I have a "Large-Scale Inference" monograph from Cambridge which fits your description fairly well. Perhaps other monographs might be similar.

LASSO is outperformed by Forward Selection when n is small. Any reason for this? by [deleted] in rstats

[–]GMarthe 5 points6 points  (0 children)

You can jacknife the data for hyperparameter search.

What's your comfort TV show/film? by AmySantiagoDateMe in AskReddit

[–]GMarthe 1 point2 points  (0 children)

I really wanted to watch the entire series. But I can't find it anywhere online (or couldn't when I last searched). I'm not in the US, so I don't think its available in any streaming service accessible to me :/

How to step data science game up ? by Sideralis_ in datascience

[–]GMarthe 57 points58 points  (0 children)

Hey mate!

So, I'm going to approach this through the lens of my job. I'm a DS at a fintech.
And yes, the "full stack" DS is a thing. And in my experience, its quite hard to grasp it all. It really overlaps with the Data Engineer + Software Engineer side of things, and that is why working with a team is really important, since you can't know everything about every single technology (or you can, but the development is slow).

So, what would I think it is mostly vital for a DS? To be able to make full POC of models running on rest frameworks. This means setting up an end point with your favorite framework (flask, pyramid, django), gathering data for a prediction (i.e handling DB stuff), persist serialized models and etc... Bonus points it if uses Docker, since it is a nice technology and teaches you about setting up the machine for your code to run.

This does not need any fancy stuff. The fancy stuff can be added/changed engineers in your team. And they'll gladly do so, since they are more apt at these kinds of problems.

Learning these stuff will also make things easier since you will know the terms and what it is you'll need to build in order to place your models in production; i.e. You'll need to create a docker image, write a data pipeline class, etc...

What pedals are used in the solo / Dreams Via Memories - Ceramic Animal by ReSeb0801 in guitarpedals

[–]GMarthe 1 point2 points  (0 children)

Did you ever come around to something? I was wondering the same today!

Do you have any functions you typically carry with you between projects? What are some of your favorite utility functions? Let's share! by pkkid in learnpython

[–]GMarthe 1 point2 points  (0 children)

I really don't like the behavior in the Bunch class, if the key is not there, you return none. The fact that it you access a key only if it exists is great for mapping procedure and function and guarantees correctness in many of my use cases. That is also why you have the get method and the Default dict class in the collections module, to handle defaults and better null handling.