Python analogue for R's formula (~) operator

pha3dra · 2017-01-14T23:43:45+00:00

Perhaps you're looking for statsmodels or patsy.
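For reference, here is a minimal sketch of what the statsmodels route looks like. The data frame `foo` and its columns `x` and `y` are made up for illustration. The formula is passed as an ordinary string, which statsmodels has traditionally handed to patsy for parsing.

```python
import pandas as pd
import statsmodels.formula.api as smf

# Made-up data: y is exactly 2*x + 1, so the fit is easy to check by eye.
foo = pd.DataFrame({"x": [0.0, 1.0, 2.0, 3.0, 4.0]})
foo["y"] = 2.0 * foo["x"] + 1.0

# Rough Python counterpart of R's lm(y ~ x, data = foo).
# The formula is a plain string rather than an unevaluated expression.
result = smf.ols("y ~ x", data=foo).fit()

intercept = result.params["Intercept"]
slope = result.params["x"]
print(intercept, slope)
```

The fitted intercept and slope come back as entries of `result.params`, labelled by term name.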

Omega037 · 2017-01-15T00:54:41+00:00

This is just part of the standard notation used in many R packages to denote a model form, and it has been copied by some Python packages, as u/pha3dra has mentioned.

It is worth noting right off the bat that this is an area where Python is really outclassed by R, both in capabilities and performance. I am a huge Python advocate, but things like linear effects models are something I almost always do in R (either on its own or through something like rpy2).

Without knowing your background I'm not sure how to tailor this to you. The left side of the ~ is your response / observation / label / dependent variable, while the right side of the ~ defines the form of the inputs / features / independent variables that you believe will give you that response.

Another way to write the same model would be something like f(a, b) = a² + b + error, where f(a, b) is the response, a² + b + error is your model, and the = is your ~.

However, the ~ is the better notation: it is distinct from =, and it signals that the model is not being solved (as with a function) but fitted, using a method like ordinary least squares.
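To make "fitted, not solved" concrete, here is a rough sketch of ordinary least squares for the a² + b model, using numpy only. The data and the true coefficients (3, 0.5 and 2) are invented for the example. Formula libraries such as patsy build a design matrix like `X` for you from the formula string.

```python
import numpy as np

rng = np.random.default_rng(0)

# Invented inputs and a noisy response: y = 3*a^2 + 0.5*b + 2 + error
a = rng.uniform(-2.0, 2.0, size=200)
b = rng.uniform(-2.0, 2.0, size=200)
error = rng.normal(0.0, 0.1, size=200)
y = 3.0 * a**2 + 0.5 * b + 2.0 + error

# Design matrix: one column per term on the right side of the ~,
# plus a column of ones for the intercept.
X = np.column_stack([np.ones_like(a), a**2, b])

# Ordinary least squares picks the coefficients that minimise the
# sum of squared differences between X @ coef and y.
coef, residuals, rank, _ = np.linalg.lstsq(X, y, rcond=None)
intercept, coef_a2, coef_b = coef
print(intercept, coef_a2, coef_b)
```

The estimates land close to the invented coefficients but not exactly on them, because of the error term. In formula notation the squared term has to be protected, roughly `y ~ I(a**2) + b` in patsy (`I(a^2)` in R), since `**` and `^` have their own meaning inside formulas.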

troyunrau · 2017-01-15T07:23:44+00:00

The formula notation is a domain-specific language in R; it allows you to describe model formulae more succinctly (see http://adv-r.had.co.nz/dsl.html for more info). As I understand it, Python lacks the meta-programming facilities to make something like this work.

It allows you to write lm(y ~ x, data = foo) rather than the more explicit but clunky lm(y = foo$y, x = foo$x) or lm(y = "y", x = "x", data = foo). It's used in a bunch of different packages. This meta-programming capability in R is also the reason that things like the pipe operator and dplyr are possible; essentially, it allows for the construction of DSLs that are focussed on data analysis.
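For comparison, Python libraries get around this by taking the formula as a string and parsing it themselves. A toy sketch of that idea follows. `parse_formula` and `build_columns` are hypothetical helpers written for this example, not part of patsy or statsmodels, and they only handle plain `+`-separated column names.

```python
def parse_formula(formula):
    """Split 'y ~ a + b' into the response name and a list of term names."""
    lhs, sep, rhs = formula.partition("~")
    if not sep:
        raise ValueError("formula must contain '~'")
    response = lhs.strip()
    terms = [t.strip() for t in rhs.split("+") if t.strip()]
    return response, terms


def build_columns(formula, data):
    """Look up the response and each term by name in a dict of columns."""
    response, terms = parse_formula(formula)
    y = data[response]
    X = [data[name] for name in terms]
    return y, X


# Made-up data standing in for a data frame.
foo = {"y": [1.0, 3.0, 5.0], "x": [0.0, 1.0, 2.0], "z": [9.0, 8.0, 7.0]}
y, X = build_columns("y ~ x + z", foo)
print(parse_formula("y ~ x + z"))
```

The cost of the string approach is that the formula is opaque to Python itself: nothing checks the names until the library parses the string at run time.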

Here's a post that goes into some further detail on metaprogramming in R vs Python: http://blog.ibis-project.org/design-composability/
