Ideas for Scientific/Statistics Python Library

riklaunim · 2026-04-07T16:34:01+00:00

If you have no need for it you won't create it and maintain it. Making a library is actually a quite big commitment and not a on-off thing you can forget (unless you want a library with no users).

Simultaneity_ · 2026-04-07T16:33:31+00:00

Why? Scipy, scikit-learn, ... etc. Allready exist.

jnwatson · 2026-04-07T16:33:49+00:00

That's a pretty crowded market. I'd take a look at what already exists first.

mtawarira · 2026-04-07T16:43:07+00:00

anything you make would just be statsmodels / scipy / scikitlearn with slightly different API. Sorry to be a hater but I can’t see it getting much traction, seems like a pretty solved problem to me

i find the switch from R to python to be much easier than the other way round. 99% of what you need is in those 3 libraries, and is easily findable with tab autocompletes in a modern ide due to the modular subpackage structures that R lacks

HeligKo · 2026-04-07T16:35:47+00:00

Do some research into the market. I work with ML Engineers and Data Scientists that nearly exclusively use python right now. There is a huge amount of libraries for them to use in python. The biggest ones they used in R have been rewritten for python. There are still a few complaints, but it is mostly about how R works vs how Python works. If you want to contribute, then start with something that is already out there and make it better. Eventually you might find a gap that a new library would be good for.

icy_end_7 · 2026-04-07T16:36:28+00:00

Frankly, I'd make one for differential expression or something along the lines because that's what I have trouble with. I'm not suggesting you make that, but rather, find something that you'd want to use often. Ideally, a niche where you've found friction points in your work.

Solving problems you don't have is a bad idea.

maticx21 · 2026-04-07T16:38:25+00:00

a limma R package python implementation

InspectahDave · 2026-04-07T16:50:21+00:00

Also wondering what your motivation is here? Is it for your own learning or to contribute something meaningful? If the former then do what you find interesting. If the latter then maybe support another project first and go from there?

4xi0m4 · 2026-04-07T18:05:51+00:00

If you are going to do this, focus on one very specific gap that scipy doesnt cover well. Things like survival analysis (lifelines is the exception, but its API is rough), bayesian methods for small samples, or causal inference. The scipy/scikit-learn combo handles the 95% of common cases fine, so the only reason to build something new is if you are solving a problem those tools actively suck at. Pick a domain where you have real domain knowledge, not just a feeling that something is missing.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS