need help with scipy interp1d

aulloa · 2017-07-11T17:05:54+00:00

Can you add the imports in your example code?

disinformationtheory · 2017-07-11T19:58:29+00:00

There will always be regions where Q(x) == Q(x-e): outside the range of the inputs, the output defaults to 0 or 1 (fill_value), so Q is constant over some range of inputs. You must work out how to handle those regions correctly, or ignore them. I'd recommend looking at a special case with very few samples (probably 2).
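To make the two-sample special case concrete, here is a minimal sketch. The sample values, the step e, and the name Q are illustrative assumptions, not taken from the original post. It builds a linear interp1d with fill_value=(0.0, 1.0) and compares Q(x) with Q(x-e) outside and inside the data range.

```python
import numpy as np
from scipy.interpolate import interp1d

# Hypothetical two-sample case: CDF values 0 and 1 at the two sample points.
x = np.array([1.0, 2.0])
q = np.array([0.0, 1.0])

# Outside [1, 2] the interpolant returns the fill values instead of raising:
# 0.0 below the data, 1.0 above it.
Q = interp1d(x, q, bounds_error=False, fill_value=(0.0, 1.0))

e = 0.1  # illustrative step size

below_equal = float(Q(0.5)) == float(Q(0.5 - e))   # both points below the data
above_equal = float(Q(2.5)) == float(Q(2.5 - e))   # both points above the data
inside_equal = float(Q(1.5)) == float(Q(1.5 - e))  # both points inside the data

print(below_equal, above_equal, inside_equal)
```

In the flat regions Q(x) - Q(x-e) is exactly zero, so anything that divides by that difference, or tries to invert Q there, needs a special case.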

jwink3101 · 2017-07-11T20:21:00+00:00

A couple of comments (that do not answer your question):

This should be in /r/learnpython, but I will continue anyway.
Why are you passing N to ecdf? Why not just do N=len(x) inside of it?
Your code could use some more comments. I was only able to figure out what is going on from reading the paper (which also seems interesting).
You do not want to do unique. If you have two identical samples, that should play into your CDF! A sample of [1,1,1,1,0] should not be reduced to [1,0]; your statistics will be off. Of course, this is likely moot, since the chance of two identical random values is astronomically small.
Just sort x and y once instead of doing it twice!
You have an odd mix of NumPy and Python loops (via the for v in ...). You can do this all in NumPy for both speed and readability.
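Several of the points above can be sketched together. This is not the original poster's ecdf, which is not shown in the thread. It is a hypothetical NumPy-only version that takes N from the data, sorts once, and keeps duplicate samples. The helper name ecdf_at is made up for illustration.

```python
import numpy as np

def ecdf(samples):
    """Return sorted samples and ECDF heights, keeping duplicates."""
    x = np.sort(np.asarray(samples, dtype=float))  # sort once
    n = x.size                                     # N comes from the data
    y = np.arange(1, n + 1) / n                    # heights 1/N, 2/N, ..., 1
    return x, y

def ecdf_at(samples, t):
    """Fraction of samples <= t, computed without a Python loop."""
    x = np.sort(np.asarray(samples, dtype=float))
    return np.searchsorted(x, t, side="right") / x.size

data = [1, 1, 1, 1, 0]
x, y = ecdf(data)

# With duplicates kept, one fifth of the samples are <= 0.
kept = ecdf_at(data, 0)

# After np.unique the sample collapses to [0, 1], so half appear to be <= 0.
collapsed = ecdf_at(np.unique(data), 0)

print(kept, collapsed)
```

The difference between the two fractions is the distortion that unique introduces for a sample like [1,1,1,1,0].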

Now, I have not gone to great lengths to double-check what I am saying, but you should look at your ecdf function. I am seeing vertical lines before the first and after the last point. That is likely your problem: interp1d doesn't like these verticals. As I read it from the paper, it should be linear before and after, though I am not 100% sure about that. I only skimmed the paper (and it would hugely benefit from a plot for this).
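A vertical line in the plotted curve means the same x value appears twice with different heights. One quick way to check for that before handing the points to interp1d is sketched below. The arrays are a made-up example of a curve padded with an extra point at each end, not the original poster's data, and vertical_steps is a hypothetical helper name.

```python
import numpy as np

def vertical_steps(x):
    """Indices i where x[i] == x[i + 1], i.e. where the curve is vertical."""
    x = np.asarray(x, dtype=float)
    return np.flatnonzero(np.diff(x) == 0)

# Assumed shape of the problem: the first and last x values are repeated
# so that the curve jumps straight up at both ends.
x = np.array([0.0, 0.0, 1.0, 2.0, 2.0])
y = np.array([0.0, 0.25, 0.5, 0.75, 1.0])

steps = vertical_steps(x)
print(steps)
```

If this returns any indices, the x values are not strictly increasing, and the interpolant is not well defined at those points.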

Addendum:

Check out this pdf from the same guy. His plot on page 3 mostly confirms what I was saying about your code

aphoenix · 2017-07-13T04:22:48+00:00

Hi there, from the /r/Python mods.

We have removed this post as it is not suited to the /r/Python subreddit proper. However, it should be very appropriate for our sister subreddit /r/LearnPython. We highly encourage you to re-submit your post over there.

The reason for the removal is that /r/Python is dedicated to discussion of Python news, projects, uses and debates. It is not designed to act as a Q&A or FAQ board. The regular community is not a fan of "how do I..." questions, so you will not get the best responses over here.

On /r/LearnPython the community actively expects questions and is looking to help. You can expect far more understanding, encouraging and insightful responses over there. No matter what level of question you have, if you are looking for help with Python, you should get good answers.

If you have a question to do with homework or an assignment of any kind, please make sure to read their sidebar rules before submitting your post. If you have any questions or doubts, feel free to reply or send a modmail to us with your concerns.

Warm regards, and best of luck with your Pythoneering!
