views:

616

answers:

1

Hi Everybody,

I am new to Python and new to SciPy libraries. I wanted to take some ques from the experts here on the list before dive into SciPy world.

I was wondering if some one could provide a rough guide about how to run two stats functions: Cumulative Distribution Function (CDF) and Probability Distribution Function (PDF).

My use case is the following: I have a sampleSpaceList [] which have 1000 floating point values. When a new floating point value is generated in my program, I would like to run both CDF and PDF on the sampleList for it and get the probabilty of value less or equal for CDF and probablity distribution for PDF.

Many thanks in advance!

Omer

=== some more information ===

Basically, in my program there are events which can either succeed or fail. If they succeed, then I calculate a event-ratio for that event and add to my sampleSpaceList until it reaches a threshold of 1000. Once the threshold is achieved, then for any next event-ratio; I would like to get a probability that whether that event-ratio would succeed or not in my system.

What I basically would like to get is the probability of success for a particular event ratio.

I am not very sure whether CDF or PDF will be relative to my problem so that 's why I wanted to learn how to use both but at any given moment, I will be only using either CDF or PDF to get a probability of event-ratio being successful.

+1  A: 

See this article: Probability distributions in SciPy.

John D. Cook
Thanks for the link John to your blog. It partially answered m question but since scipy.stats.norm(mean,deviation) takes mean/deviation before cdf/pdf could be called; i guess i would have to calculate the mean/deviation for the my samplespace[] with floating points. Is there a function in scipy which can take a list and return mean/std.deviation??Thanks,
Yes. If you assume your data are normally distributed you can use scipy.stats.mean and scipy.stats.std to find the sample mean and standard deviation. If your data are not normally distributed, look into the fit method on your distribution.
John D. Cook
So the fit function will take in a sample and return the distribution type for it? And then based on that distribution, one can calculate mean, std and probability right?