ansaurus

Question

"Reverse" statistics: generating data based on mean and standard deviation

Answer 1

+6 A:

You can generate standard normal random variables with the Box-Mueller method. Then to transform that to have mean mu and standard deviation sigma, multiply your samples by sigma and add mu. I.e. for each z from the standard normal, return mu + sigma*z.

John D. Cook 2010-07-08 21:58:00

Answer 2

A:

You could make it a kind of Monte Carlo simulation. Start with a wide random "acceptable range" and generate a few truly random values. Check your statistics and see if the average and variance are off. Adjust the "acceptable range" for the random values and add a few more values. Repeat until you have hit both your requirements and your population sample size.

Just off the top of my head, let me know what you think. :-)

eruciform 2010-07-08 21:58:28

Answer 3

A:

It is easy to generate dataset with normal distribution (see http://en.wikipedia.org/wiki/Box%E2%80%93Muller_transform ).
Remember that generated sample will not have exact N(0,1) distribution! You need to standarize it - substract mean and then divide by std deviation. Then You are free to transform this sample to Normal distribution with given parameters: multiply by std deviation and then add mean.

Tomek Tarczynski 2010-07-08 22:03:14

Answer 4

+2 A:

There are several methods to generate Gaussian random variables. The standard method is Box-Meuller which was mentioned earlier. A slightly faster version is here:

http://en.wikipedia.org/wiki/Ziggurat_algorithm

Here's the wikipedia reference on generating Gaussian variables

http://en.wikipedia.org/wiki/Normal_distribution#Generating_values_from_normal_distribution

Joel 2010-07-08 22:06:06

Answer 5

+2 A:

I'll give an example using R and the 2nd algorithm in the list here.

X<-4; Y<-2 # mean and std
z <- sapply(rep(0,100000), function(x) (sum(runif(12)) - 6) * Y + X)

plot(density(z))
> mean(z)
[1] 4.002347

> sd(z)
[1] 2.005114

> library(fUtilities)

> skewness(z,method ="moment")
[1] -0.003924771
attr(,"method")
[1] "moment"

> kurtosis(z,method ="moment")
[1] 2.882696
attr(,"method")
[1] "moment"

gd047 2010-07-09 08:00:22

Answer 6

+1 A:

This is really easy to do in Excel with the norminv() function. Example:

=norminv(rand(), 100, 15)

would generate a value from a normal distribution with mean of 100 and stdev of 15 (human IQs). Drag this formula down a column and you have as many values as you want.

el chief 2010-07-10 03:55:54

+1 for no programming required

quantumSoup 2010-07-13 18:20:53

ansaurus

tags:

views:

answers:

"Reverse" statistics: generating data based on mean and standard deviation

related questions