ansaurus

Question

Calculating an area under a continuous density plot

Answer 1

+4 A:

Three possibilities:

The logspline package provides a different method of estimating density curves, but it does include pnorm style functions for the result.

You could also approximate the area by feeding the x and y variables returned by the density function to the approxfun function and using the result with the integrate function. Unless you are interested in precise estimates of small tail areas (or very small intervals) then this will probably give a reasonable approximation.

Density estimates are just sums of the kernels centered at the data, one such kernel is just the normal distribution. You could average the areas from pnorm (or other kernels) with the sd defined by the bandwidth and centered at your data.

Greg Snow 2010-10-06 20:15:19

Answer 2

+2 A:

What you are looking for is the Empirical Cumulative Distribution Function, given a specific density.

wok 2010-10-06 20:19:25

Answer 3

+2 A:

Calculate the density seperately and plot that one to start with. Then you can use basic arithmetics to get the estimate. An integration is approximated by adding together the area of a set of little squares. I use the mean method for that. the length is the difference between two x-values, the height is the mean of the y-value at the begin and at the end of the interval. I use the rollmeans function in the zoo package, but this can be done using the base package too.

require(zoo)

X <- rnorm(100)
# calculate the density and check the plot
Y <- density(X) # see ?density for parameters
plot(Y$x,Y$y, type="l") #can use ggplot for this too
# set an Avg.position value
Avg.pos <- 1

# construct lengths and heights
xt <- diff(Y$x[Y$x<Avg.pos])
yt <- rollmean(Y$y[Y$x<Avg.pos],2)
# This gives you the area
sum(xt*yt)

This gives you a good approximation up to 3 digits behind the decimal sign. If you know the density function, take a look at ?integrate

Joris Meys 2010-10-06 21:38:27

ansaurus

tags:

views:

answers:

Calculating an area under a continuous density plot

related questions