statistics

Representing continuous probability distributions

I have a problem involving a collection of continuous probability distribution functions, most of which are determined empirically (e.g. departure times, transit times). What I need is some way of taking two of these PDFs and doing arithmetic on them. E.g. if I have two values x taken from PDF X, and y taken from PDF Y, I need to get t...

Logical Languages - Prolog or Lisp/Smalltalk or something else?

So, I am writing some sort of a statistics program (actually I am redesign it to something more elegant) and I though I should use a language that was created for that kind of stuff (dealing with huge data of stats, connections between them and some short of genetic/neural programming). To tell you the truth, I just want an excuse to d...

Is it possible to get statistics with TortoiseSVN?

Is it possible to get statistics about each users contributions, total line count etc? ...

comparing statically typed code with dynamically typed code - costs and benefits

Whatever its merits, Adobe's Actionscript 3 presents what may be a unique opportunity to explore the consequences of typed versus untyped languages, because it is pretty much a strict superset of javascript, with all the syntactic benefits/overhead of strict type declarations, declarative casting, etc. What this makes possible is compar...

Best way to collect Rails application usage statistics

I have a Rails application where users can set up their domains and publish content onto them. I need to collect public traffic statistics such as pageview count etc. One good example of this feature is flickr usage statistics i can see as a customer. The question is what is the best way to collect usage information. Should it be done b...

How to calculate the sum of two normal distributions

I have a value type that represents a gaussian distribution: struct Gauss { double mean; double variance; } I would like to perform an integral over a series of these values: Gauss eulerIntegrate(double dt, Gauss iv, Gauss[] values) { Gauss r = iv; foreach (Gauss v in values) { r += v*dt; } return r; }...

Which java-library computes the cumulative standard normal distribution function?

For a project I have a specification with formulas, I have to implement. In these formulas a cumulative standard normal distribution function exists, that takes a float and outputs a probability. The function is symbolized by a Φ. Exists a Java-library, that computes this function? ...

How to do Unit Testing with Uncertainties?

We have several different optimization algorithms that produce a different result for each run. For example the goal of the optimization could be to find the minimum of a function, where 0 is the global minima. The optimization runs returns data like this: [0.1, 0.1321, 0.0921, 0.012, 0.4] Which is quite close to the global minima, so...

sql statistics io scan count explanation

Hi, Simple question, but I haven't found a good explanation on google. When using Set Statistics IO ON, the logical reads and scan count is provided in the message window of management studio. If I have: tblExample, scan count 5, logical reads 20 What does scan count signify? ...

Statistical library for Java

I was wondering if there is a good statistical library for Java. I used to use colt from CERN but it does not provide a good weighted sampling. So I usually use MATLAB Java Builder and export everything to Java afterward. Any thoughts or insights? ...

A/B testing and stats solutions

I've been looking for a good testing framework for months, not finding anything, so I've just been building my own. This is what I want to do: - track arbitrary behaviors (e.g. # of photos viewed, # of comments posted) - track correlation between arbitrary variables and those behaviors (e.g, how do different versions of this prompt af...

Per version (1.0,1.1,2.0,3.0,3.5) how many classes are in the .NET framework?

Per version (1.0,1.1,2.0,3.0,3.5) how many classes are in the .NET framework? I am looking for the info for a presentation that I'm going to give for a class in my Master's program. ...

How to calculate a weighted mean?

My language is PHP, but the algorithm should be fairly universal. I have an associative array of (let's say) ratings and number of times that rating has been given. $ratings = array( 1 => 1, 2 => 3, 3 => 6, 4 => 3, 5 => 3 ); This is the equivalent of: [1, 2, 2, 2, 3, 3, 3, 3, 3, 3, 4, 4, 4, 5, 5, 5], but given the...

Resources for learning SAS if you already familiar with R

I would like to learn some SAS because I am interested in a few industries that tend to use it exclusively. However, I don't want to get stuck with a resource that assumes I know nothing about statistical programming. Is there a good guide for programmers with statistics experience in R? Thanks, Steven ...

What is an Average that does not include outliers?

What do you call an Average that does not include outliers? for example if you have a set: {90,89,92,91,5} avg = 73.4 but excluding the outlier (5) we have {90,89,92,91(,5)} avg = 90.5 How do you describe this average in statistics? ...

What is statistical debugging?

What is statistical debugging? I haven't found a clear, concise explanation yet, but the term certainly sounds impressive. Is it just a research topic, or is it being used somewhere, for actual development? In other words: Will it help me find bugs in my program? ...

How to find the one hour period with the most datapoints?

I have a database table with hundreds of thousands of forum posts, and I would like to find out what hour-long period contains the most number of posts. I could crawl forward one minute at a time, keeping an array of timestamps and keeping track of what hour had the most in it, but I feel like there is a much better way to do this. I wi...

Computing the statistical mode

Hi all, I'm currently trying to verify whether or not, given an unsorted array A of length N and an integer k, whether there exists some element that occurs n/k times or more. My thinking for this problem was to compute the mode and then compare this to n/k. However, I don't know how to compute this mode quickly. My final result needs t...

How many developers do the different platforms have?

I noticed that Joel Spolsky repeatedly estimated the worldwide number of developers at 4 million in the Stackoverflow podcast. So, how many developers do the different platforms, such as Java, .NET/Windows, Flash, PHP, Python, Ruby etc., really have? This article puts Windows developers at 6 million, comment no. 2 in this article claim...

How to design a database table structure for storing and retrieving search statistics?

I'm developing a website with a custom search function and I want to collect statistics on what the users search for. It is not a full text search of the website content, but rather a search for companies with search modes like: by company name by area code by provided services ... How to design the database for storing statistics ...