statistics

Python's list comprehensions and other better practices

This relates to a project to convert a 2-way ANOVA program in SAS to Python. I pretty much started trying to learn the language Thursday, so I know I have a lot of room for improvement. If I'm missing something blatantly obvious, by all means, let me know. I haven't got Sage up and running yet, nor numpy, so right now, this is all quit...

Page visits statistics

I need to implement product pages visits statistics (by days). How better to do this in Rails? ...

mysql query count per databse

Hi, Im searching a cmdline or any mysql config which can tell me the queries being fired per database. I would check this info frequenty. mysql gives query count but i want it itemized so I can locate which database is more busy more often. Planning to put that code into my munin plugin=chart. thanks, vikas ...

Code libraries/applications in C#

I am refreshing my memory on C# (first used it several years ago), and I want to look at some real world quality code (rather than the simplistic ones used in many books). My preference would be mathematical/statistics libraries written in C# as I would like to see how Matrices and PDEs (partial differential equations etc) are implement...

An accurate running statistical mean of a large array of bytes

I have a two dimensional array of bytes which looks like this: int n = 100000; int d = 128; byte[][] samples = new byte[n][d] /* proceed to fill samples with some delicious data */ byte[] mean = new byte[d]; findMean(mean,samples); My findMean function proceeds to fill mean such that: mean[k] = mean(samples[:][k]) Simple enough so ...

Open source queuing theory algorithms in Java

I need to write a program to analyze the performance of computer systems and networks using queuing theory (http://en.wikipedia.org/wiki/Queueing_theory). I was wondering if there is an open source Java library implementing the various algorithms of queuing theory that can make my task easier. Does anyone have any recommendations? ...

Is there a nice GUI available for showing Hibernate statistics?

Hibernate exposes many internal metrics via the Statistics API. Is there an easy-to-use GUI that I can use to visualize these statistics? Bonus if there is a Grails plug-in for this. ...

Web stats: Calculating/estimating unique visitors for arbitary time intervals.

Hi, I am writing an application which is recording some 'basic' stats -- page views, and unique visitors. I don't like the idea of storing every single view, so have thought about storing totals with a hour/day resolution. For example, like this: Tuesday 500 views 200 unique visitors Wednesday 400 views 210 unique visitors Thur...

GLM on police stops data in Gelman / Hill book

Has anyone worked with NY police stops data mentioned in Gelman, Hill book Data Analysis Using Reg. and Multi/Hier Modeling (ARM). The data is under http://www.stat.columbia.edu/~gelman/arm/examples/police/ the file is frisk_with_noise.dat. I removed the description part of this data, renamed past.arrests as arrests, saved it as frisk....

Is it possible to use the Google Analytics API to provide stats for customer's page views?

Let's say I run a site where customers are willing to pay for a page that shows some sort of cool info about them. The whole site is tracked using Google Analytics. To provide stat tracking for the customers, would it be possible to mine the data from Google Analytics, using the AJAX API? Are there any show-stoppers I should look out ...

How does Google Scribe work?

I'm asking about the new labs feature "Google Scribe." Here is the link: http://scribe.googlelabs.com/ I am interested in the backend and the frontend, but mainly the backend. I want to build something similar with a very specific data set (derived from my own documents). I think the frontend of it is fairly straightforward, and I co...

Constraint Satisfaction Problem: Choosing real numbers with certain characteristics

I have a set of n real numbers. I also have a set of functions, f_1, f_2, ..., f_m. Each of these functions takes a list of numbers as its argument. I also have a set of m ranges, [l_1, u_1], [l_2, u_2], ..., [l_m, u_m]. I want to repeatedly choose a subset {r_1, r_2, ..., r_k} of k elements such that l_i <= f_i({r_1, r_2, ...,...

Need to create a log-prob chart in .NET

I can create a chart with a logarithmic scale on one axis, but I need to have a probability scale on the other axis as well. Thanks in advance. ...

R Time series - having trouble making bollinger lines - need simple example please

Learning R language - I know how to do a moving average but I need to do more - but I am not a statistician - unfortunately all the docs seem to be written for statisticians. I do this in excel a lot, it's really handy for analysis of operational activities. Here are the fields on each row to make bollinger bands: Value could be # of ...

Probability of occuring of An infinitely often

IF probability of occurance of A , infinitely often is 0, does it mean probability of occurance of A complement , infinitely often is 1??? ...

Subversion and Stats

If I wanted to see how productive I had been over a 6 month period is there any commands or tools I can run over a SVN repo to get this kind of information? by user (so I can filter to myself) I would like to see things like lines committed by me lines removed by me some kind of churn stat etc... Looking for an easy method. ...

PCA: What's wrong with this algorithm?

Can someone please either confirm or correct this Wikipedia algorithm for computing the first principal component? I want a simple implementation of PCA in D, which doesn't have any existing libraries for PCA AFAIK. I've tried implementing this, and it doesn't seem like my results on simple examples match stuff I get from R or Octave. ...

Random numbers probability

Hi, I am trying to randomly choose from e.g. 4 numbers. I need to compare the probability of these 2 algorithms. 1# int a = random.Next(0, 4); if (a = 0) statement1 if (a = 1) statement2 if (a = 2) statement3 ...

Should I compute statistics on the fly, or generate with a cron job?

Howdy, I have a simple enough web application. I want to measure for any day or month how many new free signups I have, how many paid signups, how many paid upgrades, how many cancellations, etc. That data will then be represented on my admin dashboard by sparklines. Generally, do you suggest: a) Writing a script that upon each call, ...

creating statistics files for users

My software performs a number of jobs throughout the day and I would like to make it possible for my users to examine statistics about these jobs. For example, the data for a job are its results (fail/pass), run duration, user name who started the job, description and an error string. Ideally I would like to simply create a daily log f...