statistics

Is there a chart that displays browser usage in different sectors, for instance local authorities?

Hi, I'm a frontend web developer working for a company at the moment that has a lot of RIA design going on in the backend but it also caters almost entirely for UK local authorities and social workers. For those of you not too enlightened about local authorities/councils here in the UK, they tend to be rather slow on the uptake of new ...

SAS Delimiting Character

We are loading a fixed-width text file into a SAS dataset. The character we are using to delimit multi-valued field values is being interpreted as 2 characters by SAS. This breaks things, because the fields are of a fixed width. We can use characters that appear on the keyboard, but obviously this isn't as safe, because our data could ...

Library for an interactive stats website

How would you go about coding an interactive website to display stats/graphs? Say I wanted to create something interactive for people to look at Stack Overflow stats - something that looks like AWStats / Google Analytics but allows you to drill down to stats/graphs like: All questions: total, by hour of day, by day of week (interesting...

Google Analytics LIKE Tool

I'm considering writing my own tool for tracking visitors/sales as Google Analytics and others are just not comprehensive enough in the data dept. They have nice GUIs but if you have SQL skills those GUIs are unnecessary. I'm wondering what the best approach is to do this. I could simply just log the IP, etc to a text file and then ha...

Inverted beta in MySQL

I need to implement an inverted beta function in MySQL (similar to Excel's BETAINV). Some related material is available on Wolfram MathWorld's Beta Distribution page. Any clues on where to start implementing this functionality in MySQL? ...

How to run statistics Cumulative Distribution Function and Probability Density Function using SciPy?

Hi Everybody, I am new to Python and new to the SciPy libraries. I wanted to take some cues from the experts here on the list before diving into the SciPy world. I was wondering if someone could provide a rough guide on how to run two stats functions: Cumulative Distribution Function (CDF) and Probability Density Function (PDF). My use...
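
As a rough starting point for the question above, here is a minimal sketch of evaluating a CDF and a PDF with `scipy.stats`. The excerpt is cut off before the use case, so the standard normal distribution and the evaluation point `0.0` are assumptions chosen purely for illustration.

```python
from scipy.stats import norm

# Assumption: a standard normal distribution (mean 0, standard deviation 1).
# Other distributions in scipy.stats expose the same cdf/pdf methods.
dist = norm(loc=0.0, scale=1.0)

# CDF: probability that a draw is less than or equal to the given value.
cdf_at_zero = dist.cdf(0.0)

# PDF: density of the distribution at the given value.
pdf_at_zero = dist.pdf(0.0)

print(f"CDF(0) = {cdf_at_zero:.4f}")
print(f"PDF(0) = {pdf_at_zero:.4f}")
```

Both methods also accept NumPy arrays, so a whole grid of points can be evaluated in one call.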

Is there an API for drawing statistics in OpenGL ES on iPhone OS?

If I wanted to draw pie charts and other nice stuff, are there libraries or APIs for doing this? ...

Most mature sparse matrix package for R?

There are at least two sparse matrix packages for R. I'm looking into these because I'm working with datasets that are too big and sparse to fit in memory with a dense representation. I want basic linear algebra routines, plus the ability to easily write C code to operate on them. Which library is the most mature and best to use? So ...

Linear regression confidence intervals in SQL

I'm using some fairly straightforward SQL code to calculate the coefficients of regression (intercept and slope) of some (x,y) data points, using least-squares. This gives me a nice best-fit line through the data. However we would like to be able to see the 95% and 5% confidence intervals for the line of best-fit (the curves below). ...
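
The calculation described above can be sketched outside SQL to show which sums are needed. This is a hedged illustration in Python, not the asker's SQL: the names `fit_line` and `confidence_band` are hypothetical, and the critical value `t_crit` is assumed to be looked up by the caller from a Student's t table with n - 2 degrees of freedom.

```python
import math
from typing import NamedTuple, Sequence, Tuple


class LineFit(NamedTuple):
    slope: float
    intercept: float
    resid_se: float  # residual standard error (n - 2 degrees of freedom)
    mean_x: float
    sxx: float       # sum of squared deviations of x from its mean
    n: int


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> LineFit:
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return LineFit(slope, intercept, math.sqrt(sse / (n - 2)), mean_x, sxx, n)


def confidence_band(x0: float, fit: LineFit, t_crit: float) -> Tuple[float, float]:
    """Confidence interval for the fitted line (mean response) at x0."""
    y_hat = fit.intercept + fit.slope * x0
    half_width = t_crit * fit.resid_se * math.sqrt(
        1.0 / fit.n + (x0 - fit.mean_x) ** 2 / fit.sxx
    )
    return y_hat - half_width, y_hat + half_width


# Made-up sample data; 3.182 is the two-sided 95% t value for 3 degrees of freedom.
fit = fit_line([1, 2, 3, 4, 5], [2.1, 3.9, 6.2, 7.8, 10.1])
low_mid, high_mid = confidence_band(3.0, fit, t_crit=3.182)
low_end, high_end = confidence_band(5.0, fit, t_crit=3.182)
```

The band is narrowest at the mean of x and widens toward the ends of the data, which produces the curved envelope around the best-fit line. Every quantity here is a sum over the rows, so the same terms can be computed with SQL aggregates.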

Predicting missing data values in a database

I have a database, consisting of a whole bunch of records (around 600,000) where some of the records have certain fields missing. My goal is to find a way to predict what the missing data values should be (so I can fill them in) based on the existing data. One option I am looking at is clustering - i.e. representing the records that ar...

How do you pause an R script / animation?

How do you pause an R script for a specified number of seconds or milliseconds? The intended purpose is for self-timed animations. The desired solution works without asking for user input. ...

How to efficiently calculate a running standard deviation?

I have an array of lists of numbers, e.g.: [0] (0.01, 0.01, 0.02, 0.04, 0.03) [1] (0.00, 0.02, 0.02, 0.03, 0.02) [2] (0.01, 0.02, 0.02, 0.03, 0.02) ... [n] (0.01, 0.00, 0.01, 0.05, 0.03) What I would like to do is efficiently calculate the mean and standard deviation at each index of a list, across all array elements. To do the ...
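
One common approach to the problem above is Welford's online algorithm, which updates the mean and variance one value at a time without storing the history. The sketch below is an illustration under assumptions: the names `RunningStats` and `column_stats` are hypothetical, the sample (n - 1) standard deviation is used, and only the three fully visible rows from the question are taken as input.

```python
import math
from typing import List, Sequence, Tuple


class RunningStats:
    """Welford's online algorithm for a running mean and standard deviation."""

    def __init__(self) -> None:
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean

    def push(self, x: float) -> None:
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    def std(self) -> float:
        # Sample standard deviation; defined as 0.0 for fewer than two values.
        return math.sqrt(self.m2 / (self.n - 1)) if self.n > 1 else 0.0


def column_stats(rows: Sequence[Sequence[float]]) -> List[Tuple[float, float]]:
    """Return (mean, std) for each index position across all rows."""
    stats = [RunningStats() for _ in rows[0]]
    for row in rows:
        for stat, value in zip(stats, row):
            stat.push(value)
    return [(s.mean, s.std()) for s in stats]


rows = [
    (0.01, 0.01, 0.02, 0.04, 0.03),
    (0.00, 0.02, 0.02, 0.03, 0.02),
    (0.01, 0.02, 0.02, 0.03, 0.02),
]
result = column_stats(rows)
for index, (mean, std) in enumerate(result):
    print(f"index {index}: mean={mean:.4f} std={std:.4f}")
```

Because each `RunningStats` object keeps only three numbers, new rows can be pushed as they arrive and the statistics read back at any point.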

How OWA works step by step

raster1 {{0,1},{1,1}} raster2 {{1,1},{0,0}} Hi, can you explain to me how the Ordered Weighted Average method works, step by step, given the two rasters above? Thanks ...

efficiently predicting the likelihood of a user clicking a hyperlink

Possible Duplicate: Determining the probability of a user clicking a hyperlink So I have a bunch of hyperlinks on a web page. From past observation I know the probabilities that a user will click on each of these hyperlinks. I can therefore calculate the mean and standard deviation of these probabilities. I now add a new hype...

Goodness of fit functions in R

What functions do you use in R to fit a curve to your data and test how well that curve fits? What results are considered good? ...

Does column order matter when creating statistics for Microsoft SQL Server?

The Database Engine Tuning Advisor has recommended the creation of some statistics for several of our queries. It turns out that some of these are the same, except that the column order is different in the CREATE STATISTICS command. For example: CREATE STATISTICS [StatName1] ON [dbo].table1 CREATE STATISTICS [StatName2] ON [dbo].table1 are the...

Hierarchical Bayes for R or Python

Hierarchical Bayes models are commonly used in Marketing, Political Science, and Econometrics. Yet, the only package I know of is bayesm, which is really a companion to a book (Bayesian Statistics and Marketing, by Rossi, et al.) Am I missing something? Is there a software package for R or Python doing the job out there, and/or a worked-...

Getting stats for Twitter app

I'm using Twitter's OAuth for my app (DroidIn). To my dismay, I can't find any way to track who is using the app and how often. Searching Twitter for "sent from DroidIn" does not yield any results. I suppose I can call some sort of counter app from my code but that doesn't seem to be fair to my users. Any ideas or suggestions? ...

MySQL / PHP: Date functions for page view statistics and popularity...

Hi, I have a table with the number of page views per day. Something like this:

+------+------------+------+
| id   | date       | hits |
+------+------------+------+
| 4876 | 2009-07-14 | 4362 |
+------+------------+------+
| 4876 | 2009-07-15 | 1324 |
+------+------------+------+
| 7653 | 2009-06-09 | 5643 |
+------+------------+------+
...

Visualizing Data in Java

Hi, is there a good library on the market for visualizing large datasets in Java? Maybe a library for statistical output. I know that the programming language R can visualize statistical data, and I have also seen a solution that connects Java and R. It would be better if I had a pure Java solution. ...