statistics

Traceroute comparison and statistics

I have a number of traceroutes that I need to compare against each other, but I don't know the best way to do it. I've been told that hash maps are a good technique, but I don't know how to implement them in my code. So far I have: FileInputStream fstream = new FileInputStream("traceroute.log"); // Get the object of DataInputStream DataI...

Conditionally colour data points outside of confidence bands in R

I need to colour data points that are outside of the confidence bands on the plot below differently from those within the bands. Should I add a separate column to my dataset to record whether the data points are within the confidence bands? Can you provide an example please? Example dataset: ## Dataset from http://www.apsnet.org...

What is the best way to track unique visitors?

Hello, I want to make a user counter that is as accurate as possible, excluding bots and clever users as much as possible. As far as I know, it can be done in several ways: IP (trouble with dynamic ones and proxies), cookies (with a session ID maybe, but they can be deleted or the browser can be changed), Flash cookies (not all users have it), any other ...

SPSS - sum of squares changes radically with slight model changes in ANOVA?

I have noticed that the sum of squares in my models can change fairly radically with even the slightest adjustment to my models. Is this normal? I'm using SPSS 16, and both models presented below used the same data and variables with only one small change - categorizing one of the variables as either a 2 level or 3 level variable. ...

Student's t distribution in JavaScript for Google Spreadsheet

Google Spreadsheets currently does not support the standard function TDIST - i.e. the Student's t-distribution. This function is critical for calculating p-values. It seems that this is related to the fact that no integral-using functions (AFAICT) are implemented either. However, Google Docs allows people to add and publish their own s...
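As a rough illustration of what such a user-defined function would have to compute, here is a minimal sketch of the Student's t tail probability. It is written in Python rather than JavaScript, and the function names `t_pdf` and `tdist` are hypothetical. It assumes the conventional spreadsheet argument order (x, degrees of freedom, tails) and integrates the density with composite Simpson's rule, which is only one of several possible approaches and is not tuned for extreme arguments.

```python
import math


def t_pdf(t, df):
    """Density of Student's t distribution with df degrees of freedom."""
    # Work in log space so large df does not overflow the gamma function.
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(t * t / df))


def tdist(x, df, tails=2, steps=2000):
    """Tail probability P(T > x) for x >= 0, multiplied by tails (1 or 2).

    Hypothetical helper: `steps` is the (even) number of Simpson intervals.
    """
    if x < 0 or tails not in (1, 2) or steps % 2 != 0:
        raise ValueError("need x >= 0, tails in (1, 2), and an even steps")
    # Composite Simpson's rule for the area under the density on [0, x].
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    # The density is symmetric, so the upper tail is 0.5 minus that area.
    return tails * max(0.5 - area, 0.0)
```

With one degree of freedom the distribution is the Cauchy distribution, which gives a simple closed-form value to check the integration against.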

YouTube visit count and chromeless player

I'm building a YouTube view counter to register how many visits a video receives from some web sites I made. I'd like my view count to be as similar as possible to YouTube stats. First, I'd like to have your opinion about the way YouTube counts a view; it seems that to avoid spam they don't say exactly when a view is counted. Second I ...

Statistics toolbox in Matlab

Is the default linear discriminant analysis (LDA) in Matlab the Fisher discriminant analysis? ...

Statistical approach to chess?

Reading about how Google solves the translation problem got me thinking. Would it be possible to build a strong chess engine by analysing several million games and determining the best possible move based largely (completely?) on statistics? There are several such chess databases (this is one that has 4.5 million games), and one could po...

How to change the icon in the title bar in R?

I just installed R 2.11.0-x64 onto my Windows 7 Professional machine. With my previous installations of R (2.10.1 32 bit was the most recent) the little icon that appeared in the title bar and in the taskbar at the bottom of Windows was the R "R." Now however, the icon almost looks like a small Windows Task Manager. I know this isn't ...

MatLab (stats, data processing)

I've got an m x m array. I want to apply Fisher discriminant analysis to it - the LDA in MatLab's stats toolbox isn't the Fisher one so I used the version provided by the supplementary toolbox stprtool package. http://cmp.felk.cvut.cz/cmp/software/stprtool/index.html How do I run my program? I don't really understand the input required:...

Where can I find a simple beta CDF implementation?

I need to use the beta distribution and the inverse beta distribution in my project. There is a quite good but complicated implementation in GSL, but I don't want to use such a big library only to get one function. I would like to either implement it on my own or link some simple library. Do you know any sources that could help me? I'm looking...

May we have Ruby and Rails performance statistics? We're persuading the business to use Rails!

We're convincing our Products officer that we want to use JRuby on Rails, and we're having a hard time coming up with some statistics which show that: Coding time is less using Rails vs. say Struts or Zend Framework or what have you. Ruby (and JRuby in particular) performance isn't horrible (anymore). Rails performance isn't bad either...

Referral links - how do they work?

Could you explain this to me? In stats I can find referral links and I am curious. How are they made? Is it placed somewhere in the HTTP request? ...

using R to estimate finite mixture model with underlying Markov process

Hello, my apologies if this is more of a statistics question than an R question. I am trying to estimate the following model in R.

y_t = mu0 (1 - S_t) + mu1 S_t + e_t
e_t ~ N(0, sigma_t^2)
sigma_t^2 = sigma_0^2 (1 - S_t) + sigma_1^2 S_t

where mu_t = mu0 if S_t = 0, mu_t = mu1 if S_t = 1, and S_t is a Markov process, either 0 or 1, ...
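To make the model above concrete, here is a minimal Python sketch that simulates data from it. It does not estimate anything; generated data with known parameters is simply a convenient way to check an estimation routine. The excerpt is cut off before the transition probabilities are given, so `p01` and `p10` (the chances of leaving state 0 and state 1), the starting state of 0, and the function name `simulate_switching` are all assumptions made for illustration.

```python
import random


def simulate_switching(n, mu0, mu1, sigma0, sigma1, p01, p10, seed=0):
    """Simulate y_t = mu0 (1 - S_t) + mu1 S_t + e_t, e_t ~ N(0, sigma_t^2).

    p01 and p10 are assumed transition probabilities (0 -> 1 and 1 -> 0);
    they are not specified in the excerpt.
    """
    rng = random.Random(seed)
    s = 0  # assumed starting state
    states, ys = [], []
    for _ in range(n):
        # The mean and variance both switch with the hidden state S_t.
        mu_t = mu0 * (1 - s) + mu1 * s
        var_t = sigma0 ** 2 * (1 - s) + sigma1 ** 2 * s
        ys.append(mu_t + rng.gauss(0.0, var_t ** 0.5))
        states.append(s)
        # Draw the next state of the two-state Markov chain.
        if s == 0:
            s = 1 if rng.random() < p01 else 0
        else:
            s = 0 if rng.random() < p10 else 1
    return states, ys


states, ys = simulate_switching(500, mu0=0.0, mu1=2.0,
                                sigma0=1.0, sigma1=3.0, p01=0.05, p10=0.10)
```

Keeping the simulated `states` alongside `ys` lets a fitted model's inferred regimes be compared against the true ones.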

Determining the chances of an event occurring when it hasn't occurred yet

A user visits my website at time t, and they may or may not click on a particular link I care about, if they do I record the fact that they clicked the link, and also the duration since t that they clicked it, call this d. I need an algorithm that allows me to create a class like this: class ClickProbabilityEstimate { public void r...

Efficient calculation of matrix cumulative standard deviation in R

I recently posted this question on the r-help mailing list but got no answers, so I thought I would post it here as well and see if there were any suggestions. I am trying to calculate the cumulative standard deviation of a matrix. I want a function that accepts a matrix and returns a matrix of the same size where output cell (i,j) is s...

How can I structure and recode messy categorical data in R?

I'm struggling with how to best structure categorical data that's messy, and comes from a dataset I'll need to clean. The Coding Scheme I'm analyzing data from a university science course exam. We're looking at patterns in student responses, and we developed a coding scheme to represent the kinds of things students are doing in their ...

Is Weblog Expert reliable?

Hi, my boss asked me if Weblog Expert (http://www.weblogexpert.com/lite.htm) is reliable in calculating the average time of the incoming visitors on a website. Since HTTP is a stateless protocol, I think that the average time might be something left to personal interpretation. Does anyone use Weblog Expert? Is the visitor's average t...

Cumulative Normal Distribution function in objective C

Anyone know of a good implementation of this whose license is compatible with non-free iPhone apps? As suggested in this question, Boost looks absolutely wonderful. But as best I can tell, it is only available in C++. http://stackoverflow.com/questions/2328258/cumulative-normal-distribution-function-in-c ...

Using LINQ and C#, how would I categorize a list of lists and get empty categories?

Having the following: var categories = new List<double> {10,20,30,40}; // Note the 40 here... var bundleA = new List<double> {10,20}; var bundleB = new List<double> {20,20,30}; var lots = new List<List<double>> {bundleA, bundleB}; var total = lots.Sum (l => l.Count); var res = from lot in lots from bundle in lot ...
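The idea of reporting every category, including ones with no matches, can be sketched as follows. This is Python rather than C#/LINQ, and because the query in the excerpt is truncated, what it should return is an assumption: here each category maps to its count and its share of all items. The function name `category_shares` is hypothetical.

```python
from collections import Counter

# Same data as in the question; 40 has no matching items.
categories = [10.0, 20.0, 30.0, 40.0]
bundle_a = [10.0, 20.0]
bundle_b = [20.0, 20.0, 30.0]
lots = [bundle_a, bundle_b]


def category_shares(categories, lots):
    """Map each category to (count, share of all items), keeping empty ones."""
    counts = Counter(value for lot in lots for value in lot)
    total = sum(len(lot) for lot in lots)
    result = {}
    # Iterate over the categories, not the data, so empty ones still appear.
    for category in categories:
        count = counts.get(category, 0)
        result[category] = (count, count / total if total else 0.0)
    return result


shares = category_shares(categories, lots)
```

Driving the loop from the category list is what keeps the empty category in the output; grouping the data alone would drop it, which is the usual reason a left outer join is needed in a query-style solution.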