statistics

Lines counted raw or nonblank noncomment

When the size of a code base is reported in lines, is it more usual/standard to report raw wc count, or nonblank noncomment lines? I'm not asking which measure should be used, only, if I see a number given with no other information, which measure it is at best guess more likely to be. ...

Google Analytics - how does it track clicks?

So I'd just like help understanding how exactly google analytics collects this info. I have a homepage and the statistics say that 46% of clicks are on the logo (which links to the homepage). Does google analytics just assume that if they've ended up on the homepage they have clicked there? How accurate is this measurement? I wonder be...

Stat Counter Not accurate all of a sudden on my site

Stat Counter Not showing Accurate Hits Webiste: www.curbappealconcepts.com I have pasted xhtml code into the head tags of my site.master page for the stat counter. This site was built with Visual Studio 2010 in C# ASP.NET I dont know if any of this matters just wanted to be thorough. The stat counter im using is from www.statcounte...

How to select standard deviation within a row? (in SQL - or R :)

Dear all, I wonder if there is a way to select the standard deviation from several integer fields in MySQL within the same row. Obviously, if I use SELECT STDDEV(col1) FROM mytable I just get the standard deviation of that particular column. Let´s assume I have a table like: id,somefield1,somefield2, integerfield1,integerfield2,inte...

Select an effective revision control

Hello, We are a company, working on small and some huge project. Sometimes on one project are more developers 5, 10, 20 etc. We are searching for an effective revision control system with best optional properties: Performance issues – one file size, complex directory structure Scalability – support for 5, 10, 25 developers, 10, 50, ...

automatically compare two series -Dissimilarity test

I have two series, series1 and series2. My aim is to find how much Series2 is different from Series1,on a bin to bin basis, (each bin represents a particular feature,) automatically/quantitatively. This image can be seen in its original size by clicking here. Series1 is the expected result. Series2 is the test/incoming series. I am pr...

Access.Log Analyzer

Just a quick question. I use Ubuntu as a server and as a desktop. Does anyone have any ideas for a nice access.log analyzer? It could be a program that I download to the desktop or the server. Thanks. ...

most popular social sites to link to? twitter, facebook, delicious, evernot? how to know?

I don't know if this could go on stack overflow, but I'm trying to figure out what the most popular social sites are, so that I can add only the most necessary, and not add 30 sites to the "share this" feature i'm adding to my website. My site is comprised of unique articles on history and science and stuff like that. How would I find t...

please help with statistics problem

Hi All, Let's say I have a contact manager system. There are notes associated to each contact made by employees. So, here's my quick example: ContactName, NoteCount John, 100 Rob, 10 Amy, 10 Chris, 10 How do i figure out the that 75% of contacts have 10 notes assoicated with them and that 25% of contacts have 100 notes associated w...

SQL Statistics and Indexes - How detailed are they?

Do the statistics (which help decide whether an index is to be used) take into account the number of rows per actual column value, or does it just use the average number of rows per value. Suppose I have a table with an bit column called active which has a million of rows, but with 99.99% set to false. If I have an index on this column,...

MYSQL user statistics query help?

I have a table (statistics) with the columns User, Date (datetime), Page and IP. I have this query: SELECT * FROM statistics WHERE page LIKE '%page_name' ORDER BY date DESC LIMIT 30. That's ok to display all the user that visited that page. But I would like to display unique ips per day, something like DISTINCT ip (per day) I don't k...

Looking to contribute to open source or ideas for something open source

I hold degrees in computer science and mathematics. Both of these fields are extremely interesting to me. I particularly like high performance computing, statistical computing, c++, distributed computing, and algorithms. I would very much like to experience open source programming but don't really have any ideas of my own and would like ...

How to get make stats in constant memory

I have a function, which creates some random numerical results. I know, that the result will be an integer in a (small, a - b approx 50) range a, b. I want to create a function which execute the above function let's say 1000000 times and calculates, how often the each result appears. (The function takes a random generator to produce the ...

Chi Square Test using Frequencies, Bins, CDF, Python

Hi, I am trying to write a chi square goodness-of-fit test for Beta distribution from scratch, without using any external functions. The code below reports '1' for a fit, even though kstest from scipy.stats returns a zero. Data is distributed normally, so my function should also return zero. import numpy as np from scipy.stats import ch...

Using MDX Correlation() function to find correlation of 2 dimentional members.

I'm trying to find the Pearson correlation of a single measure between two members of a dimension, but the msdn docs are a bit sparse. Specifically I have a cube with a fact count measure, a date dimension, and a tool dimension and I'd like to find the correlation of tool X and tool Y over the date dimension. ...

How do I find the number of ACTIVE MEMBERS on a forum?

I am putting together a Forum Stats website and I need to find the number of Active Members on several forums. Many have "Total members" listed but that doesn't help me. I'm considering "Active" as someone that has posted at least 5 times within 6 the last months. I'm really perplexed as to how this might be done. Any suggestions? ...

What happended to robot replay and are there any other services like that?

A couple of years ago I tried robot replay's service where I could, by adding a script on a site, replay what the users were doing. Mouse movement, scrolling, clicks etc. It was pretty cool and can be really useful. What happended to them? I can't go to their site anymore. Is there any other service like this? ...

Typo3: How to count page impressions on every page with an extension

I need to count the page impressions of every page on a typo3 site into the db. So I think I need an extension which is called on every page impression and increase a column 'impressions' in the db of the specific page. I'm new to typo3 and new to extension development as well. Is there a way to include an extbase-extension on every pa...

MLE for Naive Bayes in R

i am using naivebayes function of e1071 library of R like below: model <- naiveBayes(Species ~ ., data = iris) pred <- predict(model, iris[,]) my question is: how can i get maximum likelihood estimate for conditional probability distibution of this model? ...

Web site analyser

This is sort of a statistics question. I am looking for a website analyser, not quite like google analytics. I want the analyser to crawl the website itself and record all the data on a page. Images, size of image and so on. Even if it is just a library then its a start for me. Thanks ...