statistics

How to get the temperature of motherboard of a PC (and other hardware statistics)?

Hello, does any one know how to get the current motherboard, processor or HD temperature statistics? In GNU/Linux, I know I could use something like hddtemp or sensord to get the info, and then parse it... but in Windows: How can I do this? And, Can it be done with with C# or Java or any other hight level programming language? Thanks! ...

Opensource Implementation of the Alias Method

I am doing a project at the moment, and in the interest of code reuse, I went looking for a library that can perform some probabilistic accept/reject of an item: i.e., there are three people (a, b c), and each of them have a probability P{i} of getting an item, where p{a} denotes the probability of a. These probabilities are calculated...

What does "Create Statistics" do in SQL Server 2005?

The Database Tuning Advisor is recommending that I create a bunch of statistics in my Database. I'm something of a SQL n00b, so this was the first time I'd ever come across such a creature. The entry in MSDN was a little obtuse - could someone explain what exactly this does, and why it's a good idea? ...

Recommend an Open Source .NET Statistics Library

I need to calculate averages, standard deviations, medians etc for a bunch of numerical data. Is there a good open source .NET library I can use? I have found NMath but it is not free and may be overkill for my needs. ...

Algorithm to score similarness of sets of numbers

What is an algorithm to compare multiple sets of numbers against a target set to determine which ones are the most "similar"? One use of this algorithm would be to compare today's hourly weather forecast against historical weather recordings to find a day that had similar weather. The similarity of two sets is a bit subjective, so the ...

SO Technology@work survey

This is my one solitary (forgivable?) poll. To prove that I'm not being a rep !@&#&!!@(#*, I'm making this comm owned. What I want SO users to do is to upvote or create an answer that represents the technology they use at work that pays for their bills. Also if you're particularly co-operative, tag on a comment stating your work org...

Algorithm for finding the maximum difference in an array of numbers

I have an array of a few million numbers. double* const data = new double (3600000); I need to iterate through the array and find the range (the largest value in the array minus the smallest value). However, there is a catch. I only want to find the range where the smallest and largest values are within 1,000 samples of each other. S...

How do you measure if an interface change improved or reduced usability?

For an ecommerce website how do you measure if a change to your site actually improved usability? What kind of measurements should you gather and how would you set up a framework for making this testing part of development? ...

Methods for Geotagging or Geolabelling Text Content

What are some good algorithms for automatically labeling text with the city / region or origin? That is, if a blog is about New York, how can I tell programatically. Are there packages / papers that claim to do this with any degree of certainty? I have looked at some tfidf based approaches, proper noun intersections, but so far, no...

Table load via Partition Exchange (Oracle 10g)

I have a few questions about optimizing this type of load. One builds a new table of data to be loaded into a partitioned table and then builds the indexes on this new table. Should you build the index with the COMPUTE STATISTICS option or use the Cascade option of the DBMS_Stats? Should you gather stats on the table before the swap ...

What would be Linux equivalent of GetProcessIoCounters?

Here's a link to Windows documentation: http://msdn.microsoft.com/en-us/library/ms683218(VS.85).aspx. Basically I would like to get similar data, but on Linux. If not all is possible, then at least some parts. ...

SQL Server STATISTICS

So for this one project, we have a bunch of queries that are executed on a regular basis (every minute or so. I used the "Analyze Query in Database Engine " to check on them. They are pretty simple: select * from tablex where processed='0' There is an index on processed, and each query should return <1000 rows on a table with 1MM reco...

Statistical tools for programmers

I'm trying to evaluate the purchase of a statistical tool. This will be used in part by non-programming users (doing clinical studies) and in part by programmers, so I'm trying to find a good compromise between usability and automation. Of course, cost is an issue, but if I can build a solid case, we could probably buy a commercial packa...

Statistics book recomendations

As somebody having background in math - including probability and computer science. What are the steps to learn - or even more, to understand - statistics? What books are good? What books cover, the more advanced topics? ...

Latent Dirichlet Allocation, pitfalls, tips and programs

I'm experimenting with Latent Dirichlet Allocation for topic disambiguation and assignment, and I'm looking for advice. Which program is the "best", where best is some combination of easiest to use, best prior estimation, fast How do I incorporate my intuitions about topicality. Let's say I think I know that some items in the corpus a...

Books for learning the R language

I'm looking for resources to start exploring the R language (as was recommended in my question on Statistical Tools). I know there are some suggestions for basic online tutorials in this thread, but I'd like something a little bit more extensive. Any suggestions for good textbook like references, either online and in dead-tree form? ...

How do I programmatically calculate Poker Odds?

Hello, I'm trying to write a simple game/utility to calculate poker odds. I know there's plenty of resources that talk about the formulas to do so, but I guess I'm having trouble translating that to code. Particularly, I'm interested in Texas Hold-em ... I understand that there are several different approaches, one being that you can ...

Why are Wemf and Google Analytics telling me so different things?

The website works with Wemf and Google Analytics, but they are giving me very different results; WEMF counts 10 - 30 % more page views than Google Analytics... Why can this be? Google Analytics works only with JavaScript enabled, yes - but I don't think that's the reason for this difference. The site which should be counted is a famous ...

Best practice: How to track outbound links?

How do you track outbound links for your web sites, since the request is logged on the destination server, not yours? What method works best for an HTML site, a PHP or ASP.NET site, a template site, or a framework-based site? ...

Where can I find free and open data?

Sooner or later, coders will feel the need to have access to "open data" in one of their projects, from knowing a city's zip to a more obscure information such as the axial tilt of Pluto. I know data.un.org which offers access to the UN's extensive array of databases that deal with human development and other socio-economic issues. The ...