machine-learning

How do you make this code more pythonic?

Could you guys please tell me how I can make the following code more pythonic? The code is correct. Full disclosure - it's problem 1b in Handout #4 of this machine learning course. I'm supposed to use newton's algorithm on the two data sets for fitting a logistic hypothesis. But they use matlab & I'm using scipy Eg one question i have...

The business of Artificial Intelligence

I'm putting together a presentation aimed towards entrepreneurs on the present state of industrial AI development, titled "The business of AI"; however, what little resources I have found on Google seems awfully outdated. So I turn to the nice folks on Stackoverflow: Of the present day used systems, which products do you consider good b...

Online AI research communities?

I understand, that most of the research communication in the field of AI is still done in the form of white papers, and conferences; but I refuse to believe there's nobody on the internet talking about it. So, I'd like to know what kind of forums / mailing lists would you recommend in the topic with high signal-to-noise ratio? Especiall...

Pointers to some good SVM Tutorial ...

Hi all, I have been trying to grasp the basics of Support Vector Machines, and downloaded and read many online articles. But still am not able to grasp it. I would like to know, if there are some nice tutorial sample code which can be used for understanding or something, that you can think of, and that will enable me to learn SVM B...

Weighted Decision Trees using Entropy

I'm building a binary classification tree using mutual information gain as the splitting function. But since the training data is skewed toward a few classes, it is advisable to weight each training example by the inverse class frequency. How do I weight the training data? When calculating the probabilities to estimate the entropy, do ...

How do I plot a classification graph of a SVM in R

I have an svm in R and I would now like to plot the classificaiton space for this machine. I have found some examples on the Internet, but I can't seem to make sense of them. My R script is as follows: library(e1071) day_of_week <- c(0,1,2,3,4,5,6) holiday <- factor( c(T, F, F, F, F, F, T) ) model <- svm(day_of_week, holiday) plot(mode...

Difference between a linear problem and a non-linear problem? Essence of Dot-Product and Kernel trick.

The kernel trick maps a non-linear problem into a linear problem. My questions are: 1. What is the main difference between a linear and a non-linear problem? What is the intuition behind the difference of these two classes of problem? And How does kernel trick helps use the linear classifiers on a non-linear problem? 2. Why is the dot ...

How hard is it to implement a chess engine?

I'm wondering how hard it would be to implement a chess engine. Are there already open-source implementations? It seems that you'd need a scoring function for a given board constellation, and a very fast way of exploring several likely future board constellations. Exploring all possible future moves is of course impossible, so one coul...

Algorithm for Hand writing recognition

Hello , Is there any specific algorithm for handwriting recognition? The algorithm should recognize the hand written letter. Any one could help would be greatly appreciated in advance. Thank you ...

What tried and true algorithms for suggesting related articles are out there?

Hi, Pretty common situation, I'd wager. You have a blog or news site and you have plenty of articles or blags or whatever you call them, and you want to, at the bottom of each, suggest others that seem to be related. Let's assume very little metadata about each item. That is, no tags, categories. Treat as one big blob of text, includi...

How to filter/sort/rank object model nodes?

I have some kind of object model and I need to filter and sort it's nodes for some kind of property. What kinds of automated systems exist to generate and select properties of the object model that correlate to what I want? (I'm intentionally being abstract and non-specific) I'm thinking of a system that works kind of like spam filters ...

What is the 11pt average precision metric?

Hello I have two questions 1- what is the "11pt average precision metric"? 2- and how to use it in information retrieval? Thanks ...

Is there a recommended package for machine learning in Python?

Is there a recommended package for machine learning in Python? I have previous experience in implementing a variety of machine learning and statistical algorithms in C++ and MATLAB, but having done some work in Python I'm curious about the available packages for Python. ...

Matlab - Neural network training

I'm working on creating a 2 layer neural network with back-propagation. The NN is supposed to get its data from a 20001x17 vector that holds following information in each row: -The first 16 cells hold integers ranging from 0 to 15 which act as variables to help us determine which one of the 26 letters of the alphabet we mean to express ...

How do I create a good evaluation function for a new board game?

I write programs to play board game variants sometimes. The basic strategy is standard alpha-beta pruning or similar searches, sometimes augmented by the usual approaches to endgames or openings. I've mostly played around with chess variants, so when it comes time to pick my evaluation function, I use a basic chess evaluation function....

Meaning of -fold

I saw this text while reading wikipedia(http://en.wikipedia.org/wiki/K-means%2B%2B) The authors tested their method with real and synthetic datasets and obtained typically 2-fold improvements in speed, and for certain datasets close to 1000-fold improvements in error. and I'm curious about the meaning of -fold, like 1000-fold, 2-f...

Using the Apache Mahout machine learning libraries

I've been working with the Apache Mahout machine learning libaries in my free time a bit over the past few weeks. I'm curious to hear about how others are using these libraries. ...

Matlab:K-means clustering

I have a matrice of A(369x10) which I want to cluster in 19 clusters. I use this method [idx ctrs]=kmeans(A,19) which yields idx(369x1) and ctrs(19x10) I get the point up to here.All my rows in A is clustered in 19 clusters. Now I have an array B(49x10).I want to know where the rows of this B corresponds in the among given 19 cluste...

When to use Genetic Algorithms and when to use Neural Networks?

Is there a rule of thumb or set of examples to determine when to use Genetic Algorithms versus when to use Neural Networks to solve a problem? I know there are cases in which you can have both methods mixed, but I am looking for a high level reasoning between the two methods. ...

Netflix prize dataset?

Hi, I am looking to work on a machine learning project for my course and I would like to use the netflix prize dataset? But it looks like the contest is closed and the dataset is not available for download in the netflix website. Does anyone who wokred on it has the dataset? If so ,can u share it? ...