machine-learning

Implementing a linear, binary SVM (support vector machine)

I want to implement a simple SVM classifier, in the case of high-dimensional binary data (text), for which I think a simple linear SVM is best. The reason for implementing it myself is basically that I want to learn how it works, so using a library is not what I want. The problem is that most tutorials go up to an equation that can be s...

How are neural networks used when the number of inputs could be variable?

All the examples I have seen of neural networks are for a fixed set of inputs which works good for images and fixed length data. How do you deal with variable length data such sentances, queries or source code. Is there a way to encode variable length data into fixed length inputs and still get the generalization properties of neural n...

Any python Support Vector Machine library around that allows online learning?

I do know there are some libraries that allow to use Support vector Machines from python code, but I am looking specifically for libraries that allow one to teach it online (this is, without having to give it all the data at once). Are there any? ...

Question About Using Weka, the machine learning tool

I'm using the explorer feature of Weka for classification. So I have my .arff file, with 2 features of NUMERIC value, and my class is a binary 0 or 1 (eg {0,1}). Sample: @RELATION summary @ATTRIBUTE feature1 NUMERIC @ATTRIBUTE feature2 NUMERIC @ATTRIBUTE class {1,0} @DATA 23,11,0 20,100,1 2,36,0 98,8,1 ..... I load this .arff file,...

What kind of learning algorithm would you use to build a model of how long it takes a human to solve a given Sudoku situation?

I don't have much experience in machine learning, pattern recognition, data mining, etc. and in their underlying theory and systems. I would like to develop an artificial model of the time it takes a human to make a move in a given Sudoku puzzle. So what I'm looking for as an output from the machine learning process is a model that can...

Information mining, classification, modification

Any examples, tips, guidance for the following scenario? I have retrieved updates from several different news websites. I then analyse that information to predict on current trend in the world. I could only find the information on data mining when searching for above idea, but it is for database systems. While data mining is similar to...

How to recognize rectangles in this image?

Hi, I have a image with horizontal and vertical lines. In fact, this image is the BBC website converted to horizontal and vertical lines. My problem is that I want to be able to find all the rectangles in the image. I want to write a computer program to find all the rectangles. Does anyone know how to do this or suggest ideas on how to ...

What is the difference between supervised learning and unsupervised learning?

In terms of artificial intelligence and machine learning. Can you provide a basic, easy explanation with an example? ...

Machine learning challenge: diagnosing program in java/groovy (datamining, machine learning)

Hi All! I'm planning to develop program in Java which will provide diagnosis. The data set is divided into two parts one for training and the other for testing. My program should learn to classify from the training data (BTW which contain answer for 30 questions each in new column, each record in new line the last column will be diagnos...

Genetics algorithms theoretical question

Hi All! I'm currently reading "Artificial Intelligence: A Modern Approach" (Russell+Norvig) and "Machine Learning" (Mitchell) - and trying to learn basics of AINN. In order to understand few basic things I have two 'greenhorn' questions: Q1: In a genetic algorithm given the two parents A and B with the chromosomes 001110 and 101101, r...

Resample Filter of WEKA - How to interpret the result

Dear all, I am currently strugeling with a machine learning problem whereas I have to deal with great unbalanced data sets. That is, there are six classes ('1','2'...'6'). Unfortunately there are e.g. for class '1' 150 examples/instances, for '2' 90 instances and for class '3' only 20. All other classes can't be "trained" since there ar...

How to pick a language for Artificial Intelligence Programming?

Hi, what is the best programming language for artificial intelligence purposes? Mind that using suggested language I must be able to employ any AI technique (or at least most of them). Thanks. ...

What is the difference between Foward-backward algorithm and viterbi algorithm?

What is the difference between Foward-backward algorithm on n-gram model and viterbi algorithm on HMM model? When I review the implementation of these two algorithms, only thing I found is that the transaction probability is coming from different probabilistic models. Is there a difference between these 2 algorithms? ...

MATLAB - usage of knnclassify

When doing: load training.mat training = G load testing.mat test = G and then: >> knnclassify(test.Inp, training.Inp, training.Ltr) ??? Error using ==> knnclassify at 91 The length of GROUP must equal the number of rows in TRAINING. Since: >> size(training.Inp) ans = 40 40 2016 And: >> length(trainin...

Training Hidden Markov Models without Tagged Corpus Data

For a linguistics course we implemented Part of Speech (POS) tagging using a hidden markov model, where the hidden variables were the parts of speech. We trained the system on some tagged data, and then tested it and compared our results with the gold data. Would it have been possible to train the HMM without the tagged training set? ...

How to do multi class classification using Support Vector Machines (SVM).

Hello, In every book and example always they show only binary classification (two classes) and new vector can belong to any one class. Here the problem is I have 4 classes(c1, c2, c3, c4). I've training data for 4 classes. For new vector the output should be like C1 80% (the winner) c2 10% c3 6% c4 4% How to do th...

...

    Reconstruct scene like Photosynth

    Is there any open source code which does similar to MS Photosynth Photosynth is a potent mixture of two independent breakthroughs: the ability to reconstruct the scene or object from a bunch of flat photographs, and the technology to bring that experience to virtually anyone over the Internet. Using techniques from t...

    What does dimensionality reduction mean?

    What does dimensionality reduction mean exactly? I searched for its meaning, I just found that it means the transformation of raw data into a more useful form. So what is the benefit of having data in useful form, I mean how can I use it in a practical life (application)? ...

    Calculating Mutual Information For Selecting a Training Set in Java

    Scenario I am attempting to implement supervised learning over a data set within a Java GUI application. The user will be given a list of items or 'reports' to inspect and will label them based on a set of available labels. Once the supervised learning is complete, the labelled instances will then be given to a learning algorithm. Thi...