machine-learning

How to combine False positives and false negatives into one single measure

I'am trying to measure the performance of a computer vision program that tries to detect objects in video. I have 3 different versions of the program which have different parameters. I've benchmarked each of this versions and got 3 pairs of (False positives percent, False negative percent). Now i want to compare the versions with each...

Similarity search between time series in Matlab. Possible ? I cant find R-tree implementation in matlab

Hi there, I would like to implement similarity search in matlab. I wanna to know is it possible ? My plan is to do use 2 popular similarity measurement which are Euclidean Distance and Dynamic Time Warping. Both of these will be applied on time series dataset. My question at this point is how can I evaluate both of these two measurem...

Hierarchical Task Network Planner in Python

Is anyone aware of a hierarchical task network planner implemented in Python or Java? I've found a few open source systems, but they're all seemingly dead projects and haven't been maintained in years. ...

Vetting Second Hand Knowledge in an Ontology

How would you assign objective certainties to statements asserted by different users in an ontology? For example, consider User A asserts "Bob's hat is blue" while User B asserts "Bob's hat is red". How would you determine if: User A and User B are referring to different people named Bob, and may or may not be correct. Both users are ...

Generating RDF From Natural Language

Are there any tools available for generating RDF from natural language? A list of RDFizers compiled by the SIMILE project only mentions one, the Monrai Cypher. Unfortunately, it seems to have been a proprietary tool developed by Monrai Technologies, which has since disappeared, and I can't find any download links. Has anyone seen anythin...

Resources about different Computer Science fields

I am going to begin my final year at university this September so I need to do a project for my dissertation. I had a look at the list with the projects suggested by the uni last year and I don't find any of them THAT interesting. That, combined to my "love" for "theoretical" computer science as a whole got me into thinking that it would...

Estimating the number of neurons and number of layers of an artificial neural network

I am looking for a method on how to calculate the number of layers and the number of neurons per layer. As input i only have the size of the input vector, the size of the output vector and the size of the trainig set. Usually the best net is determined by trying different net topologies and selecting the one with the least error. Unfort...

Dereferencing a Slot in a Matched Fact in Clips

How do you dereference a slot in a fact matched in the LHS of a rule? If a variable matches a fact, I can't find how to create further conditions that match slots within that fact. For example, in the code below, I want to print some text if there's a fact of the form "(do (action ?action))". However, ?action is itself a fact, and I ...

datamining metadata

I build a bunch of data mining models on training data that is located in different folders. For eg. for data in folder1 I build an SVM based model, for data in folder2 I build an naive bayes model. I have almost 100 such folders and each of the folders have different data ( read different attributes ). Is there a framework which enables...

Algorithm for matching 'noisy' names

I have an application which scrapes soccer results from different sources on the web. Team names are not consistent on different websites - eg Manchester United might be called 'Man Utd' on one site, 'Man United' on a second, 'Manchester United FC' on a third. I need to map all possible derivations back to a single name ('Manchester Unit...

Representing Natural Language as RDF

How much of the concepts conveyed in natural language is RDF/OWL able to represent? I'm still learning RDF and other semantic technologies, but as I currently understand it, information is typically represented as triples of the form (subject,predicate,object). So I can imagine how the sentence "Bob has a hat" might be represented. Howev...

Automated journalism

A webapp called StatSheet got funded today - http://techcrunch.com/2010/08/04/former-crunchies-finalist-statsheet-recieves-1-3-million-in-series-a/ They are doing 'automated journalism' - using computers to generate human-looking reports of sports games from the statistics http://www.guardian.co.uk/media/pda/2010/mar/30/digital-media...

What is the OpenCV svm type parameter

The opencv SVM implementation takes a parameter labeled as "SVM type" which must be used in the CVSVMParams structure used in training the SVM. All the explanation I can find is: // SVM type enum { C_SVC=100, NU_SVC=101, ONE_CLASS=102, EPS_SVR=103, NU_SVR=104 }; Anyone know what these different values represent? ...

IR vs Data mining vs ML

People often throw around the terms IR, ML, and data mining, but I have noticed a lot of overlap between them. From people with experience in these fields, what exactly draws the line between these? ...

Real world typo statistics?

Where can I find some real world typo statistics? I'm trying to match people's input text to internal objects, and people tend to make spelling mistakes. There are 2 kinds of mistakes: typos - "Helllo" instead of "Hello" / "Satudray" instead of "Saturday" etc. Spelling - "Shikago" instead of "Chicago" I use Damerau-Levenshte...

Multilabel AdaBoost for MATLAB

Hi, I am currently looking for a multilabel AdaBoost implementation for MATLAB or a technique for efficiently using a two-label implementation for the multilabel case. Any help in that matter would be appreciated. ...

A few implementation details for a Support-Vector Machine (SVM)

Hello, In a particular application I was in need of machine learning (I know the things I studied in my undergraduate course). I used Support Vector Machines and got the problem solved. Its working fine. Now I need to improve the system. Problems here are I get additional training examples every week. Right now the system starts trai...

MATLAB: single layer neural network

For the implementation of single layer neural network, I have two data files. In: 0.832 64.643 0.818 78.843 Out: 0 0 1 0 0 1 The above is the format of 2 data files. The target output is "1 for a particular class that the corresponding input belongs to and "0 for the remaining 2 outputs. The problem is as follows: Your ...

Guidelines require to solve a mathematical intensive (or machine learning?) problem

Hello, The problem I have has got me puzzled. The problem is explained below, There is a container, for example lets say which has a volume of "V". The container needs to be filled with various types of boxes, where each type has a unique size (volume), for example lets say Box Type A - has a volume of K Box Type B - has a volume of L...

Ways to improve the accuracy of a Naive Bayes Classifier?

I am using a Naive Bayes Classifier to categorize several thousand documents into 30 different categories. I have implemented a Naive Bayes Classifier, and with some feature selection (mostly filtering useless words), I've gotten about a 30% test accuracy, with 45% training accuracy. This is significantly better than random, but I want i...