training-data

Is a neural network's response guaranteed on training data?

I'm trying to train an ANN (I use this library: http://leenissen.dk/fann/) and the results are somewhat puzzling: if I run the trained network on the same data used for training, the output is not what is specified in the training set, but a seemingly random number. For example, the first entry in the training file is something like ...

Writing an image processing application for analysis of satellite imagery

I have to start work on an application that analyzes satellite imagery to identify man-made structures. I would like to use C or Java for this. For the satellite imagery, I am planning to use Google Maps data. I have three questions: What is the best source for GIS data besides Google Maps/Earth? Best language to write such an application con...

Finding weather data to use for training a neural network

I'm looking for downloadable weather data I can use to train a neural network to do forecasting. Where can I find some? Basically things like temperature, humidity, wind speed/direction: anything and everything that might help a neural network do simple forecasting. ...

What is the best source for typo statistics?

I am looking for a reliable source that provides statistics on which keys are most frequently mistyped. For example: are "a" and "s" more commonly mistyped than "m" and "n"? If yes, what are the underlying data, i.e. "a" is mistyped instead of "s" when the previous letter is "o" in 25% of cases, but conversely "s" is mi...

General frameworks for preparing training data?

As a student of computational linguistics, I frequently do machine learning experiments where I have to prepare training data from all kinds of different resources, such as raw or annotated text corpora or syntactic treebanks. For every new task and every new experiment I write programs (normally in Python and sometimes Java) to extract the...

Where does the verification data go when training an ANN?

The need for having part of the training set used as verification data is straightforward, but I am not really clear on how and at what stage of the training it should be incorporated. Is it at the end of the training (after reaching a good minimum for the training data)? If so, what should be done if the verification data yields a big e...

SVM Classification - minimum number of input sets for each class

I'm trying to build an app that detects advertisement images on web pages. Once I detect them, I won't allow them to be displayed on the client side. From the help that I got here on Stack Overflow, I concluded that an SVM is the best approach for my goal, so I have coded an SVM and SMO myself. The dataset which I have got from ...

Training sets for AdaBoost algorithm

How do you find the negative and positive training data sets of Haar features for the AdaBoost algorithm? So say you have a certain type of blob that you want to locate in an image and there are several of them in your entire array - how do you go about training it? I'd appreciate a nontechnical explanation as much as possible. I'm new t...

Where to get background sample images for Haar training?

I need a collection of sample images (negative images, also called background images) for Haar training using OpenCV. I have read that I need a lot of them, about 5000 or 10000. Do you know where to get them? ...