Weighted Decision Trees using Entropy | ansaurus

tags:

views:

199

answers:

2

+2 Q:

Weighted Decision Trees using Entropy

I'm building a binary classification tree using mutual information gain as the splitting function. But since the training data is skewed toward a few classes, it is advisable to weight each training example by the inverse class frequency.

How do I weight the training data? When calculating the probabilities to estimate the entropy, do I take weighted averages?

EDIT: I'd like an expression for entropy with the weights.

+1 A:

Robert Harvey 2009-07-15 18:19:41

Yes, I realized this. I was hoping for a weighted version of entropy. I use various entropy estimates to calculate a scores similar to mutual information.

Jacob 2009-07-17 20:09:52

A:

State-value weighted entropy as a measure of investment risk.
http://www56.homepage.villanova.edu/david.nawrocki/State%20Weighted%20Entropy%20Nawrocki%20Harding.pdf

Robert Harvey 2009-07-17 20:16:58

related questions

Best approach to what I think is a machine learning problem

Do People Actually Use Machine Learning?

What's the best open-source Java Bayesian spam filter library?

How do I group objects in a set by proximity?

What machine learning benchmarks are out there?

Neural Network Structure

Way to infer the size of the userbase of a site from sampling taken usernames

cool project to use a genetic algorithm for?

Canonicalize NFL team names.

Measuring the performance of classification algorithm

Techniques for building recommendation engines?

Bayesian networks tutorial

Parallel programming course or classification and prediction course?

Mathematics for AI/Machine learning ?

How does the Google "Did you mean?" Algorithm work?

Graduate Level Degree for Simulation/Statistics/Prediction?

Find class probabilities in matlab PNN and make ROC plot

What are some practical applications for a single layer perceptron?

What are known uses of AI in web development?

Machine Learning, AI, and Soft Computing

Will I Need a Computer Science Education for Soft Computing/Machine Learning?

Anyone Recommend a Good Tutorial on Conditional Random Fields

Suggestion needed to learn Machine Learning and Information Retrieval

How Do You Categorize Based On Text Content?

How to robustly, but minimally, distribute items across a peer-to-peer system