machine-learning

Java Decision Tree for Numeric Attributes & Classes

Hi, I'm looking for a java library for decision trees which accepts numeric attributes and classes/grades. Weka's J48 deals with discrete attributes but doesn't accept numeric ones. Thanks ...

Is libsvm accurate?

With StompChicken's corrections (I miscomputed one dot product, ugh!) the answer appears to be yes. I have since tested the same problem using a precomputed kernel with the same correct results. If you are using libsvm StompChickens clear, organized computations are a very nice check. Original Question: I am about to start using precom...

Aggregating Probabilistic Plans

I'm trying to create a simple STRIPS-based planner. I've completed the basic functionality to compute separate probabilistic plans that will reach a goal, but now I'm trying to determine how to aggregate these plans based on their initial action, to determine what the "overall" best action is at time t0. Consider the following example. ...

Machine Learning Library?

Which libraries do you use for machine learning for C/C++, Python, Java? ...

what is the bootstrapped data in data mining?

Hello there,recently I came across this term,but really have no idea what it refers to.I've searched online,but with little gain. Thanks. ...

need Advice on ID3 implementation and Datatype to be used

Need advice: I am implementing ID3 algorithm in Machine Learning. I am using dictionary to read the training file and store into. But as I am going forward I am understanding that in dictionary v dont have fixed places for each key,value pair as in list or array. Now I might have problem in getting the position of the final attribute and...

In ID3 implementation, at which point the recursion in Algorithm should stop.

In ID3 implementation, at which point the recursion in Algorithm should stop. ...

Python or SQL Logistic Regression

Given time-series data, I want to find the best fitting logarithmic curve. What are good libraries for doing this in either Python or SQL? Edit: Specifically, what I'm looking for is a library that can fit data resembling a sigmoid function, with upper and lower horizontal asymptotes. ...

What subjects, topics does a computer science graduate need to learn to apply available machine learning frameworks, esp. SVMs

I want to teach myself enough machine learning so that I can, to begin with, understand enough to put to use available open source ML frameworks that will allow me to do things like: Go through the HTML source of pages from a certain site and "understand" which sections form the content, which the advertisements and which form the meta...

What are good algorithms for detecting abnormality?

Background Here is the problem: A black box outputs a new number each day. Those numbers have been recorded for a period of time. Detect when a new number from the black box falls outside the pattern of numbers established over the time period. The numbers are integers, and the time period is a year. Question What algorithm will i...

Algorithm for online approximation of a slowly-changing, real valued function

Hi all, I'm tackling an interesting machine learning problem and would love to hear if anyone knows a good algorithm to deal with the following: The algorithm must learn to approximate a function of N inputs and M outputs N is quite large, e.g. 1,000-10,000 M is quite small, e.g. 5-10 All inputs and outputs are floating point values, ...

Newbie: where to start given a problem to predict future success or not

We have had a production web based product that allows users to make predictions about the future value (or demand) of goods, the historical data contains about 100k examples, each example has about 5 parameters; Consider a class of data called a prediciton: prediction { id: int predictor: int predictionDate: date p...

Learning the Structure of a Hierarchical Reinforcement Task

I've been studying hierachial reinforcement learning problems, and while a lot of papers propose interesting ways for learning a policy, they all seem to assume they know in advance a graph structure describing the actions in the domain. For example, The MAXQ Method for Hierarchial Reinforcement Learning by Dietterich describes a complex...

Appropriate Data structure

I have to implement the value iteration algorithm for finding the optimal policy for each state of an MDP using Bellman's equation. The input file is some thing like below: s1 0 (a1 s1 0.5) (a1 s2 0.5) (a2 s1 1.0) s2 0 (a1 s2 1.0) (a2 s1 0.5) (a2 s3 0.5) s3 10 (a1 s2 1.0) (a2 s3 0.5) (a2 s4 0.5) where s1 is the state 0 is the reward ass...

Machine learning/information retrieval project

Hi all, I’m reading towards M.Sc. in Computer Science and just completed first year of the source. (This is a two year course). Soon I have to submit a proposal for the M.Sc. Project. I have selected following topic. “Suitability of machine learning for document ranking in information retrieval system”. Researchers have been using va...

How to create a simple Gradient Descent algorithm

Hi everyone, i'm studying simple machine learning algorithms, beginning with a simple gradient descent, but i've got some trouble trying to implement it in python. here is the example i'm trying to reproduce, i've got data about houses with the (living area (in feet2), and number of bedrooms) with the resulting price : Living area (...

Detecting rare incidents from multivariate time series intervals

Given a time series of sensor state intervals, how do I implement a classifier which learns from supervised training data to detect an incident based on a sequence of state intervals? To simplify the problem, sensor states are reduced to either true or false. Update: I've found this paper (PDF) on Mining Sequences of Temporal Intervals ...

Implementing HexQ Algorithm

Does anyone know if there's an open source implementation (in any language) of the HexQ algorithm for hierarchy discovery in reinforcement learning, or something like it? I'd like to evaluate it in different domains but I'm having trouble understanding how to implement it from the paper's description. ...

Decision tree induction open-source code

I am preparing a task for computer vision class, which involves training a simple classifier after extracting features from images. Since machine learning is not the main topic here, I don't want students to implement a learning algirithm from scratch. So, I have to recommend them some reference implementations. I believe the decision tr...

Applying hidden Markov model to multiple simultaneous bit sequences

This excellent article on implementing a Hidden Markov Model in C# does a fair job of classifying a single bit sequence based on training data. How to modify the algorithm, or build it out (multiple HMMs?) to support the classification of multiple simultaneous bit sequences? Example Instead of classifying just one stream: double t1 =...