I just began reading more about Markov chain generators today, and am really intrigued by the whole process of building one. From my understanding, the future state depends statistically on the states leading up to the present.

Example:

Hello World. Hello Dolly. Hello World.

"World" follows "Hello" ~66% of the time in that source.

If that is always the case, how then do you avoid outputting the same results each time? The statistical occurrences won't change with a static string, so am I right to assume that no variants will ever be generated unless the source data is modified in some way?

How could I get variations from a static source, honoring the statistical values yet allowing some flexibility? Using my example above, how do I allow my generator to follow "Hello" with "Dolly" when "Dolly" only follows "Hello" 33% of the time?

I guess what I'm asking is: how do I base the probability of my next selection on the statistical presence of the words that follow my present selection? That way, "Dolly" shows up 33% of the time and "World" shows up 66% of the time - or am I completely lost here?

+3  A: 

You use a random number generator to pick which path to go down. You have to save each state (which is really a history of the N previous items) along with the probabilities for that state. Then you pick a random number and use it to decide which state to transition to next.

In your example, with a Markov chain of order N=1, you would have a chain structure that looks something like this:

<start> -> Hello : 1.0

Hello -> World. : 0.66666
Hello -> Dolly. : 0.33333

Dolly. -> Hello : 1.0

World. -> <end> : 0.5
World. -> Hello : 0.5

If your current state is Hello, then your next possible states are "World." and "Dolly.". Generate a random number between 0 and 1 and choose "World." if it's less than 0.66666; otherwise choose "Dolly.".
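
Here's a minimal sketch of that in Python (my own, assuming whitespace tokenization and leaving out the <start>/<end> markers). Storing every observed successor, repeats included, makes a uniform random.choice equivalent to the weighted pick: "World." appears twice under "Hello", so it gets chosen about 66% of the time.

import random
from collections import defaultdict

def build_chain(text):
    # Map each word to the list of words observed to follow it.
    # Repeats are kept deliberately: chain["Hello"] ends up as
    # ["World.", "Dolly.", "World."].
    words = text.split()
    chain = defaultdict(list)
    for current, nxt in zip(words, words[1:]):
        chain[current].append(nxt)
    return chain

def generate(chain, start, max_words=10):
    # Walk the chain, sampling each successor uniformly from the
    # (repeated) follower list, which yields the weighted behavior.
    word, output = start, [start]
    for _ in range(max_words):
        followers = chain.get(word)
        if not followers:
            break
        word = random.choice(followers)
        output.append(word)
    return " ".join(output)

chain = build_chain("Hello World. Hello Dolly. Hello World.")
print(generate(chain, "Hello"))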

With an N=2 Markov chain, you get almost deterministic behavior with that input:

<start> <start> -> <start> Hello : 1.0

<start> Hello -> Hello World. : 1.0

Hello World. -> World. Hello : 0.5
Hello World. -> World. <end> : 0.5

World. Hello -> Hello Dolly. : 1.0

Hello Dolly. -> Dolly. Hello : 1.0

Dolly. Hello -> Hello World. : 1.0
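
The same sketch generalizes to order N by keying the table on a tuple of the last N words (again ignoring the <start>/<end> markers):

from collections import defaultdict

def build_chain(text, n=2):
    # Order-n chain: the state is the tuple of the last n words.
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - n):
        chain[tuple(words[i:i + n])].append(words[i + n])
    return chain

chain = build_chain("Hello World. Hello Dolly. Hello World.", n=2)
# chain[("Hello", "World.")] == ["Hello"] -- every state here has exactly
# one observed successor, which is why the order-2 model is almost
# deterministic on this input.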
Omnifarious
A: 

Two comments:

1) To generate samples from a random process, whether one choice is very likely (>50%) and others less so, you just need a weighted "coin flip": generate a random real number uniformly on [0,1), consider the possibilities in a fixed order, and keep a running sum of their probabilities. As soon as that sum exceeds your randomly chosen number, select that choice. If your choices have unnormalized probabilities (not summing to 1), first compute the sum of the probabilities s, then either divide them all by s or choose your random number on [0,s).
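
A sketch of that weighted flip (the helper name weighted_choice is my own, not from any library), working directly with unnormalized weights by drawing on [0,s):

import random

def weighted_choice(weights):
    # weights: dict of {choice: unnormalized probability}.
    s = sum(weights.values())
    r = random.uniform(0, s)  # random real on [0, s)
    running = 0.0
    for choice, w in weights.items():
        running += w
        if r < running:
            return choice
    return choice  # float rounding can leave r at the very top; fall back to the last choice

print(weighted_choice({"World.": 2, "Dolly.": 1}))  # "World." about 2/3 of the time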

2) To prevent overfitting when estimating your model from a small amount of sample training data (compared to the number of parameters), use Bayesian priors on the model parameters. For a really cool example of this, where the number of model parameters (history size) isn't fixed to any finite number in advance, see the Infinite HMM. If you don't use Bayesian methods, then you'll want to choose the history length appropriately for the amount of training data you have, and/or implement some ad-hoc smoothing (e.g. linear interpolation between an order-2 and order-1 model).
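
For the linear-interpolation option, a sketch (the table names p2 and p1 and the mixing weight lam are hypothetical; lam would normally be tuned on held-out data):

def interpolated_prob(word, history, p2, p1, lam=0.7):
    # history: tuple of the last two words.
    # p2[(w1, w2)] and p1[w2] each map a next word to its estimated probability.
    # Mixing the order-2 and order-1 estimates keeps an unseen two-word
    # history from forcing the probability to zero.
    bigram = p2.get(history, {}).get(word, 0.0)
    unigram = p1.get(history[-1], {}).get(word, 0.0)
    return lam * bigram + (1 - lam) * unigram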

wrang-wrang
