Backpropagation calculates dW
(weight delta) per weight per pattern, so it's straightforward how to modify weights when doing stochastic training. How do I use it for batch training, though? Simply accumluate dW
over the entire training set and then apply the modfication, or is there more to it?
views:
242answers:
2
A:
Yes, just accumluate dW over the entire training set. At least that is how I coded it back in grad school...
rschuler
2010-01-26 14:32:58
+1
A:
You can do a lot with the different gradients from the different samples. That includes higher order information (approximate 2nd derivative) or conjugate gradient or natural gradient or ... :)
bayer
2010-01-26 19:39:55