views:

242

answers:

2

Backpropagation calculates dW (weight delta) per weight per pattern, so it's straightforward how to modify weights when doing stochastic training. How do I use it for batch training, though? Simply accumluate dW over the entire training set and then apply the modfication, or is there more to it?

A: 

Yes, just accumluate dW over the entire training set. At least that is how I coded it back in grad school...

rschuler
+1  A: 

You can do a lot with the different gradients from the different samples. That includes higher order information (approximate 2nd derivative) or conjugate gradient or natural gradient or ... :)

bayer