Next: 2.4.5 Local Minimum Problem
Up: 2.4 Backpropagation Neural Networks
Previous: 2.4.3 Backpropagation Processing Unit
The backpropagation algorithm trains a given feed-forward
multilayer neural network for a given set of input patterns
with known classifications. When each entry of the sample set
is presented to the network, the network examines
its output response to the sample input pattern. The output response is then
compared to the known and desired output and the error value is calculated.
Based on the error, the connection weights are adjusted. The backpropagation
algorithm is based on Widrow-Hoff delta learning rule in which the
weight adjustment is done through mean square error of the output
response to the sample input [Vel98]. The set of these
sample patterns are repeatedly presented to the network until the error
value is minimized.
Refer to the figure 2.12 that illustrates the
backpropagation multilayer network with
layers.
represents the
number of neurons in
th layer. Here, the network is presented the
th
pattern of training sample set with
-dimensional input
and
-dimensional known output response
. The actual response to the input pattern by the network
is represented as
. Let
be the
output from the
th neuron in layer
for
th pattern;
be the
connection weight from
th neuron in layer
to
th neuron in
layer
; and
be the error value associated with the
th
neuron in layer
.
Figure 2.12:
Backpropagation Neural Network
 |
The following is the outline of the backpropagation learning
algorithm [BJ91]:
- Initialize connection weights into small random values.
- Present the
th sample input vector of pattern
and the corresponding output target
to the network.
- Pass the input values to the first layer, layer 1. For every input
node
in layer 0, perform:
- For every neuron
in every layer
, from input to
output layer, find the output from the neuron:
where
- Obtain output values. For every output node
in layer
, perform:
- Calculate error value
for every neuron
in every
layer in backward order
, from output to input
layer, followed by weight adjustments.
For the output layer, the error value is:
 |
(2.10) |
and for hidden layers:
 |
(2.11) |
The weight adjustment can be done for every connection from neuron
in layer
to every neuron
in every layer
:
 |
(2.12) |
where
represents weight adjustment factor normalized between 0
and 1. The derivation of the equations above will be discussed soon.
The actions in steps 2 through 6 will be repeated for every training
sample pattern
, and repeated for these sets until the root mean
square (RMS) of output errors is minimized.
We now attempt to derive the error and weight adjustment equations
shown above. Let's begin with the Root Mean Square (RMS) of the
errors in the output layer defined as:
 |
(2.13) |
for the
th sample pattern. In generalized delta rule
[BJ91,Day90,Gur97], the error value
associated with the
th neuron in layer
is the rate of change in the
RMS error
respect to the sum-of-product of the neuron:
 |
(2.14) |
where
represents the sum-of-product value. With the chain rule,
we can obtain the rate of change in the RMS error
in response to
weight change:
We can say that the weight change is proportional to this value above
[BJ91].
 |
(2.15) |
where
is a constant.
Thus, weight change can be performed as:
 |
(2.16) |
which should match equation (2.12).
Now let's get back to the equation (2.14) to find an error
value associate with the neuron. Again, using the chain rule, we get:
 |
(2.17) |
For output layer,
and
. Thus,
Using equation (2.9),
This should correspond with equation (2.10). For error values
associated with the hidden layer neurons, we cannot use target values.
For this reason, the part
/
in equation
(2.21) needs to be found using a different approach. We use the
chain rule applied to the sum-of-product values of neurons in the front layer
(layer
).
Finally, combined with
/
we get:
This should concur with equation (2.11).
Next: 2.4.5 Local Minimum Problem
Up: 2.4 Backpropagation Neural Networks
Previous: 2.4.3 Backpropagation Processing Unit
Kiyoshi Kawaguchi
2000-06-17