Suppose the network has Ni input units, denoted
,Nh hidden units,
,
and No output
units,
.
Put N =Ni + Nh + No.
The weights wij and thresholds
, are
initialized to some arbitrary values, subject
to
,and wii = 0.
The following procedure is repeated until the weights stabilise.
The input and output pairs are selected in some order for presentation to the network. The states of the input and output units are fixed at the values determined by each selected input and output pair, the states of the hidden units are set to some random initial values, and the temperature is set to some high value.
Simulated annealing is used to bring the network
to a steady state.
The hidden units are chosen in some random order
and the quantity
is computed, where the chosen
unit is the ith one, and
is the state
of the jth unit at time
t. A random number between and 1, denoted
, is also chosen.
The state of the chosen unit is udpated according
to the following rules:
After cycling through the units a number of times, the temperature parameter is reduced and the procedure is repeated. When the annealing schedule is completed, the network is run for a number of cycles while the states of pairs of units are examined, and the pairs which are both in the on state are recorded.
After all the input and output pairs have been presented a number of times, the recorded results from each presentation are used to compute pij, the proportion of the time that both the ith and jth units were on.
The exemplar patterns are now presented to the
network again, but this time
only the states of the input units are held fixed,
while the states of both the
hidden units and the output units are changed
according to the simulated
annealing process. Statistics are collected for
the computation of
, the proportion of the time that
both the ith and jth units
were on when the output units were free.
The weights are now updated. If
is
incremented by a fixed amount. If
is
decremented by a fixed amount.
Once the weights stabilize, the network can be used for recall. The states of the input units are fixed at the values determined by the selected input vector and the simulated annealing procedure is applied to the hidden units and the output units. When equilibrium is reached, the output vector can be read off the output units.