The first layer is the input layer and is designated
U0. It consists of an
array of photoreceptor units arranged in a
square. The
remaining layers are divided into four sets of
two layers each. Each of these
sets has a US layer and a UC layer, which
are given numbers to indicate
the set to which they belong. The flow of information
is along the path
.
The US1 layer has
units. Each unit in the U0
layer has 12 units corresponding to it in the
US1 layer. Each of these
units receives inputs from the corresponding units
in the U0 layer and from
the eight units surrounding it. Each of these
twelve units is trained to
respond to a pattern that represents a small portion
of a line segment. In
this way, the S1 units function as a collection
of line segment detectors
spread over the entire input array.
The UC1 layer consists of
units. The 12 bit
patterns recognized by the S1 units actually
correspond to 8 possible
orientations of line segments, so outputs from
equivalent S1 units are fed
to the same C1 units. Each C1 unit receives
inputs from S1 units
corresponding to a
square of receptor
units. This reduces the
dimensions of the layers from
to
.
There are similar arrangements between the succeeding
layers, so that the
US2 layer has
units,
the UC2 layer has
units, the US3 layer
has
,the UC3 layer has
,
the US4 layer has
, and the UC4 layer
has ten units, which are used
to indicate which of the digits has been presented
to the network. The
gradual reduction in the first two dimensions
of each layer reflects the
gradual increase in the proportion of the input
array which affects each unit.
The alternating reduction and increase in the
size of the third dimension
reflects encoding of a number of features in the
output from the previous layer
and the subsequent reduction of this number by
identifying features that are
equivalent for the purposes of digit recognition.
The units described above are all excitatory units. In addition, there are cells between each of the UC layers and the following US layer which have an inhibitory effect on the units in the following layer. A little arithmetic shows that there are more than 50000 units in the Neocognitron.
The Neocognitron is a feedforward network with the outputs from each layer (except the last) becoming the inputs to the next layer. It is not a fully connected network; there are no connections between units belonging to the same layer, and each unit is connected to some, rather than all, of the units in the previous layer. This means that the number of connections is less than that of a Backpropagation network of similar size and far less than that of a comparable Hopfield network. Even so, the Neocognitron has about 14 million connections.