The cortical architecture for the model has been simplified and reduced to the minimum necessary configuration to account for the observed phenomena. Because the focus is on the two-dimensional organization of the cortex, each ``neuron'' in the model cortex corresponds to a vertical column of cells through the six layers of the primate cortex. The cortical network is modeled with a sheet of interconnected neurons and the retina with a sheet of retinal ganglion cells (figure 2). Neurons receive afferent connections from broad overlapping circular patches on the retina. The N × N network is projected on to a central region of the R × R retinal ganglion cells, and each neuron is connected to ganglion cells in an area of radius r around the projections. Thus, neurons at a particular cortical location receive afferents from a corresponding location on the retina. Since the LGN accurately reproduces the receptive fields of the retina, it has been bypassed for simplicity.
Each neuron also has reciprocal excitatory and inhibitory lateral connections with itself and other neurons. Lateral excitatory connections are short-range, connecting each neuron with itself and its close neighbors. Lateral inhibitory connections run for comparatively long distances, but also include connections to the neuron itself and to its neighbors.
The input to the model consists of 2-D ellipsoidal Gaussian patterns representing retinal ganglion cell activations (as in figures 2 and 3a ). For training, the orientations of the Gaussians are chosen randomly from the uniform distribution in the full range , and the positions are chosen randomly within the retina. The elongated spots approximate natural visual stimuli after the edge detection and enhancement mechanisms in the retina. They can also be seen as a model of the intrinsic retinal activity waves that occur in late pre-natal development in mammals (Bednar and Miikkulainen, 1998; Shatz, 1990). The RF-LISSOM network models the self-organization of the visual cortex based on these natural sources of elongated features.
The afferent weights are initially set to random values, and the lateral weights are preset to a smooth Gaussian profile. The connections are organized through an unsupervised learning process. At each training step, neurons start out with zero activity. The initial response of neuron (i,j) is calculated as a weighted sum of the retinal activations:
where is the activation of retinal ganglion (a,b) within
the anatomical receptive field (RF) of the neuron, is the corresponding
afferent weight, and is a piecewise linear approximation of
the sigmoid activation function.
The response evolves over a very short time scale through lateral
interaction. At each time step, the neuron combines the above
afferent activation with lateral excitation and
inhibition:
where Eij,kl is the excitatory lateral connection weight on the
connection from neuron (k,l) to neuron (i,j) , Iij,kl is the
inhibitory connection weight, and is the activity of
neuron (k,l) during the previous time step.
The scaling factors and determine
the relative strengths of excitatory and inhibitory lateral
interactions.
While the cortical response is settling, the retinal activity remains constant. The cortical activity pattern starts out diffuse and spread over a substantial part of the map (as in figure 3c ), but within a few iterations of equation 2, converges into a small number of stable focused patches of activity, or activity bubbles (figure 3d ). After the activity has settled, the connection weights of each neuron are modified. Both afferent and lateral weights adapt according to the same mechanism: the Hebb rule, normalized so that the sum of the weights is constant:
where stands for the activity of neuron (i,j) in the final activity bubble, wij,mn is the afferent or lateral connection weight (, E or I ), is the learning rate for each type of connection ( for afferent weights, for excitatory, and for inhibitory) and Xmn is the presynaptic activity ( for afferent, for lateral). The larger the product of the pre- and post-synaptic activity , the larger the weight change. Therefore, when the pre- and post-synaptic neurons fire together frequently, the connection becomes stronger. Both excitatory and inhibitory connections strengthen by correlated activity; normalization then redistributes the changes so that the sum of each weight type for each neuron remains constant.
At long distances, very few neurons have correlated activity and therefore most long-range connections eventually become weak. The weak connections are eliminated periodically, resulting in patchy lateral connectivity similar to that observed in the visual cortex. The radius of the lateral excitatory interactions starts out large, but as self-organization progresses, it is decreased until it covers only the nearest neighbors. Such a decrease is one way of varying the balance between excitation and inhibition to ensure that global topographic order develops while the receptive fields become well-tuned at the same time (Miikkulainen et al., 1997; Sirosh, 1995). Similar effects can be achieved by changing the scaling factors and over time, or by using connections whose sign depends upon the activation level of the neuron (as in Stemmler et al., 1995).