Neural Network Systems Techniques and Applications: Implementation
Techniques
by Cornelius T. Leondes (editor)
Published by Academic Press, San Diego, CA, 1998
401 pages, $70
reviewed by
Dan Simon
Innovatia Software
dansimon@innovatia.com
This book, the third volume
in its series, contains eight academically-oriented research monographs. Each
monograph discusses a different facet of neural network implementation.
The first contribution is
about recurrent neural networks (RNNs). RNNs are very general networks in that
their neurons contain feedforward, feedback, and lateral connections. This
chapter is an impressive treatise, highly mathematical and theoretical, and
contains some original and previously unpublished contributions. The chapter
views RNNs as a nonlinear control system, and as such may be particularly
interesting to readers of Control Engineering Practice. The authors use
rigorous mathematics to discuss the observability, identifiability, minimality,
and controllability of both continuous-time and discrete-time RNNs. The chapter
closes with a list of open problems related to the topics under discussion.
The second chapter is about
Botzmann machines, which are recurrent stochastic neural networks with binary
outputs. A Boltzmann machine without any hidden layers can be viewed as a
stochastic version of a Hopfield network. However, a Boltzmann machine can also
contain hidden layers. This type of network is particularly interesting from a
statistical point of view in that it can be used to estimate the parameters of
an exponential probability distribution. The authors discuss various ways of
training Boltzmann machines for this purpose. This chapter is more practical
than the first chapter in that it provides some simple numerical examples of
the training methods and draws conclusions based on the results.
The third chapter is an
excellent and comprehensive treatment of constructive training techniques for
classification and regression networks. Classification networks assign an input
to one of a number of distinct output classes, and regression networks provide
a continuous mapping between inputs and outputs. Constructive algorithms not
only optimize the typical network parameters (weights, thresholds, etc.) but
also determine the architecture of the network. These techniques typically have
the advantages of fast training times and good incremental learning of new
data. They also have some basis in biological neural learning. However, they
exhibit a tendency to overtrain. This chapter discusses several constructive
training algorithms, including some that add entire subnetworks, some that
prune an already-trained network, and some that use variable activation
functions to add further flexibility. The comprehensiveness of this chapter can
be seen by the 143 references listed at the end.
The fourth contribution
discusses the relatively new topic of modular neural networks. Modularity is a
fundamental engineering principle. A modular neural network is a group of
loosely connected neural networks (just as a neural network is a group of
loosely connected neurons). The authors survey various ways of connecting
neural networks and provide some experimental results. The use of modular
neural nets simplifies training, because if a task can be broken into several
simpler subtasks, then separate networks can be trained more easily to solve
the subtasks. The authors also touch on constructive training for modular
networks, in which new modules are added to a network during training.
The fifth chapter is an
extensive and excellent discussion of neural networks as associative memories.
The authors study the subject from a dynamic systems point of view, noting that
dynamic systems can contain equilibrium points. Stable equilibria are known as
attractors. An associative memory can be viewed as a dynamic system whose
attractors represent stored memory patterns. Hence the retrieval of a stored
pattern is not a direct mapping, but rather a process by which the network's
state reaches equilibrium. The authors present many associative memory
architectures, discuss their stability and convergence properties, and delve
into strategies for encoding associative memories from given desired
attractors. They also discuss the tradeoff between storage capacity and
error-correcting ability. Several interesting examples illustrate the
principles of this chapter.
The sixth study is perhaps
the most unusual in the book. The authors propose a logical and philosophical
basis for the design and construction of neural networks. They discuss the
reasons why neural networks are (or should be) designed as they are. They quote
ancient Chinese philosophers, dissect the meaning of induction, and finally end
up with a neural architecture named an Inductive Logic Unit (ILU). The chapter
concludes with a simulation of an ILU. Overall the chapter is far-sighted and
thought-provoking, although it may be difficult for most engineers to firmly
grasp.
The seventh contribution is
an impressively complete categorization of classification methods using neural
networks. The chapter is notable for its many examples. The authors review
methods of data analysis for determining the complexity of a classification
problem. They discuss methods by which data's separability can be determined.
They consider ways of selecting a classifier so as to provide the most
confidence in performance. Finally they survey various ways of monitoring the
stability and generalization ability of a classifier. Each of their discussions
is full of interesting and illustrative examples.
The final chapter
challenges the long-held assumption of neurons as simple processing elements.
Current neural networks use simple neurons. Most researchers agree that the
computational power of neural networks arises from the connection of many
simple neurons. This chapter presents biological evidence for much more complex
neurons, even neurons that change their operation under various circumstances.
Most of the chapter is spent in giving circuit models for such neurons.
Overall, this book would
make a valuable addition to most libraries (personal or institutional). Its
academic bent makes it more useful for research than practical application, but
its depth and breadth and leading-edge flavor will be of interest to many
neural network engineers.
Home Credentials Publications White Papers
ã
1999–2001 Innovatia. All Rights Reserved.
Email Address: dansimon@innovatia.com
Phone Number: (330)665-9629
Last Revised: March 13, 2001