Training Neural Networks With GA Hybrid Algorithms
1 Introduction
Research interest in Artificial Neural Networks (ANNs) stems from the appealing properties they exhibit: adaptability, learning capability, and the ability to generalize. Nowadays, ANNs receive a lot of attention from the international research community, with a large number of studies concerning training, structure design, and real-world applications ranging from classification to robot control and vision [1].
Training is a capital process in supervised learning, in which a pattern set made up of pairs of inputs and expected outputs is known beforehand and is used to compute the set of weights that enables the ANN to learn it. One of the most popular training algorithms in the domain of neural networks is Backpropagation (the generalized delta rule) [2], a gradient-descent method. Other techniques, such as evolutionary algorithms (EAs), have also been applied to the training problem in the past [3, 4], trying to avoid the local minima that so often appear in complex problems. Although training is a main issue in ANN design, many other works are devoted to evolving the layered structure of the ANN or even the elementary behavior of the neurons composing it. For example, in [5] a definition of neurons, layers, and the associated training problem is analyzed by using parallel genetic algorithms; also, in [6] both the architecture of the network and its weights are evolved by using the EPNet evolutionary system. It is really difficult to survey this topic exhaustively; however, the work of Yao [7] represents an excellent starting point to get acquainted with the research in training ANNs.
The motivation of the present work is manifold. First, we want to offer a standard presentation of results that promotes and facilitates future comparisons. This sounds like common sense, but authors seldom follow standard rules for comparisons such as Prechelt's structured set of recommendations [8], a de facto standard for many ANN researchers. A second contribution is to include in our study not only the well-known Genetic Algorithm (GA) and Backpropagation, but also the Levenberg-Marquardt (LM) approach [9] and two additional hybrids. The potential advantages of using LM merit a detailed study. We have selected a benchmark from the field of Medicine, composed of three classification problems: diagnosis of breast cancer, diagnosis of diabetes in Pima Indians, and diagnosis of heart disease.
The remainder of the article is organized as follows. Section 2 introduces the
Artificial Neural Network computation model. Next, we give a brief description
of the algorithms under analysis (Section 3). The details of the experiments and
their results are shown in Section 4. Finally, we summarize our conclusions and
future work in Section 5.
[Fig. 1. Structure of a multilayer perceptron: an input layer, a hidden layer, and an output layer. Each neuron computes the weighted sum of its inputs A1, ..., AN (connection weights W1, ..., WN, plus a bias θ) and passes the result through an activation function f(x) to produce its output y.]
$$\mathrm{SEP} = 100 \cdot \frac{o_{\max} - o_{\min}}{P \cdot S} \sum_{p=1}^{P} \sum_{i=1}^{S} \left(t_i^p - o_i^p\right)^2 . \quad (1)$$
where $t_i^p$ and $o_i^p$ are, respectively, the $i$-th components of the expected vector and the actual current output vector for pattern $p$; $o_{\min}$ and $o_{\max}$ are the minimum and maximum values of the output neurons, $S$ is the number of output neurons, and $P$ is the number of patterns.
In classification problems we can use still another measure: the Classification Error Percentage (CEP). CEP is the percentage of incorrectly classified patterns, and it is a usual complement to either of the raw error values (SEP or the well-known MSE), since CEP reports the quality of the trained ANN in a high-level manner.
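As an illustrative sketch (our own code, not the paper's), both measures can be computed directly from Eq. (1) and the definition of CEP, assuming targets and outputs are stored as arrays of shape (P, S) and the predicted class is the output neuron with the largest value:

```python
import numpy as np

def sep(targets, outputs, o_min=0.0, o_max=1.0):
    """Squared Error Percentage, Eq. (1): 100 * (o_max - o_min) / (P*S) * sum of squared errors."""
    P, S = targets.shape
    return 100.0 * (o_max - o_min) / (P * S) * np.sum((targets - outputs) ** 2)

def cep(targets, outputs):
    """Classification Error Percentage: fraction of patterns whose predicted class
    (largest output) differs from the expected class, as a percentage."""
    wrong = np.argmax(outputs, axis=1) != np.argmax(targets, axis=1)
    return 100.0 * np.mean(wrong)
```

For example, with two patterns and two output neurons, a pattern can have a low squared error yet still be misclassified, which is why both measures are worth reporting.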
3 The Algorithms
In our study we use several algorithms to train ANNs: the Backpropagation algorithm, the Levenberg-Marquardt algorithm, a Genetic Algorithm, a hybrid of the Genetic Algorithm and Backpropagation, and a hybrid of the Genetic Algorithm and Levenberg-Marquardt. We briefly describe them in the following paragraphs.
3.1 Backpropagation
The Backpropagation algorithm (BP) [2] is a classical domain-dependent technique for supervised training. It works by measuring the output error, calculating the gradient of this error, and adjusting the ANN weights (and biases) in the descending gradient direction. Hence, BP is a gradient-descent local search procedure, expected to stagnate in local optima on complex landscapes.
First, we define the squared error of the ANN for a set of patterns:
$$E = \sum_{p=1}^{P} \sum_{i=1}^{S} \left(t_i^p - o_i^p\right)^2 . \quad (2)$$
The actual value of the previous expression depends on the weights of the network. The basic BP algorithm (without momentum in our case) calculates the gradient of E (over all the patterns in our case) and updates the weights by moving them along the gradient-descent direction. This can be summarized with the expression ∆w = −η∇E, where the parameter η > 0 is the learning rate that controls the learning speed. The pseudo-code of the BP algorithm is shown in Fig. 2.
InitializeWeights;
while not StopCriterion do
    for all i, j do
        w_ij := w_ij − η · ∂E/∂w_ij;
    endfor;
endwhile;

Fig. 2. Pseudo-code of the BP algorithm
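As a hedged numeric sketch (our own illustrative code, not the paper's implementation), one full-batch BP epoch for a one-hidden-layer MLP with sigmoid units, applying ∆w = −η∇E to the error of Eq. (2), could look like this:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def bp_epoch(X, T, W1, b1, W2, b2, eta=0.01):
    """One full-batch gradient-descent step on E = sum over patterns of (t - o)^2.
    X: inputs (P, n_in); T: targets (P, n_out); weights/biases updated in place."""
    # Forward pass
    H = sigmoid(X @ W1 + b1)              # hidden activations, shape (P, n_hidden)
    O = sigmoid(H @ W2 + b2)              # outputs, shape (P, n_out)
    # Backward pass: deltas for the squared error (no momentum)
    dO = 2.0 * (O - T) * O * (1.0 - O)    # delta at the output layer
    dH = (dO @ W2.T) * H * (1.0 - H)      # delta back-propagated to the hidden layer
    # Move the weights along the descending gradient direction
    W2 -= eta * H.T @ dO
    b2 -= eta * dO.sum(axis=0)
    W1 -= eta * X.T @ dH
    b1 -= eta * dH.sum(axis=0)
    return np.sum((T - O) ** 2)           # squared error before the update
```

Calling `bp_epoch` repeatedly drives the error downward until the search stagnates, which is exactly the local-optimum behavior discussed above.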
3.2 Levenberg-Marquardt
The Levenberg-Marquardt algorithm (LM) [9] is an approximation to the Newton method that is also used for training ANNs. The Newton method approximates the error of the network with a second-order expression, in contrast to the Backpropagation algorithm, which uses a first-order expression. LM is popular in the ANN domain (it is even considered the first approach to try on an unseen MLP training task), although it is not that popular in the metaheuristics field.
LM updates the ANN weights as follows:
$$\Delta w = -\left[ \mu I + \sum_{p=1}^{P} J^p(w)^T J^p(w) \right]^{-1} \nabla E(w) . \quad (3)$$
where J^p(w) is the Jacobian matrix of the error vector e^p(w) evaluated at w, and I is the identity matrix. The error vector e^p(w) is the error of the network for pattern p, that is, e^p(w) = t^p − o^p(w). The parameter µ is increased or decreased at each step: if the error is reduced, µ is divided by a factor β; otherwise it is multiplied by β. Levenberg-Marquardt performs the steps detailed in Fig. 3. It calculates the network output, the error vectors, and the Jacobian matrix for each pattern. Then it computes ∆w using (3) and recalculates the error with w + ∆w as the network weights. If the error has decreased, µ is divided by β, the new weights are kept, and the process starts again; otherwise, µ is multiplied by β, ∆w is recalculated with the new value of µ, and the process iterates again.
InitializeWeights;
while not StopCriterion do
    Calculate e^p(w) for each pattern;
    e1 := Σ_{p=1..P} e^p(w)^T e^p(w);
    Calculate J^p(w) for each pattern;
    repeat
        Calculate ∆w;
        e2 := Σ_{p=1..P} e^p(w + ∆w)^T e^p(w + ∆w);
        if (e1 <= e2) then
            µ := µ · β;
        endif;
    until (e2 < e1);
    µ := µ / β;
    w := w + ∆w;
endwhile;

Fig. 3. Pseudo-code of the LM algorithm
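To make the µ/β damping scheme concrete, here is a hedged sketch (our own code, not the paper's) of the same loop applied to a small generic least-squares problem, fitting y = exp(a·x); the model, bounds, and tolerance are illustrative choices:

```python
import numpy as np

def lm_fit(x, y, w0, mu=1e-3, beta=10.0, iters=100, tol=1e-12):
    """Levenberg-Marquardt with the mu/beta update of Fig. 3, on f(x) = exp(a*x)."""
    w = np.asarray(w0, dtype=float)
    for _ in range(iters):
        r = np.exp(w[0] * x) - y                      # residual vector e(w)
        e1 = r @ r
        if e1 < tol:                                  # stop criterion
            break
        J = (x * np.exp(w[0] * x))[:, None]           # Jacobian of residuals, (P, 1)
        for _ in range(50):                           # damping loop (bounded for safety)
            # delta_w = -[mu*I + J^T J]^{-1} J^T e(w), cf. Eq. (3)
            dw = -np.linalg.solve(mu * np.eye(len(w)) + J.T @ J, J.T @ r)
            r2 = np.exp((w + dw)[0] * x) - y
            if r2 @ r2 < e1:                          # error decreased: accept step
                break
            mu *= beta                                # otherwise raise damping, retry
        mu /= beta                                    # relax damping after acceptance
        w = w + dw
    return w
```

Large µ makes the step behave like small-step gradient descent; small µ makes it behave like the (Gauss-)Newton step, which is what gives LM its speed near a solution.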
t := 0;
Initialize: P(0) := {a_1(0), ..., a_µ(0)} ∈ I^µ;
Evaluate: P(0) : {Φ(a_1(0)), ..., Φ(a_µ(0))};
while ι(P(t)) ≠ true do    // Reproductive loop
    Select: P'(t) := s_{Θs}(P(t));
    Recombine: P''(t) := ⊗_{Θc}(P'(t));
    Mutate: P'''(t) := m_{Θm}(P''(t));
    Evaluate: P'''(t) : {Φ(a'''_1(t)), ..., Φ(a'''_λ(t))};
    Replace: P(t+1) := r_{Θr}(P'''(t) ∪ Q);
    t := t + 1;
endwhile;
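The reproductive loop above can be sketched as a minimal binary GA (our own illustrative code, not the paper's): roulette selection, single-point crossover (SPX), bit-flip mutation, and elitist replacement. The OneMax fitness (count of 1 bits) is just a stand-in for evaluating an encoded set of ANN weights:

```python
import random

def ga(length=32, pop_size=20, pc=1.0, pm=None, generations=200, seed=1):
    rnd = random.Random(seed)
    pm = pm if pm is not None else 1.0 / length        # bit-flip rate 1/length
    fitness = lambda ind: sum(ind)                     # OneMax stand-in fitness
    pop = [[rnd.randint(0, 1) for _ in range(length)] for _ in range(pop_size)]
    best = max(pop, key=fitness)[:]
    for _ in range(generations):
        # Select: roulette wheel, proportional to fitness
        parents = rnd.choices(pop, weights=[fitness(i) for i in pop], k=pop_size)
        # Recombine: single-point crossover (SPX) with probability pc
        children = []
        for a, b in zip(parents[::2], parents[1::2]):
            if rnd.random() < pc:
                cut = rnd.randrange(1, length)
                children += [a[:cut] + b[cut:], b[:cut] + a[cut:]]
            else:
                children += [a[:], b[:]]
        # Mutate: independent bit flips with probability pm
        for c in children:
            for i in range(length):
                if rnd.random() < pm:
                    c[i] ^= 1
        # Replace: elitist replacement keeps the best individual found so far
        cand = max(children, key=fitness)
        if fitness(cand) > fitness(best):
            best = cand[:]
        pop = children
        pop[0] = best[:]
    return best
```

For real ANN training the bit string would encode the weights and the fitness would be the (negated) network error, but the loop structure is the same.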
4 Empirical Study
After discussing the algorithms, we present in this section the experiments per-
formed and their results. The benchmark for training and the parameters of the
algorithms are presented in the next subsection. The analysis of the results is
shown in Subsection 4.2.
4.1 Computational Experiments
We tackle three classification problems, which consist of determining the class that a given input vector belongs to. Each pattern in the training pattern set contains an input vector and its desired output vector, both formed by real numbers. However, in classification problems the output of the network must be interpreted as a class, and this interpretation can be performed in different ways [8]. One of them consists in assigning an output neuron to each class: when an input vector is presented to the network, the network response is the class associated with the output neuron with the largest value. This method is known as winner-takes-all, and it is the one employed in this work.
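With one output neuron per class, winner-takes-all reduces to an argmax over the output activations (illustrative values below, not taken from the experiments):

```python
import numpy as np

output = np.array([0.12, 0.81, 0.35])   # activations of three output neurons
predicted_class = int(np.argmax(output))  # index of the winning neuron
```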
The instances solved here belong to the Proben1¹ benchmark [8]: Cancer (diagnosis of breast cancer), Diabetes (diagnosis of diabetes in Pima Indians), and Heart (diagnosis of heart disease).
The MLP used for every problem has three layers (input, hidden, and output), with six neurons in the hidden layer. The number of neurons in the input and output layers depends on the concrete instance. The activation function of the neurons is the sigmoid function. Table 1 summarizes the network architecture for each instance.
To evaluate an ANN, we split the pattern set into two subsets: the training set and the test set. The ANN is trained by each algorithm using the training pattern set, and then it is evaluated on the unseen test pattern set. The training set for each instance consists of approximately the first 75% of the examples, while the last 25% constitutes the test set. The exact number of patterns for each instance is given in Table 1 to ease future comparisons.
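The split is a simple prefix/suffix partition; a minimal sketch for the Cancer instance (counts taken from Table 1, the pattern list itself is a stand-in) would be:

```python
# Proben1-style split: first ~75% of patterns for training, last 25% for test.
patterns = list(range(699))            # stand-in for the 699 Cancer patterns
n_train = 525                          # training count per Table 1
train, test = patterns[:n_train], patterns[n_train:]
```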
¹ Available from ftp://ftp.ira.uka.de/pub/neuron/proben1.tar.gz.

Table 1. MLP architecture and pattern distribution for all instances

Instance    Architecture    Training patterns    Test patterns
Cancer      9 - 6 - 2       525                  174
Diabetes    8 - 6 - 2       576                  192
Heart       35 - 6 - 2      690                  230

After presenting the problems, we now turn to the parameters of the algorithms (Table 2). To set the parameters of the pure algorithms, we performed some preliminary experiments and chose the values with the best results. The hybrid algorithms GABP and GALM use the same parameters as their elementary components. However, the mutation operator of the GA is not applied; instead, it is replaced by BP or LM, respectively. BP and LM are applied, with an associated probability pt, to only one individual generated after recombination at each iteration. When applied, BP/LM performs one single epoch.
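This hybridization rule can be sketched as follows (our own hedged illustration; `local_search_epoch` is a placeholder for one epoch of either BP or LM, not a function from the paper):

```python
import random

def maybe_refine(offspring, local_search_epoch, pt=1.0, rnd=random):
    """With probability pt, apply one epoch of local search (BP or LM)
    to exactly one individual generated after recombination."""
    if offspring and rnd.random() < pt:
        i = rnd.randrange(len(offspring))              # pick one offspring at random
        offspring[i] = local_search_epoch(offspring[i])  # one single epoch of BP/LM
    return offspring
```

Replacing mutation with a single local-search epoch keeps the GA's exploration while letting the problem-specific refiner exploit promising regions cheaply.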
Table 2. Parameters of the algorithms (BC = Cancer, DI = Diabetes, HE = Heart)

                                BC        DI        HE
BP      Epochs                  1000      1000      500
        η                       0.01      0.01      0.001
LM      Epochs                  1000      1000      500
        µ                       0.001     0.001     0.001
        β                       10        10        10
GA      Population size         64
        Selection               Roulette (2 inds.)
        Recombination           SPX (pc = 1.0)
        Mutation                Bit-flip (pm = 1/length)
        Replacement             Elitist
        Stop criterion          1064 evals.
GAxx    pt                      1.0       1.0       0.5
        Epochs of xx            1         1         1
A first conclusion is that the GA always obtains a higher CEP than BP, LM, and the hybrids (except for Heart with GABP). This is not surprising, since the GA performs a rather explorative search in this kind of problem. BP is slightly more accurate than LM on all the instances, which we did not expect after the accurate behavior of LM in other studies.
With respect to the hybrid algorithms, the results do confirm our working hypothesis: GALM is more accurate than GABP. In fact, this is noticeable since BP performed better than LM. Of course, we are not claiming that this holds for any ANN training problem. However, we do make a clear claim after these results: GABP has received too much attention from the community, while GALM might have worked out lower error percentages. To help the reader, we also display these results in a graph in Fig. 5.
We have traced the evolution of each algorithm on the Cancer instance to better explain how the different algorithms work (Fig. 6). We measure the SEP of the network at each epoch of the algorithm. For the population-based algorithms (GA, GABP, and GALM) we trace the SEP of the best-fitness network. Each trace line represents the average SEP over 50 independent runs. We can observe that LM is the fastest algorithm, followed by BP, which confirms the intuition on the speed of local search compared to GAs and hybrids. BP and LM clearly stagnate in a solution before 200 epochs. The GA is the slowest algorithm, and its hybridization with BP, and especially with LM, accelerates the evolution. An interesting observation is that the algorithms with the lowest SEP (BP and LM) do not always obtain the lowest CEP (best classification) on the test patterns. For example, GALM, which exhibits the lowest CEP, has only a modest SEP value during training. This is due to overtraining of the network by the BP and LM algorithms, and it confirms the necessity of reporting both ANN errors and classification percentages in this field of research.
[Figure: average SEP (y-axis, 0–25) versus epochs (x-axis, 0–1000) for GA, GABP, GALM, BP, and LM.]
Fig. 6. Average evolution of SEP for the algorithms on the Cancer instance
There are many interesting works related to neural network training that also solve the instances tackled here. Unfortunately, some of their results are not comparable with ours because they use a different definition of the training and test sets; this is why we consider it a capital issue to adhere to a standard way of evaluation such as the one proposed by Prechelt [8]. However, we did find some works allowing meaningful comparisons.
For the Cancer instance, the best mean CEP we found in the literature [22] is 1.1%, a lower accuracy than the 0.02% obtained with our GALM hybrid. In [23], a CEP close to 2% is achieved for this instance, so our GALM is roughly one hundred times more accurate. That work uses 524 patterns for the training set and the rest for the test set, that is, almost exactly our configuration with only one pattern changed (a minor detail), and therefore the results can be compared. The same holds for the work of Yao and Liu [6], where the EPNet algorithm produces neural networks of lower quality (1.4% CEP).
For the Diabetes instance, a CEP of 30.11% is reported in [24] (outperformed by our BP, LM, and GALM) with the same network architecture as in our work. In [6], a CEP of 22.4% is reported for this instance (outperformed by our BP with 21.76%).
Finally, [24] reports a CEP of 45.71% for the Heart instance using the same architecture. In this case, all our algorithms except GABP outperform that result.
In summary, while we have obtained some of the most accurate results reported for these three instances, further progress on other instances is still needed, always keeping in mind the importance of reporting results in a standardized manner.
5 Conclusions
In this work we have tackled the neural network training problem with five algorithms: two well-known problem-specific algorithms, Backpropagation and Levenberg-Marquardt; a general metaheuristic, the Genetic Algorithm; and two hybrid algorithms combining the Genetic Algorithm with the problem-specific techniques. To compare the algorithms, we solved three classification problems from the domain of Medicine: the diagnosis of breast cancer, the diagnosis of diabetes in Pima Indians, and the diagnosis of heart disease.
Our results show that the problem-specific algorithms (BP and LM) achieve a lower classification error than the genetic algorithm, numerically confirming what intuition can only suggest. The hybrid algorithm GALM outperforms the classification error of the problem-specific algorithms on two of the three instances. This makes GALM look like a promising algorithm for neural network training. Moreover, many of the classification errors obtained in this work are below those found in the literature, which represents a cutting-edge result. As future work we plan to add new algorithms to the analysis and to apply them to more instances, especially in the domain of Bioinformatics.
Acknowledgments
This work has been partially funded by the Ministry of Science and Technology and FEDER under contract TIC2002-04498-C05-02 (the TRACER project, http://tracer.lcc.uma.es).
References
1. Alander, J.T.: Indexed Bibliography of Genetic Algorithms and Neural Networks. Technical Report 94-1-NN, University of Vaasa, Department of Information Technology and Production Economics (1994)
2. Rumelhart, D., Hinton, G., Williams, R.: Learning Representations by Back-propagating Errors. Nature 323 (1986) 533–536
3. Cotta, C., Alba, E., Sagarna, R., Larrañaga, P.: Adjusting Weights in Artificial
Neural Networks using Evolutionary Algorithms. In Larrañaga, P., Lozano, J., eds.:
Estimation of Distribution Algorithms. A New Tool for Evolutionary Computation,
Kluwer Academic Publishers (2001) 357–373
4. Cantú-Paz, E.: Pruning Neural Networks with Distribution Estimation Algorithms.
In Erick Cantú-Paz et al., ed.: GECCO 2003, LNCS 2723, Springer-Verlag (2003)
790–800
5. Alba, E., Aldana, J.F., Troya, J.M.: Full Automatic ANN Design: A Genetic
Approach. In Mira, J., Cabestany, J., Prieto, A., eds.: New Trends in Neural
Computation, Springer-Verlag (1993) 399–404
6. Yao, X., Liu, Y.: A New Evolutionary System for Evolving Artificial Neural Networks. IEEE Transactions on Neural Networks 8 (1997) 694–713
7. Yao, X.: Evolving Artificial Neural Networks. Proceedings of the IEEE 87 (1999)
1423–1447
8. Prechelt, L.: Proben1 — A Set of Neural Network Benchmark Problems and
Benchmarking Rules. Technical Report 21, Fakultät für Informatik Universität
Karlsruhe, 76128 Karlsruhe, Germany (1994)
9. Hagan, M.T., Menhaj, M.B.: Training Feedforward Networks with the Marquardt
Algorithm. IEEE Transactions on Neural Networks 5 (1994)
10. McClelland, J.L., Rumelhart, D.E.: Parallel Distributed Processing: Explorations
in the Microstructure of Cognition. The MIT Press (1986)
11. Rosenblatt, F.: Principles of Neurodynamics. Spartan Books, New York (1962)
12. Holland, J.H.: Adaptation in Natural and Artificial Systems. The University of
Michigan Press, Ann Arbor, Michigan (1975)
13. Davis, L., ed.: Handbook of Genetic Algorithms. Van Nostrand Reinhold, New
York (1991)
14. Cotta, C., Troya, J.M.: On Decision-Making in Strong Hybrid Evolutionary Algorithms. Tasks and Methods in Applied Artificial Intelligence, Lecture Notes in Artificial Intelligence 1415 (1998) 418–427
15. Bennett, K.P., Mangasarian, O.L.: Robust Linear Programming Discrimination
of Two Linearly Inseparable Sets. Optimization Methods and Software 1 (1992)
23–34
16. Mangasarian, O.L., Setiono, R., Wolberg, W.H.: Pattern Recognition via Linear
Programming: Theory and Application to Medical Diagnosis. In Coleman, T.F.,
Li, Y., eds.: Large-Scale Numerical Optimization. SIAM Publications, Philadelphia
(1990) 22–31
17. Wolberg, W.H.: Cancer Diagnosis via Linear Programming. SIAM News 23 (1990)
1–18
18. Wolberg, W.H., Mangasarian, O.L.: Multisurface Method of Pattern Separation
for Medical Diagnosis Applied to Breast Cytology. In: Proceedings of the National
Academy of Sciences. Volume 87., U.S.A (1990) 9193–9196
19. Smith, J.W., Everhart, J.E., Dickson, W.C., Knowler, W.C., Johannes, R.S.: Using
the ADAP Learning Algorithm to Forecast the Onset of Diabetes Mellitus. In:
Proceedings of the Twelfth Symposium on Computer Applications in Medical Care,
IEEE Computer Society Press (1988) 261–265
20. Detrano, R., Janosi, A., Steinbrunn, W., Pfisterer, M., Schmid, J., Sandhu, S.,
Guppy, K., Lee, S., Froelicher, V.: International Application of a New Probability
Algorithm for the Diagnosis of Coronary Artery Disease. American Journal of
Cardiology (1989) 304–310
21. Gennari, J.H., Langley, P., Fisher, D.: Models of Incremental Concept Formation.
Artificial Intelligence 40 (1989) 11–61
22. Ragg, T., Gutjahr, S., Sa, H.: Automatic Determination of Optimal Network
Topologies Based on Information Theory and Evolution. In: Proceedings of the
23rd EUROMICRO Conference, Budapest, Hungary (1997)
23. Land, W.H., Albertelli, L.E.: Breast Cancer Screening Using Evolved Neural Networks. In: IEEE International Conference on Systems, Man, and Cybernetics, 1998. Volume 2., IEEE Computer Society Press (1998) 1619–1624
24. Erhard, W., Fink, T., Gutzmann, M.M., Rahn, C., Doering, A., Galicki, M.: The Improvement and Comparison of Different Algorithms for Optimizing Neural Networks on the MasPar MP-2. In Heiss, M., ed.: Neural Computation – NC'98, ICSC Academic Press (1998) 617–623