Next: Performance Evaluation of the
Up: New LM with Adaptive
Previous: New LM with Adaptive
  Contents
  Index
The Algorithm with Adaptive Momentum
The goal is to choose minimization directions, which are not interfering and linearly independent. This can be achieved by the selection of conjugate directions which forms the basis of the CG method. Two vectors are non-interfering and linearly or mutually conjugate with respect to
when
|
(1025) |
where
,
is the Hessian matrix
, with is the number of training patterns, is the number of outputs, the weights vector, and is the optimal step (or the search direction). The objective is to reach a minimum of the cost function with respect to and to simultaneously maximize
without compromising the need for a decrease of the cost function. At each iteration of the learning process, the weight vector will be incremented by , so that:
|
(1026) |
where is a constant and the change in is equal to a predetermined quantity
:
|
(1027) |
This is a constrained optimization problem which can be analytically solved by introducing two Lagrange multipliers and . Then function is introduced to evaluate the differentials involved in the right hand side and to substitute the function :
|
(1028) |
By replacing by its value, we obtain:
|
(1029) |
To maximize at each iteration, the following equations must be satisfied:
|
(1030) |
and
|
(1031) |
Hence from 10.30 we obtain:
|
(1032) |
From equations 10.27 and 10.32 we obtain:
|
(1033) |
where
|
(1034) |
|
(1035) |
From 10.33 we obtain :
|
(1036) |
It remains to obtain . This can be done by substituting Eqn. 10.30 in Eqn. 10.26 to obtain:
|
(1037) |
where
|
(1038) |
Finally, Eqn. 10.36 is substituted into Eqn. 10.37 and solve for to obtain:
|
(1039) |
The positive square root value has been chosen for in order to satisfy Eqn. 10.31. Note the bound
set on the value of by Eqn. 10.39. The value chosen for is always:
where is a constant between 0 and 1. Therefore the free parameters are and (
). The authors recorded during their simulations that the best results are given for
and
.
Next: Performance Evaluation of the
Up: New LM with Adaptive
Previous: New LM with Adaptive
  Contents
  Index
Samir Mohamed
2003-01-08