Levenberg-Marquardt
Manopt.LevenbergMarquardt
— FunctionLevenbergMarquardt(M, f, jacobian_f, p, num_components=-1; kwargs...)
LevenbergMarquardt(M, vgf, p; kwargs...)
LevenbergMarquardt(M, nlso, p; kwargs...)
LevenbergMarquardt!(M, f, jacobian_f, p, num_components=-1; kwargs...)
LevenbergMarquardt!(M, vgf, p, num_components=-1; kwargs...)
LevenbergMarquardt!(M, nlso, p, num_components=-1; kwargs...)
compute the the Riemannian Levenberg-Marquardt algorithm [Pee93, AOT22] to solve
\[\operatorname*{arg\,min}_{p ∈ \mathcal M} \frac{1}{2} \sum_{i=1}^m \lvert f_i(p) \rvert^2\]
where $f: \mathcal M → ℝ^m$ is written with component functions $f_i: \mathcal M → ℝ$, $i=1,…,m$, and each component function is continuously differentiable.
The second block of signatures perform the optimization in-place of p
.
Input
M::
AbstractManifold
: a Riemannian manifold $\mathcal M$
f
: a cost function $f: \mathcal M→ℝ^m$. The cost function can be provided in two different ways- as a single function returning a vector $f(p) ∈ ℝ^m$
- as a vector of functions, where each single function returns a scalar $f_i(p) ∈ ℝ$
function_type=
keyword argument.jacobian_f
: the Jacobian of $f$. The Jacobian can be provided in three different ways- as a single function returning a vector of gradient vectors $\bigl(\operatorname{grad} f_i(p)\bigr)_{i=1}^m$
- as a vector of functions, where each single function returns a gradient vector $\operatorname{grad} f_i(p)$, $i=1,…,m$
- as a single function returning a (coefficient) matrix $J ∈ ℝ^{m×d}$, where $d$ is the dimension of the manifold.
AbstractBasis
of the tangent space atp
. The type is determined by thejacobian_type=
keyword argument.p
: a point on the manifold $\mathcal M$num_components
: length $m$ of the vector returned by the cost function. By default its value is -1 which means that it is determined automatically by callingf
one additional time. This is only possible whenevaluation
isAllocatingEvaluation
, for mutating evaluation this value must be explicitly specified.
You can also provide the cost and its Jacobian already as aVectorGradientFunction
vgf
, Alternatively, passing a NonlinearLeastSquaresObjective
nlso
.
Keyword arguments
evaluation=
AllocatingEvaluation
()
: specify whether the functions that return an array, for example a point or a tangent vector, work by allocating its result (AllocatingEvaluation
) or whether they modify their input argument to return the result therein (InplaceEvaluation
). Since usually the first argument is the manifold, the modified argument is the second.η=0.2
: scaling factor for the sufficient cost decrease threshold required to accept new proposal points. Allowed range:0 < η < 1
.expect_zero_residual=false
: whether or not the algorithm might expect that the value of residual (objective) at minimum is equal to 0.damping_term_min=0.1
: initial (and also minimal) value of the damping termβ=5.0
: parameter by which the damping term is multiplied when the current new point is rejectedfunction_type=
FunctionVectorialType
: anAbstractVectorialType
specifying the type of cost function provided.initial_jacobian_f
: the initial Jacobian of the cost functionf
. By default this is a matrix of sizenum_components
times the manifold dimension of similar type asp
.initial_residual_values
: the initial residual vector of the cost functionf
. By default this is a vector of lengthnum_components
of similar type asp
.jacobian_type=
FunctionVectorialType
: anAbstractVectorialType
specifying the type of Jacobian provided.retraction_method=
default_retraction_method
(M, typeof(p))
: a retraction $\operatorname{retr}$ to use, see the section on retractions
All other keyword arguments are passed to decorate_state!
for state decorators or decorate_objective!
for objective decorators, respectively.
Output
The obtained approximate minimizer $p^*$. To obtain the whole final state of the solver, see get_solver_return
for details, especially the return_state=
keyword.
Manopt.LevenbergMarquardt!
— FunctionLevenbergMarquardt(M, f, jacobian_f, p, num_components=-1; kwargs...)
LevenbergMarquardt(M, vgf, p; kwargs...)
LevenbergMarquardt(M, nlso, p; kwargs...)
LevenbergMarquardt!(M, f, jacobian_f, p, num_components=-1; kwargs...)
LevenbergMarquardt!(M, vgf, p, num_components=-1; kwargs...)
LevenbergMarquardt!(M, nlso, p, num_components=-1; kwargs...)
compute the the Riemannian Levenberg-Marquardt algorithm [Pee93, AOT22] to solve
\[\operatorname*{arg\,min}_{p ∈ \mathcal M} \frac{1}{2} \sum_{i=1}^m \lvert f_i(p) \rvert^2\]
where $f: \mathcal M → ℝ^m$ is written with component functions $f_i: \mathcal M → ℝ$, $i=1,…,m$, and each component function is continuously differentiable.
The second block of signatures perform the optimization in-place of p
.
Input
M::
AbstractManifold
: a Riemannian manifold $\mathcal M$
f
: a cost function $f: \mathcal M→ℝ^m$. The cost function can be provided in two different ways- as a single function returning a vector $f(p) ∈ ℝ^m$
- as a vector of functions, where each single function returns a scalar $f_i(p) ∈ ℝ$
function_type=
keyword argument.jacobian_f
: the Jacobian of $f$. The Jacobian can be provided in three different ways- as a single function returning a vector of gradient vectors $\bigl(\operatorname{grad} f_i(p)\bigr)_{i=1}^m$
- as a vector of functions, where each single function returns a gradient vector $\operatorname{grad} f_i(p)$, $i=1,…,m$
- as a single function returning a (coefficient) matrix $J ∈ ℝ^{m×d}$, where $d$ is the dimension of the manifold.
AbstractBasis
of the tangent space atp
. The type is determined by thejacobian_type=
keyword argument.p
: a point on the manifold $\mathcal M$num_components
: length $m$ of the vector returned by the cost function. By default its value is -1 which means that it is determined automatically by callingf
one additional time. This is only possible whenevaluation
isAllocatingEvaluation
, for mutating evaluation this value must be explicitly specified.
You can also provide the cost and its Jacobian already as aVectorGradientFunction
vgf
, Alternatively, passing a NonlinearLeastSquaresObjective
nlso
.
Keyword arguments
evaluation=
AllocatingEvaluation
()
: specify whether the functions that return an array, for example a point or a tangent vector, work by allocating its result (AllocatingEvaluation
) or whether they modify their input argument to return the result therein (InplaceEvaluation
). Since usually the first argument is the manifold, the modified argument is the second.η=0.2
: scaling factor for the sufficient cost decrease threshold required to accept new proposal points. Allowed range:0 < η < 1
.expect_zero_residual=false
: whether or not the algorithm might expect that the value of residual (objective) at minimum is equal to 0.damping_term_min=0.1
: initial (and also minimal) value of the damping termβ=5.0
: parameter by which the damping term is multiplied when the current new point is rejectedfunction_type=
FunctionVectorialType
: anAbstractVectorialType
specifying the type of cost function provided.initial_jacobian_f
: the initial Jacobian of the cost functionf
. By default this is a matrix of sizenum_components
times the manifold dimension of similar type asp
.initial_residual_values
: the initial residual vector of the cost functionf
. By default this is a vector of lengthnum_components
of similar type asp
.jacobian_type=
FunctionVectorialType
: anAbstractVectorialType
specifying the type of Jacobian provided.retraction_method=
default_retraction_method
(M, typeof(p))
: a retraction $\operatorname{retr}$ to use, see the section on retractions
All other keyword arguments are passed to decorate_state!
for state decorators or decorate_objective!
for objective decorators, respectively.
Output
The obtained approximate minimizer $p^*$. To obtain the whole final state of the solver, see get_solver_return
for details, especially the return_state=
keyword.
Options
Manopt.LevenbergMarquardtState
— TypeLevenbergMarquardtState{P,T} <: AbstractGradientSolverState
Describes a Gradient based descent algorithm, with
Fields
A default value is given in brackets if a parameter can be left out in initialization.
p::P
: a point on the manifold $\mathcal M$storing the current iterateretraction_method::AbstractRetractionMethod
: a retraction $\operatorname{retr}$ to use, see the section on retractionsresidual_values
: value of $F$ calculated in the solver setup or the previous iterationresidual_values_temp
: value of $F$ for the current proposal pointstop::StoppingCriterion
: a functor indicating that the stopping criterion is fulfilledjacobian
: the current Jacobian of $F$gradient
: the current gradient of $F$step_vector
: the tangent vector atx
that is used to move to the next pointlast_stepsize
: length ofstep_vector
η
: Scaling factor for the sufficient cost decrease threshold required to accept new proposal points. Allowed range:0 < η < 1
.damping_term
: current value of the damping termdamping_term_min
: initial (and also minimal) value of the damping termβ
: parameter by which the damping term is multiplied when the current new point is rejectedexpect_zero_residual
: if true, the algorithm expects that the value of the residual (objective) at minimum is equal to 0.
Constructor
LevenbergMarquardtState(M, initial_residual_values, initial_jacobian; kwargs...)
Generate the Levenberg-Marquardt solver state.
Keyword arguments
The following fields are keyword arguments
β=5.0
damping_term_min=0.1
η=0.2
,expect_zero_residual=false
initial_gradient=
zero_vector
(M, p)
retraction_method=
default_retraction_method
(M, typeof(p))
: a retraction $\operatorname{retr}$ to use, see the section on retractionsstopping_criterion=
StopAfterIteration
(200)
|
StopWhenGradientNormLess
(1e-12)
|
StopWhenStepsizeLess
(1e-12)
: a functor indicating that the stopping criterion is fulfilled
See also
Technical details
The LevenbergMarquardt
solver requires the following functions of a manifold to be available
- A
retract!
(M, q, p, X)
; it is recommended to set thedefault_retraction_method
to a favourite retraction. If this default is set, aretraction_method=
does not have to be specified. - the
norm
as well, to stop when the norm of the gradient is small, but if you implementedinner
, the norm is provided already. - A
copyto!
(M, q, p)
andcopy
(M,p)
for points.
Literature
- [AOT22]
- S. Adachi, T. Okuno and A. Takeda. Riemannian Levenberg-Marquardt Method with Global and Local Convergence Properties. ArXiv Preprint (2022).
- [Pee93]
- R. Peeters. On a Riemannian version of the Levenberg-Marquardt algorithm. Serie Research Memoranda 0011 (VU University Amsterdam, Faculty of Economics, Business Administration and Econometrics, 1993).