fmincon: any way to enforce linear inequality constraints at intermediate iterations?
I am solving an optimization problem with the interior-point algorithm of fmincon.
My parameters have both lower and upper bounds and linear inequality constraints.
A_ineq = [1 -1 0 0 0;
-1 2 -1 0 0;
0 -1 2 -1 0;
0 0 -1 2 -1;
0 0 0 1 -1];
b_ineq=[0 0 0 0 0];
I observed that fmincon satisfies the bounds at all iterations, as expected, but not the linear inequality constraints.
However, the linear inequality constraints are crucially important in my case: if they are violated, my optimization problem cannot be solved.
Is there anything one can do to ensure that linear inequality constraints are satisfied at intermediate iterations?
6 Comments
Walter Roberson
on 2 Mar 2023
The fmincon interior-point, sqp, and trust-region-reflective algorithms satisfy bound constraints at each iteration, but active-set can violate bound constraints.
Those refer to bound constraints, though, not to linear constraints.
SA-W
on 2 Mar 2023
@Walter Roberson
I am aware of that. But I thought there might be some manipulations of the linear constraints such that fmincon gives them a higher priority at intermediate iterations.
SA-W
on 2 Mar 2023
Scaling the matrix A_ineq by 1e4 or any other large value really helps in my case to enforce the linear constraints (in a least-squares sense) at intermediate iterations.
Matt J
on 6 May 2023
Edited: Matt J
on 6 May 2023
I'd like to point out that it violates the theoretical assumptions of fmincon, and probably of all the Optimization Toolbox solvers as well, when the domain of your objective function and constraints is not an open subset of R^n. If you forbid evaluation outside the closed set defined by your inequality constraints, that is essentially the situation you are creating. It's not entirely clear what hazards this creates in practice, but it might mean one of the Global Optimization Toolbox solvers is more appropriate.
Accepted Answer
Matt J
on 5 May 2023
Edited: Matt J
on 23 May 2023
Within your objective function, project the current x onto the constrained region using lsqlin:
fun=@(x) myObjective(x, A_ineq,b_ineq,lb,ub);
x=fmincon(fun,x0,[],[],[],[],lb,ub,[],options)
function fval=myObjective(x, A_ineq,b_ineq,lb,ub)
C=speye(numel(x));
x=lsqlin(C,x,A_ineq,b_ineq,[],[],lb,ub); %EDIT: was quadprog by mistake
fval=...
end
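To see what the projection does, here is a minimal sketch on a hypothetical 2-D feasible set {z | z(1) <= z(2)}. With C = eye(n), lsqlin minimizes ||C*z - x||^2 = ||z - x||^2 subject to the constraints, so a feasible x is returned unchanged and an infeasible x is moved to the nearest feasible point:
A = [1 -1]; b = 0; % feasible set: z(1) <= z(2)
xFeas = [0; 1]; % satisfies the constraint
xInfeas = [1; 0]; % violates the constraint
zFeas = lsqlin(eye(2), xFeas, A, b) % returns [0; 1], i.e. x unchanged
zInfeas = lsqlin(eye(2), xInfeas, A, b) % returns [0.5; 0.5], the nearest feasible point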
47 Comments
SA-W
on 6 May 2023
If I understand this snippet correctly, you want to invoke fmincon without passing any constraints and bounds and, at every iteration of fmincon, let quadprog project the x such that all constraints are satisfied?
If so, I have two questions:
- If the constraints are already satisfied, quadprog would not change x, right?
- Could I also use this strategy with lsqnonlin?
Matt J
on 6 May 2023
Edited: Matt J
on 6 May 2023
I changed my mind about passing bounds. You should put those in. Yes, I guess you could do it with lsqnonlin as well.
I'm not sure about the performance of this strategy compared to my other answer. It violates differentiability assumptions. Also, because you will be using finite-difference derivative approximations, quadprog will need to be called N times per iteration (once per perturbed point), so it wouldn't be a good strategy for high-dimensional problems.
SA-W
on 7 May 2023
And why would you not pass the linear constraints to fmincon but only to quadprog?
Matt J
on 7 May 2023
It is a quadratic minimization problem,
P(x) = argmin_{z in C} 1/2*||z - x||^2
where C is the set defined by the constraints.
SA-W
on 7 May 2023
I know. But why should I not pass the linear constraints to fmincon? I do not see the connection with the subsequent call to quadprog.
Matt J
on 7 May 2023
Edited: Matt J
on 7 May 2023
Ah, well, there's no reason fmincon should need to worry about enforcing linear inequality constraints if every test point is going to be projected onto a feasible candidate anyway. The objective function is incapable of evaluating anything outside C.
Come to think of it, though, it would probably be wise to apply a penalty on distance from the feasible set as well to remove ambiguity in the solution.
function fval=myObjective(x, A_ineq,b_ineq,lb,ub, weight )
C=speye(numel(x));
[x,dist]=quadprog(C,x,A_ineq,b_ineq,[],[],lb,ub);
fval=...
fval=fval+weight*dist
end
SA-W
on 23 May 2023
I tried your suggested strategy, but it is not working as expected:
function fval=myObjective(x, A_ineq,b_ineq,lb,ub, weight )
%1st iteration, i.e. x is the start vector
x =
0.004000000000000
0.001000000000000
0
0.001000000000000
0.004000000000000
0.009000000000000
0.016000000000000
0.025000000000000
0.036000000000000
0
0.020250000000000
0.063245553203368
0.083495553203368
0.081000000000000
0.144245553203368
0.089442719099992
0.109692719099992
0.170442719099992
all(A_ineq*x <= b_ineq) % == 1, i.e. start vector satisfies constraints
C=speye(numel(x));
[x,dist]=quadprog(C,x,A_ineq,b_ineq,[],[],lb,ub);
x =
1.0e-03 *
0.178567411241192
0.050029327861103
0
0.000014275626508
0.000030351196081
0.000053745685964
0.000087040217878
0.000137579529376
0.000245546982784
0
0.000003261476952
0.000013844333853
0.000023204570239
0.000023729908401
0.000054189214599
0.000019086149826
0.000029710389433
0.000063797849365
% not possible to work with the new x, which is several magnitudes lower
end
Calling quadprog with the start vector, which satisfies all constraints and bounds, results in a new vector x where nearly all components are several orders of magnitude lower.
I attached the matrix A_ineq; b_ineq = 0, and the bounds lb and ub are listed in my update below. Basically, A_ineq encodes that parameters are less or greater than neighboring parameters.
Ideally, if the constraints were satisfied, I would expect quadprog not to change x significantly.
Does the projected x make sense to you in my case?
SA-W
on 23 May 2023
Update:
The issue seems to be related not to the linear constraints but to the bounds.
C=speye(numel(x));
[x,dist]=quadprog(C,x,[],[],[],[],lb,ub);
x =
1.0e-03 *
0.001101980588430
0.160659227282455
0
0.160659227282455
0.001101980588430
0.000065909263473
0.000037138227992
0.000023769741153
0.000016507062926
0
0.000029344843098
0.000009396094623
0.000007117296309
0.000007336573768
0.000004119807053
0.000006644060468
0.000005417528065
0.000003486590875
lb =
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
ub =
1
1
0
1
1
1
1
1
1
0
2
2
2
2
2
2
2
2
Calling quadprog without bounds and linear constraints, I get the vector that I pass multiplied by (-1).
Do you have any idea as to why the bounds are causing this issue?
Torsten
on 23 May 2023
Edited: Torsten
on 23 May 2023
You want to find a vector x that minimizes
1/2*(x-x_passed).'*(x-x_passed)
under the constraints
A_ineq*x <= b_ineq
where x_passed is the vector you get from fmincon.
Thus f in the list of inputs to quadprog must be -x_passed, not x_passed:
[x,dist]=quadprog(C,-x,A_ineq,b_ineq,[],[],lb,ub);
instead of
[x,dist]=quadprog(C,x,A_ineq,b_ineq,[],[],lb,ub);
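To spell out the equivalence (a sketch using the same variables; note that the second output of quadprog is the optimal objective value, not the projection distance):
% quadprog minimizes 1/2*z'*H*z + f'*z. With H = I and f = -x, this equals
% 1/2*||z - x||^2 minus the constant 1/2*x'*x, so the minimizer is the
% projection of x onto the feasible set.
H = speye(numel(x));
[xProj, fq] = quadprog(H, -x, A_ineq, b_ineq, [], [], lb, ub);
distSq = 2*fq + x'*x; % squared projection distance, recovered from fq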
Matt J
on 23 May 2023
@SA-W Sorry, I made a mistake. Instead of quadprog, I meant for you to use lsqlin:
fun=@(x) myObjective(x, A_ineq,b_ineq,lb,ub);
x=fmincon(fun,x0,[],[],[],[],lb,ub,[],options)
function fval=myObjective(x, A_ineq,b_ineq,lb,ub)
C=speye(numel(x));
x=lsqlin(C,x,A_ineq,b_ineq,[],[],lb,ub);
fval=...
end
But I want to reiterate that I think this whole thing you are pursuing is probably very inefficient. You should just abort the objective function with inf, as I suggest in my other answer.
SA-W
on 23 May 2023
Thanks to both of you. It makes sense to pass -x to quadprog.
@Matt J I have an analytical expression for the Jacobian. Hence, quadprog or lsqlin has to be called only once per iteration of lsqnonlin/fmincon. That said, I think the overall strategy is not so inefficient.
Anyway, why would you use lsqlin instead of quadprog?
Matt J
on 23 May 2023
Edited: Matt J
on 24 May 2023
@SA-W In the code I originally posted, quadprog was being called with the input argument syntax of lsqlin, so that probably explains why it didn't work. lsqlin is more specialized than quadprog, so I tend to think it is better to use it when it is applicable.
Regarding your analytical gradient, I am not sure it is valid anymore. If your original objective function was f(x) with gradient g(x), the new one with the projection onto the linear constraints is f(P(x)), where P is my notation for the projection operator. The new gradient would be nablaP.' * g(P(x)), but I don't immediately see any easy way to compute nablaP.
EDIT: Possibly, you can take
nablaP = (I - pinv(A)*A).'
as the linear projection operator (transposed) onto the active linear constraints.
SA-W
on 23 May 2023
"The new gradient would be *g(P(x)), but I don't immediately see any easy way to compute nabla P "
I see what you mean. If I were using finite differencing at fmincon/lsqnonlin level, I would not have to worry about that, right?
" EDIT: Possibly, you can take nabla P as the linear projection operator (transposed) onto the active linear constraints. "
Does this help in calculating nabla P?
Matt J
on 23 May 2023
I see what you mean. If I were using finite differencing at fmincon/lsqnonlin level, I would not have to worry about that, right?
Right, but it would slow you down a lot.
Does this help in calculating nabla P?
Yes, projecting onto the active constraints is a simple matrix multiplication.
SA-W
on 23 May 2023
"Yes, projecting onto the active constraints is a simple matrix multiplication."
Could you please explain a little bit how that could be implemented?
SA-W
on 23 May 2023
Do the expressions look the same for inequalities instead of equalities, i.e. if the set is given by
{x | A*x <= b} instead of {x | A*x = b}?
Matt J
on 23 May 2023
Edited: Matt J
on 23 May 2023
No, but the case {x | A*x <= b} doesn't apply here. You will be projecting onto the active constraints. Active constraints are the ones satisfied with equality at the solution lsqlin gives you.
SA-W
on 23 May 2023
Say A*x > 0 in some components, with x given by fmincon. After invoking lsqlin, I observed that A*x = 0 at those components. Is this the expected behavior?
SA-W
on 23 May 2023
"I don't know what you mean by "with x given by fmincon". In the approach we are talking about, fmincon can does not act independently of lsqlin. You are calling lsqlin inside the objective function, which is in turn called iteratively by fmincon."
Yes, that's clear. Within the objective function, say some constraints are violated, i.e. A_ineq*x > 0 for some components. Then I call lsqlin to project x onto the feasible set and what I observe then is that A_ineq*x = 0 at those components, i.e. those constraints are active now.
Does it make sense that the constraints are active after calling lsqlin?
Torsten
on 23 May 2023
They can be active or inactive after calling "lsqlin". The purpose of the call to "lsqlin" is to make the x from "fmincon" feasible, and feasible means: a_i*x = b_i or a_i*x < b_i.
SA-W
on 24 May 2023
Suppose a_i*x > b_i before calling lsqlin. After calling lsqlin, is this constraint active or inactive, or is there no general statement?
Matt J
on 24 May 2023
Edited: Matt J
on 24 May 2023
The general statement is this - an inequality constraint a_i*x <= b_i can either be active, inactive, or violated at a given point x. In ideal mathematical terms, these cases are as follows:
a_i*x < b_i (inactive)
a_i*x = b_i (active)
a_i*x > b_i (violated)
An equality constraint will always be active, when satisfied. Otherwise, it is violated.
The optimization strategy we are pursuing here requires that you find which constraints are active at the projected point generated by lsqlin. As you do this, you have to be mindful that you are working with a finite-precision computer, so you have to modify the ideal notion of what it means for a constraint to be active. You must instead consider a constraint active if
abs( a_i*x - b_i ) < tol
where tol >= 0 is a tolerance parameter to be chosen by you.
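In code, the classification could look like this (a sketch; tol is the user-chosen tolerance):
tol = 1e-6;
res = A_ineq*x - b_ineq; % signed constraint residuals
active = abs(res) < tol; % treated as equalities
inactive = res <= -tol; % strictly satisfied
violated = res >= tol; % infeasible beyond the tolerance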
SA-W
on 24 May 2023
"The new gradient would be nablaP.' * g(P(x)), but I don't immediately see any easy way to compute nablaP."
"EDIT: Possibly, you can take nablaP as the linear projection operator (transposed) onto the active linear constraints."
So, in the equation
nablaP = (I - pinv(A)*A).';
the matrix A contains only the rows associated with active constraints?
Torsten
on 24 May 2023
Suppose a_i*x > b_i before calling lsqlin. After calling lsqlin, is this constraint active or inactive, or is there no general statement?
No general statement is possible in my opinion.
SA-W
on 13 Feb 2025 at 7:01
Edited: SA-W
on 13 Feb 2025 at 7:04
You suggested projecting onto the constraint set with lsqlin in every call to the objective function, using
x = lsqlin(C,x,A_ineq,b_ineq,[],[],lb,ub);
The projection P(x) and its gradient are then given by
nablaP = (I - pinv(A)*A).'
with A holding the active constraint rows. In the objective function, we can then return
fval = f(P(x)) and grad = nablaP.' * g(P(x))
where g(x) is my notation for the gradient my objective function would return without any lsqlin projection.
First of all, I do not think the equation for nablaP is correct if we pass the bounds (lb and ub) to lsqlin. Consider a toy problem in 1d (lb = 0, ub = 1), where the "projection function" is already not differentiable at the bounds. Correct me if I am wrong, but when passing the bounds to lsqlin, there is no hope for an analytical nablaP.
But that's ok, I can formulate my problem without bounds. More important for me is the case where some components of x are fixed values (i.e. x(i) = c(i) for some i). For this, let's partition x = [x_c; x_f] and A = [A_c, A_f], where x_c refers to the fixed (constrained) values, x_f to the free values, and A is the linear inequality matrix. Then the projection problem reduces to finding the free components:
z_f = argmin ||z - x_f||^2 subject to A_f*z <= b - A_c*x_c
and, with Af_active and Ac_active denoting the rows of the active constraints,
nablaP_f = (I - pinv(Af_active)*Af_active).'
So the overall nablaP being used for the gradient of the objective has a 2x2 block structure,
nablaP = [1, 0; -pinv(Af_active)*Ac_active, I - pinv(Af_active)*Af_active]
(i) Is this mathematical derivation correct? It looks a bit suspicious to me to have this zero block at the top right, but intuitively it makes sense (the fixed values do not change if the free values are changed...).
(ii) Below is a minimal example demonstrating the projection step and the extraction of the active constraints at the projected solution. Does this look reasonable?
The reason why I show this is that the gradient of my objective function (probably the nablaP part) no longer passes verification against a FD approximation. Without the projection, the gradient check works. So I might have a logical mistake in the code below, maybe in the way I am extracting the active constraints and constructing the final nablaP.
% Points and mock values x
pt = [3 5 7 9 11]; % 1x5 interpolation sites
x = [0 0.046506608245001 0.096706757662298 0.153809034551104 0.205280568411077];
% Compute linear constraints
k = 4; % cubic spline (order 4)
B = spapi(k, pt, eye(numel(pt))); % function basis
Bd = fnder(B, 1); % first derivative basis
Bdd = fnder(B, 2); % second derivative basis
% Implement [f >= 0; fd >= 0; fdd >= 0] as Aineq*x <= b
A = [-B.coefs'; -Bd.coefs'; -Bdd.coefs']; % Shape 12x5
b = zeros(size(A, 1), 1); % Shape 12x1
% Assume x = [x_c; x_f] where x_c = x(1) = 0
xc = 0; % Shape 1x1
xf = x(2:end)'; % Shape 4x1
Ac = A(:, 1); % Shape 12x1
Af = A(:, 2:end); % Shape 12x4
% Projection onto the feasible set Af*zf <= b - Ac*xc
C = eye(numel(xf));
zf = lsqlin(C, xf, Af, b - Ac*xc);
% Find active constraints at projected solution
idx_active = find(abs(Af*zf - (b - Ac*xc)) < 1e-6); % idx = [1, 12], i.e., two ctr are active
Af_active = Af(idx_active,:);
nablaP = eye(size(Af_active, 2)) - pinv(Af_active) * Af_active;
% Concatenated solution z and gradient nablaP
z = vertcat(0, zf);
nablaP = blkdiag(0, nablaP); % Create 2x2 block shape
nablaP(2:end, 1) = -pinv(Af_active) * Ac(idx_active); % Set (2,1) block
nablaP(1, 1) = 1.0; % Set (1,1) block
Matt J
on 13 Feb 2025 at 13:24
Edited: Matt J
on 13 Feb 2025 at 13:43
"Correct me if I am wrong, but when passing the bounds to lsqlin, there is no hope for an analytical nablaP."
I don't see why the presence of bounds would invalidate anything, as long as you are treating the bounds the same way you are treating the other inequality constraints. Remember, bounds are a special case of linear inequality constraints, with Aineq the identity matrix: eye(N)*x <= ub, -eye(N)*x <= -lb. So you also need to look at which bounds are active when you form the projection operator pinv(A).
"More important for me is the case where some components of x are fixed values."
You haven't explained how you are implementing that. Ideally, fixed and a priori known values wouldn't be among the x(i) at all - because they're not unknowns. In that case, they wouldn't have any role to play in the projection.
If you are including them in the unknowns, it would have to mean that you have set lb(i) = ub(i) = c(i). If that's the case, the only change that should be needed is to treat these as bound constraints that are always active, as mentioned above.
"The reason why I show this is that the gradient of my objective function (probably the nablaP part) no longer passes verification against a FD approximation. Without the projection, the gradient check works."
You haven't shown us the numbers, but there is no reason to expect agreement with an FD check anymore, or at least not everywhere. The function is now only piecewise differentiable. At points where the set of active constraints changes (such as at the boundary of the linearly constrained region) there will be a discontinuity in the derivative.
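A sketch of that bookkeeping (hypothetical names; assumes the fixed entries are encoded via lb(i) == ub(i), and idx_active indexes the active inequality rows as in the code above):
n = numel(lb);
fixed = find(lb == ub); % indices of the fixed variables
I = eye(n);
E = I(fixed,:); % one always-active e_i' row per fixed x(i)
A_act = [Aineq(idx_active,:); E]; % active inequalities plus always-active bound rows
nablaP = (I - pinv(A_act)*A_act).';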
SA-W
on 13 Feb 2025 at 14:16
Edited: SA-W
on 13 Feb 2025 at 15:45
You haven't shown us the numbers, but there is no reason to expect agreement with an FD check anymore, or at least not everywhere. The function is now only piecewise differentiable.
This makes sense. I did not keep that in mind. However, I managed to create a toy problem which does not converge to the expected solution.
Specifically, I am creating reference data (xq --> yq = xq.^2) and want to fit a quadratic spline to this data. The objective function is ||f(xq) - yq||^2, where f(xq) represents the sampled spline values. Since the spline is quadratic, it can perfectly fit the data, which is reproduced by the code.
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/1826083/image.jpeg)
Then, I implemented the linear constraints f''>=0 and the projection as discussed herein and fmincon converged to a wrong solution:
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/1826084/image.jpeg)
You can reproduce the output with this code. Do you see a mistake in the implementation?
% Generate data
xq = linspace(3, 11, 30);
yq = xq.*xq;
% Quadratic spline f and basis for df/dy
k = 3;
x = [3 5 7 9 11];
B = spapi(k, x, eye(numel(x)));
% Linear constraints
Bdd = fnder(B, 2);
Aineq = -Bdd.coefs'; % Shape 3x5
bineq = zeros(size(Aineq, 1), 1); % Shape 3x1
% Function handle and solver options
objectiveFunc = @(y) Objective_func(y, k, x, Aineq, bineq, xq, yq);
options = optimoptions("fmincon", ...
"FiniteDifferenceStepSize", 1e-6, ...
"SpecifyObjectiveGradient", true);
% Optimizer (no constraints)
lb = []; ub = []; A = []; b = [];
x0 = -x .*x; % x0 violates the constraints
sol = fmincon(objectiveFunc, x0, A, b, [], [], lb, ub, [], options);
% Plot data and solution
f = spapi(k, x, sol);
fnplt(f); hold on;
plot(xq, yq, 'r+', 'DisplayName', 'data');
legend show;
function [fval, grad] = Objective_func(y, k, x, Aineq, bineq, xq, yq)
% y are the current parameters
% Project y onto active set
C = eye(numel(y));
options = optimset('Display', 'off');
y = lsqlin(C, y(:), Aineq, bineq, [], [], [], [], [], options);
% Find active ctr
idx_active = find(abs(Aineq * y(:) - bineq) < 1e-6);
Aineq_active = Aineq(idx_active,:); %#ok<FNDSB>
nablaP = eye(numel(y)) - pinv(Aineq_active) * Aineq_active;
% Compute residual and sum of squares
f = spapi(k, x, y);
residual = fnval(f, xq) - yq;
fval = sum(residual.*residual);
% Compute jacobian and gradient
if nargout > 1
B = spapi(k, x, eye(numel(x)));
jacobianT = fnval(B, xq);
grad = 2 * jacobianT * residual(:);
grad = nablaP' * grad;
end
end
Matt J
on 13 Feb 2025 at 17:11
For a quadratic spline, you need k=2:
% Generate data
xq = linspace(3, 11, 30);
yq = xq.*xq;
% Quadratic spline f and basis for df/dy
k = 2;
x = [3 5 7 9 11];
B = spapi(k, x, eye(numel(x)));
% Linear constraints
Bdd = fnder(B, 2);
Aineq = -Bdd.coefs';
bineq = zeros(size(Aineq, 1), 1);
% Function handle and solver options
objectiveFunc = @(y) Objective_func(y, k, x, Aineq, bineq, xq, yq);
options = optimoptions("fmincon", ...
"FiniteDifferenceStepSize", 1e-6, ...
"SpecifyObjectiveGradient", true);
% Optimizer (no constraints)
lb = []; ub = []; A = []; b = [];
x0 = -x .*x; % x0 violates the constraints
sol = fmincon(objectiveFunc, x0, A, b, [], [], lb, ub, [], options);
Local minimum found that satisfies the constraints.
Optimization completed because the objective function is non-decreasing in
feasible directions, to within the value of the optimality tolerance,
and constraints are satisfied to within the value of the constraint tolerance.
% Plot data and solution
f = spapi(k, x, sol);
fnplt(f); hold on;
plot(xq, yq, 'r+', 'DisplayName', 'data');
legend show;
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/1826095/image.png)
SA-W
on 13 Feb 2025 at 17:42
Edited: SA-W
on 13 Feb 2025 at 19:22
For a quadratic spline, you need k=2:
The curve you plotted is a piecewise-linear function. For the spapi function, k refers to the order of the spline, which is the degree + 1. So we need k = 3 for a quadratic spline (degree 2).
So, for k = 3, can you fit the data as I showed in the first picture of my previous post? As I said, when the projection terms are included, fmincon does not converge to the solution.
Matt J
on 14 Feb 2025 at 0:19
Edited: Matt J
on 14 Feb 2025 at 4:26
% Generate data
xdata = linspace(3, 11, 30); ydata = xdata.^2;
% Quadratic spline f and basis for df/dy
k = 3;
xcp = [3 5 7 9 11]; %control point x-locations
fcn = @(z) fnval(spapi(k, xcp, z),xdata);
tic
S=func2mat(fcn,xcp,'doSparse',false);
toc
Elapsed time is 0.055264 seconds.
dS=diff(S,1,1);
% Linear constraints
Aineq = -dS;
bineq = zeros(height(dS), 1);
% Function handle and solver options
objectiveFunc = @(z) Objective_func(z,Aineq, bineq, ydata,S);
options = optimoptions("fmincon", ...
"FiniteDifferenceStepSize", 1e-6, ...
"SpecifyObjectiveGradient", true);
% Optimizer (no constraints)
lb = []; ub = []; A = []; b = [];
z0 = -xcp(:); % initial guess; violates the constraints (checked below)
violation = any(Aineq*z0>bineq) %Initial guess z0 violates the constraint
violation = logical
1
z = fmincon(objectiveFunc, z0, A, b, [], [], lb, ub, [], options);
Local minimum found that satisfies the constraints.
Optimization completed because the objective function is non-decreasing in
feasible directions, to within the value of the optimality tolerance,
and constraints are satisfied to within the value of the constraint tolerance.
% Plot data and solution
close all
xFit=linspace( xdata(1),xdata(end),1e4);
yFit=fnval(spapi(k, xcp, z), xFit );
plot(xFit,yFit,'-b',xdata,ydata, 'ro'); hold off
legend('Fitted Spline Samples','Data',Location='northwest'); hold off
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/1826179/image.png)
function [fval, grad] = Objective_func(z,Aineq, bineq, ydata,S)
% z are the current parameters
z=z(:); ydata=ydata(:); I = speye(numel(z));
% Project z onto constraining set
options = optimoptions('lsqlin','Display', 'none');
zp = lsqlin(I, z, Aineq, bineq, [], [], [], [], [], options);
errData=S*zp-ydata; %data error term (depends only on zp)
errReg=zp-z; %penalty term on constraint violation
% Compute objective
fval=norm(errData)^2/2 + norm(errReg)^2/2;
% Compute jacobian and gradient
if nargout > 1
idx_active = abs(Aineq * zp - bineq) < 1e-6;
A = Aineq(idx_active,:);
JacP = I - pinv(A) * A;
grad=(S*JacP).'*errData + (JacP-I).'*errReg;
end
end
SA-W
on 14 Feb 2025 at 6:01
Edited: SA-W
on 14 Feb 2025 at 6:02
Thanks for the working example.
fcn = @(z) fnval(spapi(k, xcp, z),xdata);
In my real problem, the evaluation points (xdata) are different for every fmincon iteration. So I would probably need to set up fcn and Aineq anew inside the objective function.
Anyway, the minimal example code I provided is closer to my real code base, including the logic of computing the constraints as Aineq = -Bdd.coefs'. So it is important for me to understand why the code below converges to the wrong solution.
What needs to be changed below? The gradient at the (wrong) solution is zero, so I suspect something must be wrong with the gradient, but I cannot see a mistake.
% Generate data
xq = linspace(3, 11, 30);
yq = xq.*xq;
% Quadratic spline f and basis for df/dy
k = 3;
x = [3 5 7 9 11];
B = spapi(k, x, eye(numel(x)));
% Linear constraints
Bdd = fnder(B, 2);
Aineq = -Bdd.coefs'; % Shape 3x5
bineq = zeros(size(Aineq, 1), 1); % Shape 3x1
% Function handle and solver options
objectiveFunc = @(y) Objective_func(y, k, x, Aineq, bineq, xq, yq);
options = optimoptions("fmincon", ...
"FiniteDifferenceStepSize", 1e-6, ...
"SpecifyObjectiveGradient", true);
% Optimizer (no constraints)
lb = []; ub = []; A = []; b = [];
x0 = -x .*x; % x0 violates the constraints
sol = fmincon(objectiveFunc, x0, A, b, [], [], lb, ub, [], options);
Local minimum found that satisfies the constraints.
Optimization completed because the objective function is non-decreasing in
feasible directions, to within the value of the optimality tolerance,
and constraints are satisfied to within the value of the constraint tolerance.
% Plot data and solution
f = spapi(k, x, sol);
fnplt(f); hold on;
plot(xq, yq, 'r+', 'DisplayName', 'data');
legend show;
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/1826185/image.png)
function [fval, grad] = Objective_func(y, k, x, Aineq, bineq, xq, yq)
% y are the current parameters
% Project y onto active set
C = eye(numel(y));
options = optimset('Display', 'off');
y = lsqlin(C, y(:), Aineq, bineq, [], [], [], [], [], options);
% Find active ctr
idx_active = find(abs(Aineq * y(:) - bineq) < 1e-6);
Aineq_active = Aineq(idx_active,:); %#ok<FNDSB>
nablaP = eye(numel(y)) - pinv(Aineq_active) * Aineq_active;
% Compute residual and sum of squares
f = spapi(k, x, y);
residual = fnval(f, xq) - yq;
fval = sum(residual.*residual);
% Compute jacobian and gradient
if nargout > 1
B = spapi(k, x, eye(numel(x)));
jacobianT = fnval(B, xq);
grad = 2 * jacobianT * residual(:);
grad = nablaP' * grad;
end
end
Matt J
on 14 Feb 2025 at 6:42
Edited: Matt J
on 14 Feb 2025 at 6:58
The main problem seems to be that you are not adding a penalty term on the projection residual, as I advised in this earlier comment. This creates lots of local minima. Your code does converge as is when a better initial point is provided, e.g., x0 =x.^2-1. By adding a penalty term, as below, it also converges from x0=-x.^2:
% Generate data
xq = linspace(3, 11, 30);
yq = xq.*xq;
% Quadratic spline f and basis for df/dy
k = 3;
x = [3 5 7 9 11];
B = spapi(k, x, eye(numel(x)));
% Linear constraints
Bdd = fnder(B, 2);
Aineq = -Bdd.coefs'; % Shape 3x5
bineq = zeros(size(Aineq, 1), 1); % Shape 3x1
% Function handle and solver options
objectiveFunc = @(y) Objective_func(y, k, x, Aineq, bineq, xq, yq);
options = optimoptions("fmincon", ...
"FiniteDifferenceStepSize", 1e-6, ...
"SpecifyObjectiveGradient", true);
% Optimizer (no constraints)
lb = []; ub = []; A = []; b = [];
x0 = -x .*x; % x0 violates the constraints
sol = fmincon(objectiveFunc, x0, A, b, [], [], lb, ub, [], options);
Local minimum found that satisfies the constraints.
Optimization completed because the objective function is non-decreasing in
feasible directions, to within the value of the optimality tolerance,
and constraints are satisfied to within the value of the constraint tolerance.
% Plot data and solution
f = spapi(k, x, sol);
fnplt(f); hold on;
plot(xq, yq, 'r+', 'DisplayName', 'data');
legend show;
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/1826187/image.png)
function [fval, grad] = Objective_func(z, k, x, Aineq, bineq, xq, yq)
% z are the current parameters
z=z(:); xq=xq(:); yq=yq(:); I = eye(numel(z));
% Project z onto linearly constrained region
options = optimset('Display', 'off');
zp = lsqlin(I, z, Aineq, bineq, [], [], [], [], [], options);
% Compute norms of residuals
f = spapi(k, x, zp);
dataResidual = fnval(f, xq) - yq;
projectionResidual=zp-z;
fval = norm(dataResidual).^2/2 + norm(projectionResidual)^2/2;
% Compute jacobian and gradient
if nargout > 1
% Find active ctr
idx_active = abs(Aineq * zp - bineq) < 1e-6;
A = Aineq(idx_active,:);
JacP = I- pinv(A) * A;
B = spapi(k, x, eye(numel(x)));
jacobianT = fnval(B, xq);
grad = (jacobianT.' * JacP).'*dataResidual(:) + (JacP-I).'*projectionResidual;
end
end
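Since FD gradient checks come up repeatedly here, a small spot check along these lines can help (a sketch; the test point and step size are arbitrary, and agreement is only expected where the active set is locally constant):
zt = randn(numel(x), 1); % arbitrary test point
[f0, g] = Objective_func(zt, k, x, Aineq, bineq, xq, yq);
h = 1e-6; gFD = zeros(size(g));
for i = 1:numel(zt)
e = zeros(size(zt)); e(i) = h;
gFD(i) = (Objective_func(zt + e, k, x, Aineq, bineq, xq, yq) - f0)/h;
end
max(abs(gFD - g)) % should be small away from changes in the active set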
SA-W
on 14 Feb 2025 at 12:00
The main problem seems to be that you are not adding a penalty term on the projection residual, as I advised in this earlier comment. This creates lots of local minima.
Ah, I was not aware that neglecting the constraint violation would lead to multiple local minima since the problem without projection is quadratic.
Below, I added the constraint x(1) = 0, implemented by setting lb(1) = ub(1) = 0 and passing the bounds to both fmincon and the lsqlin solver. Keeping all parameters (also the fixed ones) is easier in my interfaces. You said ...
So, you also need to look at which bounds are active when you form the projection operator pinv(A).
In this example, lb(1) and ub(1) are active by design. But how can we include this when forming JacP?
% Find active ctr
idx_active = abs(Aineq * zp - bineq) < 1e-6;
A = Aineq(idx_active,:); % Active linear constraints, How to include active lb(1) and ub(1) ?
JacP = I- pinv(A) * A;
One idea I have (based on your comment) is to add the unit-vector rows ([-1 0 0 0 0] and [1 0 0 0 0]) to Aineq ...
lb1 = [-1 0 0 0 0]; ub1 = [1 0 0 0 0];
Aineq = [lb1; ub1; Aineq];
bineq = [0; 0; bineq];
% Find active ctr
idx_active = abs(Aineq * zp - bineq) < 1e-6; % Now always includes rows 1 and 2
A = Aineq(idx_active,:);
JacP = I - pinv(A) * A;
but I feel this is not the most efficient way to do this.
Code with x(1) = lb(1) = ub(1) = 0:
% Generate data
xq = linspace(3, 11, 30);
yq = xq.*xq;
% Quadratic spline f and basis for df/dy
k = 3;
x = [3 5 7 9 11];
B = spapi(k, x, eye(numel(x)));
% Linear constraints
Bdd = fnder(B, 2);
Aineq = -Bdd.coefs'; % Shape 3x5
bineq = zeros(size(Aineq, 1), 1); % Shape 3x1
% Bound constraints to fix x(1) = 0
lb = -Inf(numel(x), 1);
ub = +Inf(numel(x), 1);
lb(1) = 0.0;
ub(1) = 0.0;
% Function handle and solver options
objectiveFunc = @(y) Objective_func(y, k, x, lb, ub, Aineq, bineq, xq, yq);
options = optimoptions("fmincon", ...
"FiniteDifferenceStepSize", 1e-6, ...
"SpecifyObjectiveGradient", true);
% Call optimizer
x0 = -x .*x; % x0 violates the constraints
sol = fmincon(objectiveFunc, x0, [], [], [], [], lb, ub, [], options);
% Plot data and solution
f = spapi(k, x, sol);
fnplt(f); hold on;
plot(xq, yq, 'r+', 'DisplayName', 'data');
legend show;
function [fval, grad] = Objective_func(z, k, x, lb, ub, Aineq, bineq, xq, yq)
% z are the current parameters
z=z(:); xq=xq(:); yq=yq(:); I = eye(numel(z));
% Project z onto linearly constrained region
options = optimset('Display', 'off');
zp = lsqlin(I, z, Aineq, bineq, [], [], lb, ub, [], options);
% Compute norms of residuals
f = spapi(k, x, zp);
dataResidual = fnval(f, xq) - yq;
projectionResidual=zp-z;
fval = norm(dataResidual).^2/2 + norm(projectionResidual)^2/2;
% Compute jacobian and gradient
if nargout > 1
% Find active ctr
idx_active = abs(Aineq * zp - bineq) < 1e-6;
A = Aineq(idx_active,:);
JacP = I- pinv(A) * A;
B = spapi(k, x, eye(numel(x)));
jacobianT = fnval(B, xq);
grad = (jacobianT.' * JacP).'*dataResidual(:) + (JacP-I).'*projectionResidual;
end
end
Matt J
on 14 Feb 2025 at 14:28
Edited: Matt J
on 14 Feb 2025 at 15:14
One idea I have (based on your comment) is to add the unit vectors ([-1 0 0 0 0] and [1 0 0 0 0]) to Aineq ...
That's more or less what I would do. If the dimension is large, you might be able to improve efficiency using sparse operations:
function A = getActiveConstraints(zp,Aineq,bineq,Aeq,beq,lb,ub)
arguments
zp (:,1)
Aineq
bineq
Aeq
beq
lb (:,1)
ub (:,1)
end
I=speye(numel(zp));
A=cell(4,1);
if ~isempty(Aineq)
idx=abs(Aineq * zp - bineq) < 1e-6;
A{1}=Aineq(idx,:);
end
% if ~isempty(Aeq)
% idx=abs(Aeq * zp - beq) < 1e-6;
% A{2}=Aeq(idx,:);
% end
A{2}=Aeq; %If lsqlin was successful, equality constraints will always be active
if ~isempty(ub)
idx=abs(zp - ub) < 1e-6;
A{3}=full(I(idx,:));
end
if ~isempty(lb)
idx=abs(zp - lb) < 1e-6;
A{4}=full(-I(idx,:));
end
A=vertcat(A{:});
end
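A hypothetical usage inside the objective function, right after the lsqlin projection (names follow the code above):
Aact = getActiveConstraints(zp, Aineq, bineq, [], [], lb, ub);
I = eye(numel(zp));
JacP = I - pinv(Aact)*Aact; % projector onto the null space of the active rows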
SA-W
on 14 Feb 2025 at 19:45
Edited: SA-W
on 14 Feb 2025 at 19:47
Thank you. Maybe a problem-specific detail:
if ~isempty(Aineq)
idx=abs(Aineq * zp - bineq) < 1e-6;
A{1}=Aineq(idx,:);
end
How should I set the tolerance (here 1e-6) to identify active constraints? In general, my splines have a small co-domain (say the interpolation values are in [0, 1]). Hence, the entries of Aineq, which encodes constraints such as convexity or positivity, are also rather small.
Example: say Aineq has two rows and Aineq*zp produces [1e-5, 1e-7]. With a tolerance of 1e-6, we would only identify the second constraint as active. How does neglecting constraints close to our specified tolerance affect nablaP?
Regardless of the scale of the parameters, I think it is not so easy to find a threshold which separates active constraints from inactive ones at the projected point.
Matt J
on 14 Feb 2025 at 20:24
Edited: Matt J
on 14 Feb 2025 at 20:46
"Hence, the entries of Aineq, which encodes constraints such as convexity or positivity, are also rather small."
Remember that the matrix entries defining a constraint are unique only up to a positive scalar multiple. Accordingly, I usually normalize my constraints so that the rows always have norm =1, e.g.,
s=vecnorm(Aineq,2,2);
Aineq=Aineq./s;
bineq=bineq./s;
This is the same as saying that the hyperplanes that bound your linearly constrained region are represented with unit normals. Once normalized, though, the selection of a threshold tolerance is a subjective thing. You will just have to experiment with what gives good results.
Matt J
on 14 Feb 2025 at 23:51
This is the same as saying that the hyperplanes that bound your linearly constrained region are represented with unit normals.
And consequently, constraints normalized this way will mean that the condition,
abs(Aineq(i,:) * zp - bineq(i))==d
implies that zp is exactly a distance d (in L2-norm) from the constraining hyperplane Aineq(i,:)*x = bineq(i). I don't know if that helps you choose your tolerance, but maybe it will.
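A quick 2-D check of that distance interpretation (hypothetical numbers):
a = [3 4]/5; bi = 1; % normalized row: norm(a) == 1
zp = [2; 1];
abs(a*zp - bi) % = 1, the L2 distance from zp to the hyperplane a*x == bi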
SA-W
on 15 Feb 2025 at 9:13
I don't know if that helps you choose your tolerance, but maybe it will.
Not sure how it might help, but normalizing the rows of Aineq is in any case a good idea if different constraint sets (e.g. f >= 0 and f'' >= 0) are assembled into Aineq, so that the different sets are weighted equally. Does it make sense to formulate it like that?
Another hyperparameter is the weighting factor associated with the projection residual:
zp = lsqlin(I, z, Aineq, bineq, [], [], [], [], [], options);
projRes = zp-z;
fval = fval + weight * norm(projRes).^2
Usually, such a weight is chosen on a case-by-case basis and serves to balance the different contributions to the residual.
In our case, the value we assign to "weight" does not really matter, as the solution is characterized by projRes = 0. So it may affect the path taken to the solution, but it should not affect the solution itself, right?
Matt J
on 15 Feb 2025 at 17:42
"but it should not affect the solution itself, right?"
I dont think it should, no.
More Answers (2)
Shubham
on 5 May 2023
Hi SA-W,
Yes, there are a few options you can try to ensure that the linear inequality constraints are satisfied at intermediate iterations of the interior-point algorithm in fmincon:
- Tighten the tolerances: By tightening the tolerances in the fmincon options, you can force the algorithm to take smaller steps and converge more slowly, but with higher accuracy. This may help to ensure that the linear inequality constraints are satisfied at all intermediate iterations.
- Use a barrier function: You can try using a barrier function to penalize violations of the linear inequality constraints. This can be done by adding a term to the objective function that grows very large as the constraints are violated. This will encourage the algorithm to stay within the feasible region defined by the constraints.
- Use a penalty function: Similar to a barrier function, a penalty function can be used to penalize violations of the linear inequality constraints. However, instead of growing very large, the penalty function grows linearly with the degree of violation. This can be a more computationally efficient approach than a barrier function.
- Use a combination of methods: You can try using a combination of the above methods to ensure that the linear inequality constraints are satisfied at all intermediate iterations. For example, you could tighten the tolerances and use a penalty function or barrier function to further enforce the constraints.
It's important to note that these methods may increase the computational cost of the optimization problem, so it's important to balance the accuracy requirements with the available resources.
Matt J
on 5 May 2023
Within your objective function, check whether the linear inequalities are satisfied. If they are not, abort the function and return Inf. Otherwise, compute the output value as usual.
fun=@(x) myObjective(x, A_ineq,b_ineq);
x=fmincon(fun, x0,A_ineq,b_ineq,[],[],lb,ub,[],options)
function fval=myObjective(x, A_ineq,b_ineq)
if ~all(A_ineq*x<=b_ineq)
fval=Inf; return
end
fval=...
end
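Note that this relies on the solver recovering from non-finite objective values (the interior-point and sqp algorithms are designed to do so), and the start point must itself satisfy the inequalities for the search to get going. A quick guard (a sketch):
assert(all(A_ineq*x0 <= b_ineq), 'x0 must satisfy the linear inequality constraints');
options = optimoptions('fmincon', 'Algorithm', 'interior-point');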