Tune PI Controller Using Reinforcement Learning

Question

嘻嘻 on 18 Oct 2023

Direct link to this question

https://nl.mathworks.com/matlabcentral/answers/2035244-tune-pi-controller-using-reinforcement-learning

Answered: Emmanouil Tzorakoleftherakis on 23 Oct 2023

Accepted Answer: Emmanouil Tzorakoleftherakis

How are the initial values of this neural network's weights determined? If I want to change my PI controller to a PID controller, do I just add a third weight to the line initialGain = single([1e-3 2])?

This code is from the demo "Tune PI Controller Using Reinforcement Learning."

initialGain = single([1e-3 2]);
actorNet = [
    featureInputLayer(numObs)
    fullyConnectedPILayer(initialGain,'ActOutLyr')
    ];
actorNet = dlnetwork(actorNet);
actor = rlContinuousDeterministicActor(actorNet,obsInfo,actInfo);

Can my network be changed to look like the following?

actorNet = [
    featureInputLayer(numObs)
    fullyConnectedPILayer(randi([-60,60],1,3),'Action')
    ];

3 Comments

嘻嘻 on 18 Oct 2023

I want the weights of the network to represent the controller gains, the inputs of the network to be the error, its integral, and its first derivative, and the output of the network to be the control signal.

嘻嘻 on 18 Oct 2023

I'm not really sure. What do you think of this scheme?
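The scheme described in these comments amounts to a single linear layer whose weights are the controller gains. Below is a minimal sketch in plain MATLAB, assuming the layer computes abs(weights)*observation (which is how the example describes fullyConnectedPILayer) and assuming the observation order [integral of error; error; error derivative]. The third gain and all observation values are made-up illustration values, not taken from the example.

```matlab
% Gains ordered to match the observation vector below.
% The first two values are the example's initialGain; the third is hypothetical.
gains = single([1e-3 2 0.1]);      % assumed order: [Ki Kp Kd]

% Hypothetical observation at one time step:
% [integral of error; error; derivative of error]
obs = single([0.5; 0.2; -0.05]);

% abs() mirrors the non-negative weight constraint assumed for the custom layer,
% so the sign of an initial weight would not matter under this assumption.
u = abs(gains) * obs;              % control signal: Ki*int(e) + Kp*e + Kd*de/dt

fprintf('u = %.4f\n', u);
```

A 1-by-3 weight vector only multiplies correctly if the observation has three elements, so numObs and obsInfo would have to describe a 3-element observation as well.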

Answer 1

Emmanouil Tzorakoleftherakis on 23 Oct 2023

Direct link to this answer

https://nl.mathworks.com/matlabcentral/answers/2035244-tune-pi-controller-using-reinforcement-learning#answer_1339046

I also replied to the other thread. fullyConnectedPILayer is a custom layer provided with the example; you can open it to see how it is implemented. You can certainly add a third weight for the D term, but you will most likely run into other issues (e.g., how to approximate the error derivative).
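On the error-derivative issue mentioned above, one common approach (not something the example itself provides) is a backward difference followed by a first-order low-pass filter, so that measurement noise is not amplified too much. Below is a minimal sketch in plain MATLAB. The sample time Ts, the filter time constant tau, and the synthetic error signal are all assumptions chosen for illustration.

```matlab
Ts  = 0.1;                         % sample time in seconds (assumed)
tau = 0.5;                         % derivative filter time constant (assumed)
t   = (0:Ts:10)';

% Synthetic error signal: a decaying error plus a small ripple
% standing in for measurement noise.
e = exp(-t) + 0.01*sin(20*t);

% Plain backward difference; the first sample has no predecessor, so use 0.
dRaw = [0; diff(e)] / Ts;

% First-order low-pass filter (backward Euler discretization) on the raw derivative.
a = tau / (tau + Ts);              % filter coefficient, between 0 and 1
dFilt = zeros(size(e));
for k = 2:numel(e)
    dFilt(k) = a*dFilt(k-1) + (1 - a)*dRaw(k);
end
```

The filtered signal dFilt would then be the third observation fed to the actor, next to the error and its integral. A larger tau gives a smoother but more delayed derivative estimate, so it is a tuning choice in its own right.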
