I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

Question

Maha Mosalam on 19 Dec 2021

0
Link

Direct link to this question

https://nl.mathworks.com/matlabcentral/answers/1614395-i-am-using-ddpg-if-there-are-four-network-to-algorithm-actor-target-actor-critic-target-crit

Answered: Yash on 23 Dec 2024

for example online actor=10^-1 and target actor 10^-2...how I can do this in matlab?

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Yash on 23 Dec 2024

0
Link

Direct link to this answer

https://nl.mathworks.com/matlabcentral/answers/1614395-i-am-using-ddpg-if-there-are-four-network-to-algorithm-actor-target-actor-critic-target-crit#answer_1556284

Open in MATLAB Online

Yes, you can use different learning rates for Actor and Critic by specifying them individually when setting up your training options for DDPG agent. Here is a simple code snippet to achieve this:

actorOptimizerOptions = rlOptimizerOptions(LearnRate=1e-1)
criticOptimizerOptions = rlOptimizerOptions(LearnRate=1e-2)
opt = rlDDPGAgentOptions('ActorOptimizerOptions',actorOptimizerOptions,'CriticOptimizerOptions',criticOptimizerOptions)

Refer to this documentation page for more information on creating an object for DDPG agent: https://www.mathworks.com/help/reinforcement-learning/ref/rl.option.rlddpgagentoptions.html

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments