Hyperparameter optimization and saving the best agents for Reinforcement Learning

Question

laha_M on 2 Dec 2020

Direct link to this question

https://nl.mathworks.com/matlabcentral/answers/672698-hyperparameter-optimization-and-saving-the-best-agents-for-reinforcement-learning

Commented: Francisco Serra on 23 Jan 2024

Accepted Answer: Emmanouil Tzorakoleftherakis

I am trying to train my RL agent (DDPG), but it is performing quite poorly. I think the problem may be the hyperparameter values, since I have not tuned them. I have two questions:

1. Is there anything in MATLAB that can help with hyperparameter optimization, other than manual trial and error?
2. How do I save the best-performing agent, given that I don't know the critical values (i.e. I don't know the range of the reward)? Basically, I want to save the agent that achieves the maximum reward or, say, the five highest-reward agents.

Thanks.

Answer 1

Emmanouil Tzorakoleftherakis on 3 Dec 2020

Direct link to this answer

https://nl.mathworks.com/matlabcentral/answers/672698-hyperparameter-optimization-and-saving-the-best-agents-for-reinforcement-learning#answer_564528

Hello,

You can use something like this. Unfortunately, we do not yet have any examples that show how to use it with Reinforcement Learning Toolbox.
If it's hard to estimate what a good episode reward is, you can run a single training session for a good number of episodes (e.g. 5,000) to get an idea of how the agent is doing, and then use what you learn from the training plot to set the 'SaveAgent' option as needed. Most of the time you will need to run multiple training sessions anyway to tweak parameters, rewards, etc., so just use the first one to build some intuition.
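
For the "top-5 highest rewarding agents" part of the question, one possible approach is to keep a small running leaderboard instead of a fixed reward threshold, so the reward range does not need to be known in advance. The sketch below is only an illustration of that bookkeeping. It is written in Python rather than MATLAB, it does not call Reinforcement Learning Toolbox, and the class `TopKAgents`, its methods, and the snapshot labels are hypothetical names introduced here.

```python
import heapq
from typing import Any, List, Tuple


class TopKAgents:
    """Keep the k highest-reward snapshots seen so far (hypothetical helper)."""

    def __init__(self, k: int = 5) -> None:
        if k < 1:
            raise ValueError("k must be at least 1")
        self.k = k
        self._heap: List[Tuple[float, int, Any]] = []  # min-heap ordered by reward
        self._counter = 0  # tie-breaker so snapshots themselves are never compared

    def offer(self, reward: float, snapshot: Any) -> bool:
        """Record a snapshot; return True if it entered the current top k."""
        self._counter += 1
        entry = (reward, self._counter, snapshot)
        if len(self._heap) < self.k:
            heapq.heappush(self._heap, entry)
            return True
        if reward > self._heap[0][0]:
            # Evict the current worst of the top k and insert the new entry.
            heapq.heapreplace(self._heap, entry)
            return True
        return False

    def best(self) -> List[Tuple[float, Any]]:
        """Return (reward, snapshot) pairs, highest reward first."""
        ordered = sorted(self._heap, key=lambda e: (e[0], e[1]), reverse=True)
        return [(reward, snapshot) for reward, _, snapshot in ordered]


# Usage with made-up episode rewards; the snapshot here is just a label,
# but it could be a file path or a copy of the agent.
tracker = TopKAgents(k=2)
for episode, reward in enumerate([-310.0, -120.5, -450.2, -95.0]):
    tracker.offer(reward, f"agent_ep{episode}")
print(tracker.best())  # [(-95.0, 'agent_ep3'), (-120.5, 'agent_ep1')]
```

A min-heap is used so that the weakest of the kept snapshots is always at the front and can be evicted cheaply. In practice, `offer` returning True would be the point at which to write the agent to disk.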

2 Comments

laha_M on 4 Dec 2020

Thanks, Emmanouil.

Francisco Serra on 23 Jan 2024

Hey @laha_M, did you manage to do this with RL Toolbox?
