Training problem in DDPG agent

Question

Sam Chen on 2 Mar 2020

0
Link

Direct link to this question

https://nl.mathworks.com/matlabcentral/answers/508442-training-problem-in-ddpg-agent

Answered: Benjamin Feaster on 2 Mar 2020

IMAG4696.jpg

I have a problem training with DDPG as shown below. The Episode Q0 became NaN at episode 658, I had saved all the agent during training and checked the parameter by 'getCritic' and 'getActor', it seems that all weights in neural network became NaN between agent657 and agent658. I can't figure out what happened during the training.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Benjamin Feaster on 2 Mar 2020

0
Link

Direct link to this answer

https://nl.mathworks.com/matlabcentral/answers/508442-training-problem-in-ddpg-agent#answer_418152

Open in MATLAB Online

I had this problem once as well when training an AC agent. This can happen when an equation(probably in your step function), tried to calculate one of the following:

zero/zero, zero*infinity, infinity/infinity, infinity-infinity.

Try troubleshooting with something like this at the end of your step function:

if any(isnan(NextObs), 'all') % if any element in NextObs matrix contains a NaN
    [row, col] = find(isnan(NextObs)) % Display the row and column position in the matrix
end

Note this will also work with a NextObs vector. This will give you the row and column position of the first NaN value and ouput it to the command line. You can then determine which NextObs value this corresponds to and find where in your code that value is calculated.

Without looking at the code I can only give limited advice. Also make sure you have a "fallback" value when calculating your NextObs if your implementation requires it:

if something == 1
    NextObs = 2; % your regular calculations you have implemented already
else
    NextObs = -1; % return a number instead to represent the NextObs value doesnt apply for this step
end

I referenced two posts:

Roger Stafford's answer on this post, and Steven Lord's answer on this post. Hope this helps!

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Training problem in DDPG agent

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Training problem in DDPG agent

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments