How to use the reinforcement learning toolbox in Matlab to implement delayed reward
Show older comments
I want to implement delayed reward with matlab code. For example, I need to wait until the end of my current episode before giving the reward for each action in this episode. How can I achieve this?
Accepted Answer
More Answers (1)
MOHAMMADREZA
on 5 Mar 2025
0 votes
Hi, I am having the same problem. Hwever, I am using the Matlab heper (class) for environment. I do not know how to handle reward so that at the end of episode the reward is used for updating the parameters. More specifically, when using class template, I have step, reset,... functions. when the parameters is updated? is it after running step function? I wrote the reward in the step function. but I need to update the parameters only at the end of episode.
Categories
Find more on Reinforcement Learning in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!