- Custom Training: https://www.mathworks.com/help/reinforcement-learning/ug/train-reinforcement-learning-policy-using-custom-training.html
- ‘train’ function to train reinforcement learning agents: https://in.mathworks.com/help/reinforcement-learning/ref/rl.agent.rlqagent.train.html
- Training Using Existing Training Data: https://www.mathworks.com/help/reinforcement-learning/ref/rl.agent.rlqagent.train.html
- ‘rlMultiAgentTrainingOptions‘: https://www.mathworks.com/help/reinforcement-learning/ref/rlmultiagenttrainingoptions.html
Is it possible to change some parameters during a RL training?
2 views (last 30 days)
Show older comments
Hello everyone,
For those who don't know, I am currently working on a supervised learning/RL learning training session with a 3DOF model of a jet aircraft.
My idea is to change, during a training, some parameters such as the number of allowed interventions of the external controller as well as their duration. Another whish is to change accordingly these parametes as the agent makes progress, e.g. its episode reward will increase during training. Is it possible to do this during a training? That is, is it possible to implement some for loops that exploits the trainStats object while the training is runnig? Thank you
0 Comments
Accepted Answer
Subhajyoti
on 5 Aug 2024
Hi Leonardo,
Custom training loops and callbacks can adjust the parameters dynamically during the training of supervised learning or reinforcement learning models in the Reinforcement Learning Toolbox.
You can implement a custom training loop by defining your environment and agent, then loop through a predefined number of episodes. Within each episode, interact with the environment by selecting actions, applying them, and updating the agent based on the received rewards and observations. Log the total reward for each episode to monitor performance.
Monitor metrics like total reward per episode and adjust parameters such as the number of allowed interventions and their duration if specific criteria are met (e.g., every 50 episodes or when the average reward exceeds a threshold).
Use the ‘trainStats’ object to log and monitor performance metrics during training. Store the metrics like total reward for each episode, which can then be used to calculate statistics for dynamic parameter adjustment.
By combining a custom training loop, dynamic parameter adjustment, and the ‘trainStats’ object, an adaptive training process can be created that enhances the learning efficiency of the RL agent.
You may go through the following MathWorks documentation links to know more about Training RL Agents options and ‘trainStats’.
I hope this helps.
0 Comments
More Answers (1)
Yash Sharma
on 5 Aug 2024
Hi Leonardo,
If you want to change some parameters while the training is running, you will need to implement a custom training loop. Below is the documentation that can help you achieve this.
Train Reinforcement Learning Policy Using Custom Training Loop: https://www.mathworks.com/help/reinforcement-learning/ug/train-reinforcement-learning-policy-using-custom-training.html
0 Comments
See Also
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!