Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?
TD3, SAC, and similar agents support recurrent layers when used on their own. Would using one of these recurrent agents as the base agent still not work with MBPO?
Could this limitation be circumvented by writing a custom training loop for the environment model and the base agent?
2 Comments
Naren Raman on 6 May 2024
Thank you for your question. No, MBPO agents do not currently support recurrent networks, as noted in the documentation. Yes, a custom training loop is more flexible, and you should be able to use one to build a custom MBPO agent with recurrent neural networks.
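As a rough illustration of what such a custom loop has to handle, here is a minimal, language-agnostic sketch written in plain Python. It is not Reinforcement Learning Toolbox code, and every name in it (`TinyRecurrentModel`, `model_rollouts`, the linear `policy`) is hypothetical. It assumes scalar states and actions and an untrained Elman-style model, and it shows only the model-rollout step: the hidden state is reset at the start of each branched rollout and carried across its steps, and each rollout is kept as a whole sequence, on the assumption that a recurrent base agent would train on sequences rather than on isolated transitions.

```python
import math
import random


class TinyRecurrentModel:
    """Hypothetical Elman-style environment model (untrained, illustration only).

    Maps (state, action, hidden) -> (next_state, reward, new_hidden).
    """

    def __init__(self, hidden_size=4, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        # Input weights: one row per hidden unit, columns for (state, action).
        self.w_in = [[rng.uniform(-0.5, 0.5) for _ in range(2)]
                     for _ in range(hidden_size)]
        # Recurrent weights: hidden -> hidden.
        self.w_rec = [[rng.uniform(-0.5, 0.5) for _ in range(hidden_size)]
                      for _ in range(hidden_size)]
        # Output weights: row 0 predicts next state, row 1 predicts reward.
        self.w_out = [[rng.uniform(-0.5, 0.5) for _ in range(hidden_size)]
                      for _ in range(2)]

    def initial_hidden(self):
        return [0.0] * self.hidden_size

    def step(self, state, action, hidden):
        x = (state, action)
        new_hidden = [
            math.tanh(
                sum(w * xi for w, xi in zip(self.w_in[j], x))
                + sum(w * h for w, h in zip(self.w_rec[j], hidden))
            )
            for j in range(self.hidden_size)
        ]
        next_state, reward = (
            sum(w * h for w, h in zip(row, new_hidden)) for row in self.w_out
        )
        return next_state, reward, new_hidden


def model_rollouts(model, start_states, policy, horizon):
    """Branch one short model rollout from each real start state.

    Returns a list of sequences; each sequence is a list of
    (state, action, reward, next_state) tuples in time order.
    """
    sequences = []
    for state in start_states:
        hidden = model.initial_hidden()  # reset at the start of every branch
        sequence = []
        for _ in range(horizon):
            action = policy(state)
            # The hidden state returned here feeds the next step of this rollout.
            next_state, reward, hidden = model.step(state, action, hidden)
            sequence.append((state, action, reward, next_state))
            state = next_state
        sequences.append(sequence)
    return sequences


# Start states would normally be sampled from the real-experience buffer.
real_states = [0.1, -0.3, 0.7]
rollouts = model_rollouts(
    TinyRecurrentModel(), real_states, policy=lambda s: -0.5 * s, horizon=3
)
```

In a full loop, the model would first be fitted to real transitions, and the base agent would then be updated on a mix of real and model-generated sequences; both steps are omitted here.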