Reinforcement Learning Toolbox train two agent
Show older comments

I want to train a DDPG model like this architecture.
I train this model with 500 episodes and 1 episodes have 1000 step.
But when I run the m-file.
The result is train 'RL Agent_1' 500 episodes then train 'RL Agent_2' 500 episodes.
This result will let the parameters 'x1' in the environment cannot return to 'Subsystem_1'.
How to fix this problem?
Accepted Answer
More Answers (0)
Categories
Find more on Reinforcement Learning in Help Center and File Exchange
Products
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!