Reinforcement Learning based quadrotor control using Soft Actor-Critic (the reward is not converging)

Unmanned Aerial and Space Systems on 30 Apr 2022

Edited: Unmanned Aerial and Space Systems on 1 May 2022

Hi, I am trying to control a rotary-wing UAV (quadrotor) using the Soft Actor-Critic (SAC) method, but the reward does not converge: it keeps increasing continuously after the point shown in the following image. What is the main problem, and can you advise on how to handle this situation? I am sharing my files (a Simulink model and an m-file). The maximum reward should be zero, as defined by the reward function in the Simulink file; a reward of zero indicates that the difference between the desired trajectory and the actual trajectory is approximately zero.
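To make the "maximum reward is zero" statement concrete, here is a minimal sketch of a tracking reward of that kind. The actual reward function lives in the Simulink file and is not reproduced here, so the quadratic form and the names `tracking_reward`, `episode_return`, and `moving_average` are assumptions for illustration only. With a reward like this, every episode return is at most zero, so a curve that keeps rising toward zero from below is what improved tracking would look like.

```python
def tracking_reward(desired, actual):
    """Hypothetical per-step reward: negative squared position error.

    The value is never above zero and equals zero only when the actual
    position matches the desired position exactly.
    """
    return -sum((d - a) ** 2 for d, a in zip(desired, actual))


def episode_return(desired_traj, actual_traj):
    """Sum of the per-step rewards over one episode (no discounting)."""
    return sum(tracking_reward(d, a) for d, a in zip(desired_traj, actual_traj))


def moving_average(values, window):
    """Trailing moving average, useful for judging whether returns have plateaued."""
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


# Assumed example: hover at 1 m altitude for three steps.
desired = [(0.0, 0.0, 1.0)] * 3
perfect = episode_return(desired, desired)                # exact tracking
offset = episode_return(desired, [(0.0, 0.0, 0.5)] * 3)   # constant 0.5 m error
```

Under these assumptions, `perfect` is the upper bound of zero and `offset` is negative. Applying `moving_average` to the logged episode returns gives a smoother curve for checking whether training has levelled off near that bound.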

Answers (0)

Release

R2021b
