Implementing Actor-Critic with Experience Replay for Continuous Action Spaces

I have been trying to implement the ACER algorithm for continuous action spaces in reinforcement learning. The paper describing the algorithm is "Sample Efficient Actor-Critic with Experience Replay". I have implemented parts of the algorithm, but I have run into some roadblocks that I have not been able to resolve. The following…
