Should I consider mean or sampled value for action selection in ppo algorithm?

When considering the policy network in PPO algorithm, we need to fit a gaussian distribution to the neural network output(for a continuous action space problem). When I use this network to obtain action, should I sample from the fitted distribution or directly consider the mean output from the policy nn? Also, what should be the…

Details

How to stop nginx from adding ending slash

I don’t know why, but if I go to mysite.com/about nginx returns a 301 and directs to mysite.com/about/ (with the trailing slash). This is my conf. It’s basically just the default ubuntu apt-get config: user www-data; worker_processes auto; pid /run/nginx.pid; include /etc/nginx/modules-enabled/*.conf; events { worker_connections 768; # multi_accept on; } http { server { root…

Details