support multi-dim reward for AC and PPO (#952)
* support multi-dim reward for AC and PPO
* address comments
* more updates
- alf/algorithms/actor_critic_algorithm.py: 22 additions, 3 deletions
- alf/algorithms/actor_critic_algorithm_test.py: 9 additions, 5 deletions
- alf/algorithms/actor_critic_loss.py: 50 additions, 14 deletions
- alf/algorithms/agent.py: 3 additions, 3 deletions
- alf/algorithms/lagrangian_reward_weight_algorithm.py: 155 additions, 20 deletions
- alf/algorithms/lagrangian_reward_weight_algorithm_test.py: 17 additions, 9 deletions
- alf/algorithms/muzero_algorithm.py: 2 additions, 2 deletions
- alf/algorithms/ppo_algorithm.py: 15 additions, 9 deletions
- alf/algorithms/ppo_algorithm_test.py: 8 additions, 4 deletions
- alf/algorithms/ppo_loss.py: 3 additions, 1 deletion
- alf/algorithms/rl_algorithm_test.py: 10 additions, 3 deletions
- alf/algorithms/sac_algorithm.py: 7 additions, 3 deletions
- alf/examples/ppo_conf.py: 34 additions, 0 deletions
- alf/networks/network.py: 1 addition, 1 deletion
- alf/utils/tensor_utils.py: 9 additions, 5 deletions