-
- Downloads
Minor refactoring for supervised training (#840)
* dropout for TransformerBlock Include dropout to conform with the original implementation in "Attention is all you need". Though it seems to hurt the unittest and ppo_babyai.py example. * Configurable activation for TransformerBlock Also only use position embedding for the first tranformer block of TransformerNetwork, which is the common practice. * Slight refactoring of trainer for supervised learning Also adds an alf conf example of language modeling task for supervised learning. * Fix hypernetwork_algorithm_test * Fix policy_trainer_test * Address review comments
Showing
- alf/algorithms/algorithm.py 19 additions, 1 deletionalf/algorithms/algorithm.py
- alf/algorithms/config.py 4 additions, 1 deletionalf/algorithms/config.py
- alf/algorithms/hypernetwork_algorithm.py 54 additions, 41 deletionsalf/algorithms/hypernetwork_algorithm.py
- alf/algorithms/hypernetwork_algorithm_test.py 9 additions, 8 deletionsalf/algorithms/hypernetwork_algorithm_test.py
- alf/algorithms/planning_algorithm.py 4 additions, 0 deletionsalf/algorithms/planning_algorithm.py
- alf/algorithms/prior_actor.py 4 additions, 2 deletionsalf/algorithms/prior_actor.py
- alf/algorithms/reward_learning_algorithm.py 4 additions, 2 deletionsalf/algorithms/reward_learning_algorithm.py
- alf/algorithms/rl_algorithm.py 0 additions, 17 deletionsalf/algorithms/rl_algorithm.py
- alf/bin/grid_search.py 1 addition, 1 deletionalf/bin/grid_search.py
- alf/bin/train.py 4 additions, 6 deletionsalf/bin/train.py
- alf/config_util.py 8 additions, 2 deletionsalf/config_util.py
- alf/examples/hypernet_mnist.gin 4 additions, 3 deletionsalf/examples/hypernet_mnist.gin
- alf/examples/lm_conf.py 146 additions, 0 deletionsalf/examples/lm_conf.py
- alf/layers.py 24 additions, 9 deletionsalf/layers.py
- alf/networks/networks.py 14 additions, 1 deletionalf/networks/networks.py
- alf/networks/transformer_networks.py 18 additions, 7 deletionsalf/networks/transformer_networks.py
- alf/optimizers/optimizers.py 10 additions, 9 deletionsalf/optimizers/optimizers.py
- alf/trainers/policy_trainer.py 1 addition, 45 deletionsalf/trainers/policy_trainer.py
- alf/trainers/policy_trainer_test.py 6 additions, 10 deletionsalf/trainers/policy_trainer_test.py
- alf/utils/common.py 6 additions, 2 deletionsalf/utils/common.py
Loading
Please register or sign in to comment