Commit 6ff31512 authored 4 years ago by emailweixu, committed by GitHub 4 years ago

Minor refactoring for supervised training (#840)

* dropout for TransformerBlock

Add dropout to conform with the original implementation in "Attention Is All You Need",
though it seems to hurt the unit test and the ppo_babyai.py example.

* Configurable activation for TransformerBlock

Also use position embedding only for the first transformer block of TransformerNetwork, which is common practice.

* Slight refactoring of trainer for supervised learning

Also adds an ALF conf example of a language modeling task for supervised learning.

* Fix hypernetwork_algorithm_test

* Fix policy_trainer_test

* Address review comments
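
For readers unfamiliar with the TransformerBlock changes listed above, here is a minimal PyTorch sketch of the three ideas: dropout on each sub-layer output, a configurable activation, and a position embedding added only before the first block. It is not the code from this commit. The class names `SketchTransformerBlock` and `SketchTransformerNetwork`, their constructor arguments, and the learned (rather than sinusoidal) position embedding are all assumptions made for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SketchTransformerBlock(nn.Module):
    """Hypothetical post-norm block; NOT ALF's TransformerBlock."""

    def __init__(self, d_model, num_heads, d_ff, dropout=0.1, activation=F.relu):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, num_heads, batch_first=True)
        self.fc1 = nn.Linear(d_model, d_ff)
        self.fc2 = nn.Linear(d_ff, d_model)
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)
        self.drop = nn.Dropout(dropout)
        # Configurable activation: any callable mapping a tensor to a tensor.
        self.activation = activation

    def forward(self, x):
        # Dropout is applied to each sub-layer output before the residual
        # add and layer norm, as in "Attention Is All You Need".
        attn_out, _ = self.attn(x, x, x, need_weights=False)
        x = self.norm1(x + self.drop(attn_out))
        ff_out = self.fc2(self.activation(self.fc1(x)))
        return self.norm2(x + self.drop(ff_out))


class SketchTransformerNetwork(nn.Module):
    """Hypothetical stack; position embedding is added once, before block 0."""

    def __init__(self, seq_len, d_model, num_heads, d_ff, num_blocks,
                 dropout=0.1, activation=F.relu):
        super().__init__()
        # Assumption: a learned position embedding of shape [seq_len, d_model].
        self.pos = nn.Parameter(torch.zeros(seq_len, d_model))
        self.drop = nn.Dropout(dropout)
        self.blocks = nn.ModuleList([
            SketchTransformerBlock(d_model, num_heads, d_ff, dropout, activation)
            for _ in range(num_blocks)
        ])

    def forward(self, x):
        # x: [batch, seq_len, d_model]. Later blocks see no extra position input.
        x = self.drop(x + self.pos)
        for block in self.blocks:
            x = block(x)
        return x
```

Passing `activation=F.gelu` swaps the feed-forward nonlinearity without touching the block, and `dropout=0.0` disables dropout, which may be useful when checking whether dropout is what hurts a small test or example.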

parent d9a75079


Showing with 340 additions and 167 deletions
