- Dec 30, 2022
-
- Dec 20, 2022
-
-
Haichao Zhang authored
-
Haonan Yu authored
-
Haonan Yu authored
-
- Nov 22, 2022
-
-
Haichao Zhang authored
* get_mode for mixture distribution
* Add comments
* Address comments
-
- Nov 12, 2022
-
-
Haonan Yu authored
* add RandomCrop gym wrapper
* address comments
-
Break Yang authored
-
- Nov 10, 2022
-
-
Break Yang authored
-
- Nov 02, 2022
-
-
Break Yang authored
* Unify ParallelNormalProjectionNetwork and NormalProjectionNetwork
* Add make_parallel() back for normal projection network
* Add parallel FC version of make_parallel to BetaProjectionNetwork
* Implement the MixtureProjectionNetwork
* Address comments
-
Break Yang authored
* Unify ParallelNormalProjectionNetwork and NormalProjectionNetwork
* Add make_parallel() back for normal projection network
* Add parallel FC version of make_parallel to BetaProjectionNetwork
-
Haichao Zhang authored
* VQ VAE Algorithm
* Address comments
* Fix embedding summary
-
- Nov 01, 2022
-
-
Break Yang authored
* Support replace() for TensorSpec
* Add type hints for replace()
-
Haonan Yu authored
* check unused parameters
* update
* improve warning message
-
- Oct 29, 2022
-
-
Haichao Zhang authored
* Causal BC Algorithm
* Add comments
* Address comments
* Address comments
-
- Oct 22, 2022
-
-
Haichao Zhang authored
* TCN wrapper
* Address comments
* Reduce arguments
-
- Oct 21, 2022
-
-
emailweixu authored
* KLD for various DiscreteRegressionLoss
* Fix unittest
-
- Oct 19, 2022
-
-
Break Yang authored
-
- Oct 17, 2022
-
-
Haonan Yu authored
* MoNet algorithm
* add train play test
* address reviews; adding more code comments
* add illustrative graph for the unet
-
- Oct 10, 2022
-
-
Haonan Yu authored
-
- Oct 07, 2022
-
-
Break Yang authored
* Use torch.div for floor div in replay buffer
* Address comments
-
Haichao Zhang authored
* BC for CARLA
* Address comments
* Address more comments
* Address further comment
-
emailweixu authored
* Support asynchronous unroll
  Perform unroll and training concurrently. This can be useful in two situations:
  1. The environment step is expensive. This can help reduce the time spent on unroll.
  2. Real-time training, where the interaction with the environment happens in real time. For this, we can set TrainerConfig.unroll_step_interval to the desired interaction period.
* Fix circular import
* Address comments
* Fix failure for build.sh
  Need to avoid messing up the command-line arguments.
* Address comments
-
- Sep 30, 2022
-
-
Haichao Zhang authored
-
- Sep 21, 2022
-
-
Haonan Yu authored
-
- Sep 16, 2022
-
-
emailweixu authored
Also fixed a potential "RuntimeError: unable to open shared memory object" when many training sessions are running.
-
- Sep 15, 2022
-
-
Haonan Yu authored
* support detach for distributions
* kl_divergence has been fixed by pytorch
* address comments
-
- Sep 12, 2022
-
-
Break Yang authored
* Add MuZero configuration for MetaDrive
* Add the training curve
* Address comments
* Use LSTM for reward prediction
-
Haonan Yu authored
-
Haonan Yu authored
-
- Sep 02, 2022
-
-
Break Yang authored
-
- Sep 01, 2022
-
-
Break Yang authored
* Add extra rewards to metadrive
* Address comments
-
Haonan Yu authored
* add advanced Fetch envs
* address comments and add gripper orientation into obs
* address comments
-
Break Yang authored
* Support injected summarize function from config for summarize_rollout
* inject_summary -> custom_summary
-
- Aug 31, 2022
-
-
Break Yang authored
-
- Aug 27, 2022
-
-
Haonan Yu authored
* add OneHotCategoricalGumbelSoftmax
* fix typo
-
Haichao Zhang authored
* IQL algorithm
* Some minor updates
* Remove unused arguments for actor training; state-independent std, etc.
* Address comments
-
- Aug 26, 2022
-
-
Haonan Yu authored
-
- Aug 25, 2022
-
-
Haonan Yu authored
* add discrete VAE
* also return z_mode for discrete vae
* add ST gumbel-softmax
* update
* add scheduler option for the temperature
* address comments
-
- Aug 18, 2022
-
-
Haichao Zhang authored
* RewardShifting data transformer
* Make bias a required argument
* Fix observation_spec argument
-
- Aug 17, 2022
-
-
Haonan Yu authored
* add image decoding network with upsampling+conv
* reference of padding modes
-