Commits · hobot_01052023 · Philipp Sauer / alf

Dec 30, 2022
- Fix comments in module_py (#1420) · f5ec2ce6
  Haichao Zhang authored 2 years ago
  
  f5ec2ce6
Dec 20, 2022
- Support reset_parameters for NaiveParallelLayer (#1417) · 318a63a2
  Haichao Zhang authored 2 years ago
  
  318a63a2
- fix frameresize of floating image (#1419) · 653f3c90
  Haonan Yu authored 2 years ago
  
  653f3c90
- add FrameFlip wrapper (#1418) · 4e3a61b1
  Haonan Yu authored 2 years ago
  
  4e3a61b1
Nov 22, 2022
- get_mode for mixture distribution (#1415) · 0f8d0ec5
  Haichao Zhang authored 2 years ago
  
  * get_mode for mixture distribution * Add comments * Address comments
  0f8d0ec5
Nov 12, 2022
- add RandomCrop gym wrapper (#1412) · 306c8744
  Haonan Yu authored 2 years ago
  
  * add RandomCrop gym wrapper * address comments
  306c8744
- [Tiny] Quick fix for torchvision (#1414) · 530d776b
  Break Yang authored 2 years ago
  
  530d776b
Nov 10, 2022
- [Tiny] Use collections.abc.Mapping instead of collections.Mapping (#1413) · cdd101f4
  Break Yang authored 2 years ago
  
  cdd101f4
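
The alias `collections.Mapping` was deprecated in Python 3.3 and removed in Python 3.10, so type checks have to import from `collections.abc`. A minimal sketch of the pattern follows; the `flatten` helper is hypothetical and is not taken from the ALF code base.

```python
from collections.abc import Mapping  # `collections.Mapping` no longer exists on Python >= 3.10


def flatten(nest, prefix=""):
    """Flatten a nested mapping into {"outer.inner": value} pairs.

    Hypothetical helper used only to illustrate the isinstance check.
    """
    flat = {}
    for key, value in nest.items():
        path = f"{prefix}.{key}" if prefix else str(key)
        if isinstance(value, Mapping):
            # Recurse into any mapping type, not only plain dicts.
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat
```

Checking against the abstract `Mapping` class rather than `dict` keeps the helper usable with other mapping types such as `OrderedDict`.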
Nov 02, 2022

- Implement the MixtureProjectionNetwork (#1411) · 88871599
  Break Yang authored 2 years ago
  
  * Unify ParallelNormalProjectionNetwork and NormalProjectionNetwork * Add make_parallel() back for normal projection network * Add parallel FC version of make_parallel to BetaProjectionNetwork * Implement the MixtureProjectionNetwork * Address comments
  88871599

- Add parallelism support for Normal and Beta projection networks. (#1410) · 4f16843f
  Break Yang authored 2 years ago
  
  * Unify ParallelNormalProjectionNetwork and NormalProjectionNetwork * Add make_parallel() back for normal projection network * Add parallel FC version of make_parallel to BetaProjectionNetwork
  4f16843f

- VQ VAE Algorithm (#1409) · d4c60d2c
  Haichao Zhang authored 2 years ago
  
  * VQ VAE Algorithm * Address comments * Fix embedding summary
  d4c60d2c

Nov 01, 2022
- Support replace() for TensorSpec (#1407) · 59cf10cd
  Break Yang authored 2 years ago
  
  * Support replace() for TensorSpec * Add type hints for replace()
  59cf10cd
- check unused parameters (#1406) · f20a7518
  Haonan Yu authored 2 years ago
  
  * check unused parameters * update * improve warning message
  f20a7518
Oct 29, 2022
- Causal Imitation Learning Algorithm (#1403) · 7392e331
  Haichao Zhang authored 2 years ago
  
  * Causal BC Algorithm * Add comments * Address comments * Address comments
  7392e331
Oct 22, 2022
- TCN wrapper (#1402) · e93ceef0
  Haichao Zhang authored 2 years ago
  
  * TCN wrapper * Address comments * Reduce arguments
  e93ceef0
Oct 21, 2022
- KLD for various DiscreteRegressionLoss (#1400) · a75fb8f8
  emailweixu authored 2 years ago
  
  * KLD for various DiscreteRegressionLoss * Fix unittest
  a75fb8f8
Oct 19, 2022
- Convert MBRL examples to python config (#1401) · 6b228bde
  Break Yang authored 2 years ago
  
  6b228bde
Oct 17, 2022

- MoNet algorithm (#1399) · ed9f76be
  Haonan Yu authored 2 years ago
  
  * MoNet algorithm * add train play test * address reviews; adding more code comments * add illustrative graph for the unet
  ed9f76be

Oct 10, 2022
- Remove duplicate observe_for_metrics (#1398) · 30dc8f87
  Haonan Yu authored 2 years ago
  
  30dc8f87
Oct 07, 2022

- Use torch.div for floor div in replay buffer (#1396) · efc304ed
  Break Yang authored 2 years ago
  
  * Use torch.div for floor div in replay buffer * Address comments
  efc304ed

- BC for CARLA (#1395) · 3b64a133
  Haichao Zhang authored 2 years ago
  
  * BC for CARLA * Address comments * Address more comments * Address further comment
  3b64a133

- Support asynchronous unroll (#1397) · af7ed9d3
  emailweixu authored 2 years ago
  
  * Support asynchronous unroll
  
  Perform unroll and training concurrently. This can be useful in two situations:
  1. The environment step is expensive. Running unroll concurrently with training helps reduce the time spent on unroll.
  2. Real-time training, where the interaction with the environment happens in real time. For this, we can set TrainerConfig.unroll_step_interval to the desired interaction period.
  
  * Fix circular import
  * Address comments
  * Fix failure for build.sh
  
  Need to avoid messing up the command-line arguments.
  
  * Address comments
  af7ed9d3
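
The idea of running unroll and training concurrently can be illustrated with a small producer/consumer sketch. This is not ALF's implementation: the functions `unroll_worker`, `train`, and `run` are hypothetical, the environment step and the gradient update are replaced by stand-ins, and `step_interval` only mimics the role the commit message gives to `TrainerConfig.unroll_step_interval`.

```python
import queue
import threading
import time


def unroll_worker(num_steps, step_interval, out_queue):
    """Produce one fake experience per environment step.

    `step_interval` is a hypothetical interaction period: the worker waits
    so that consecutive steps start at least that many seconds apart.
    """
    for step in range(num_steps):
        start = time.monotonic()
        experience = {"step": step, "reward": float(step)}  # stand-in for env.step()
        out_queue.put(experience)
        remaining = step_interval - (time.monotonic() - start)
        if remaining > 0:
            time.sleep(remaining)
    out_queue.put(None)  # sentinel: unroll has finished


def train(in_queue):
    """Consume experiences as they arrive; return (count, total reward)."""
    num_trained = 0
    total_reward = 0.0
    while True:
        experience = in_queue.get()  # blocks until the worker produces data
        if experience is None:
            break
        total_reward += experience["reward"]  # stand-in for a gradient update
        num_trained += 1
    return num_trained, total_reward


def run(num_steps=5, step_interval=0.01):
    """Run unroll in a background thread while training in the caller's thread."""
    experiences = queue.Queue()
    worker = threading.Thread(
        target=unroll_worker, args=(num_steps, step_interval, experiences))
    worker.start()
    result = train(experiences)  # overlaps with the worker's unroll loop
    worker.join()
    return result
```

With an expensive environment step, the training loop no longer waits for each step to finish before doing useful work; with a nonzero `step_interval`, the worker paces its interaction with the environment independently of how long training takes.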

Sep 30, 2022
- Fix Collision Sensor (#1394) · 518c7153
  Haichao Zhang authored 2 years ago
  
  518c7153
Sep 21, 2022
- rtd furo theme (#1393) · 30b05851
  Haonan Yu authored 2 years ago
  
  30b05851
Sep 16, 2022

- Fix evaluator for handling pre-configs (#1369) · 1e7e4534
  emailweixu authored 2 years ago
  
  Also fixes a potential "RuntimeError: unable to open shared memory object" when many training sessions are running.
  1e7e4534

Sep 15, 2022

- support detach for distributions (#1392) · bcff2350
  Haonan Yu authored 2 years ago
  
  * support detach for distributions * kl_divergence has been fixed by pytorch * address comments
  bcff2350

Sep 12, 2022
- Add MuZero configuration for MetaDrive (#1389) · f19ec18f
  Break Yang authored 2 years ago
  
  * Add MuZero configuration for MetaDrive * Add the training curve * Address comments * Use LSTM for reward prediction
  f19ec18f
- support initial alpha for TAAC (#1390) · 65e67cb3
  Haonan Yu authored 2 years ago
  
  65e67cb3
- fix NaiveParallelNetwork for projection networks (#1391) · 819cb504
  Haonan Yu authored 2 years ago
  
  819cb504
Sep 02, 2022
- Increase the traffic density and speed reward (#1388) · b7bd7bdf
  Break Yang authored 2 years ago
  
  b7bd7bdf
Sep 01, 2022
- Add extra rewards to metadrive (#1387) · 2c8e0aba
  Break Yang authored 2 years ago
  
  * Add extra rewards to metadrive * Address comments
  2c8e0aba
- add advanced Fetch envs (#1356) · 7a01dc55
  Haonan Yu authored 2 years ago
  
  * add advanced Fetch envs * address comments and add gripper orientation into obs * address comments
  7a01dc55
- Support injected summarize function from config for summarize_rollout (#1385) · 9449bdba
  Break Yang authored 2 years ago
  
  * Support injected summarize function from config for summarize_rollout * inject_summary -> custom_summary
  9449bdba
Aug 31, 2022
- Correct handling of DataTransformer for PPG Auxiliary Algorithm (#1384) · 5d576ec0
  Break Yang authored 2 years ago
  
  5d576ec0
Aug 27, 2022

- add OneHotCategoricalGumbelSoftmax (#1383) · 612042eb
  Haonan Yu authored 2 years ago
  
  * add OneHotCategoricalGumbelSoftmax * fix typo
  612042eb

- IQL algorithm (#1381) · 36d508f8
  Haichao Zhang authored 2 years ago
  
  * IQL algorithm * Some minor updates * Remove unused arguments for actor training; state-independent std etc * Address comments
  36d508f8

Aug 26, 2022
- support optional fields for AverageEnvInfoMetric (#1382) · 69607844
  Haonan Yu authored 2 years ago
  
  69607844
Aug 25, 2022

- add discrete VAE (#1372) · e8dc3367
  Haonan Yu authored 2 years ago
  
  * add discrete VAE * also return z_mode for discrete vae * add ST gumbel-softmax * update * add scheduler option for the temperature * address comments
  e8dc3367

Aug 18, 2022

- RewardShifting data transformer (#1379) · 770ac568
  Haichao Zhang authored 2 years ago
  
  * RewardShifting data transformer * Make bias a required argument * Fix observation_spec argument
  770ac568

Aug 17, 2022
- add image decoding network with upsampling+conv (#1367) · 53efd5be
  Haonan Yu authored 2 years ago
  
  * add image decoding network with upsampling+conv * reference of padding modes
  53efd5be