  1. Aug 03, 2021
    • Add the directory of the entry point gin file to search path (#914) · 4e477d45
      Break Yang authored
      This allows running alf train, play and other programs from
      anywhere (no longer restricted to `alf/examples/`)
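As a rough illustration of the idea in this commit, the sketch below prepends the directory of an entry-point config file to a search path. The function name `add_conf_dir_to_search_path` and the use of `sys.path` as the default search path are assumptions made for this example; the commit does not show how ALF implements it.

```python
import os
import sys


def add_conf_dir_to_search_path(conf_file, search_path=None):
    """Prepend the directory containing ``conf_file`` to ``search_path``.

    ``search_path`` defaults to ``sys.path``. Both the function name and
    this default are illustrative assumptions, not ALF's actual code.
    """
    if search_path is None:
        search_path = sys.path
    conf_dir = os.path.dirname(os.path.abspath(conf_file))
    if conf_dir not in search_path:
        # Put it first so files next to the entry point are found first.
        search_path.insert(0, conf_dir)
    return conf_dir


# Example with a stand-in list instead of the real sys.path.
paths = ["/usr/lib/python3"]
conf_dir = add_conf_dir_to_search_path("/tmp/exp/my_conf.gin", paths)
```

Because the directory is derived from the config file itself, the working directory of the caller no longer matters.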
    • Add GPVI training (#932) · 1bfcf5fc
      Neale Ratzlaff authored
      This commit adds GPVI to alf. The main changes to alf/algorithms/generator.py and the tests are:
      * Add _create_mvp_network method, which instantiates an encoding network to compute a matrix-vector
              product, used for computing the inverse Jacobian-vector product in GPVI.
      * Add InverseMVPAlgorithm to specify a training step of the above InverseMVP network.
      * Add rkhs_func_grad function to generator.py
          This is the main update routine for GPVI, and is related to the InverseMVPAlgorithm.
          I have not added a function value version of GPVI, nor have I added a minmax version.
          ``rkhs_func_grad`` is called whenever the generator argument ``functional_gradient`` is set to ``rkhs``.
          Right now, ``functional_gradient`` only supports a ReluMLP generator, as it relies on the fast ``compute_vjp`` function rather than autograd.
      * Add GPVI tests to generator_test.py.
      * Add GPVI tests to hypernetwork_test.py.
      
      --Small Changes--
      * ReluMLP method ``compute_vjp`` now also returns the output of the forward evaluation, which is needed for the GPVI update
      * Added arguments for GPVI to generator.py, such as force_fullrank and fullrank_diag_weight
      * Added arguments for pinverse_network to hypernetwork_algorithm.py
      * Added arguments for GPVI to hypernetwork_algorithm.py
      
      * addressing comments in PR discussion
      * renamed pinverse network to InverseMVPNetwork (MVP stands for matrix-vector product)
      * added a test for InverseMVPNetwork that explicitly computes the inverse Jacobian vector product of an MLP with respect to some random input. This is evaluated against the solution found by training an InverseMVPNetwork.
      * Implemented fixes and suggestions to generator and hypernetwork files.
      * Changed naming of Pinverse network to the new InverseMVP network.
      
      * Addressing PR comments.
      * Removed InverseMVPNetwork file. Opted for EncodingNetwork as suggested by Wei.
      * Refactored InverseMVP test accordingly, to show that the idea still works.
      * Added helper function to generator.py, to create this network
      * Added/changed docstrings to generator.py as suggested.
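The reference quantity mentioned in the test description above, an explicitly computed inverse Jacobian-vector product of an MLP, can be sketched in a few lines. This is not the ALF test: it uses a hypothetical 2x2 ReLU MLP `f(x) = w2 @ relu(w1 @ x)` with hand-picked weights and solves `J y = v` directly, which is the kind of exact answer a trained inverse-MVP network would be compared against.

```python
def relu_mlp_jacobian(w1, w2, x):
    """Jacobian of f(x) = w2 @ relu(w1 @ x) for 2x2 weight matrices."""
    hidden = [sum(w1[i][k] * x[k] for k in range(2)) for i in range(2)]
    mask = [1.0 if h > 0 else 0.0 for h in hidden]
    # J = w2 @ diag(mask) @ w1
    return [[sum(w2[i][k] * mask[k] * w1[k][j] for k in range(2))
             for j in range(2)] for i in range(2)]


def solve_2x2(a, v):
    """Solve a @ y = v for a 2x2 matrix using Cramer's rule."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    if abs(det) < 1e-12:
        raise ValueError("Jacobian is singular at this input")
    return [(v[0] * a[1][1] - a[0][1] * v[1]) / det,
            (a[0][0] * v[1] - v[0] * a[1][0]) / det]


# Illustrative weights and input, chosen so both hidden units are active.
w1 = [[1.0, 2.0], [0.5, -1.0]]
w2 = [[2.0, 0.0], [1.0, 1.0]]
x = [1.0, 0.25]
v = [1.0, 2.0]

jac = relu_mlp_jacobian(w1, w2, x)
y = solve_2x2(jac, v)  # y = J^{-1} v
# Multiplying back should recover v up to rounding error.
residual = [sum(jac[i][j] * y[j] for j in range(2)) - v[i] for i in range(2)]
```

A learned approximation can be scored the same way, by checking how close `J @ y_pred` is to `v`.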
  2. Aug 02, 2021
  3. Jul 30, 2021
  4. Jul 29, 2021
  5. Jul 28, 2021
  6. Jul 27, 2021
  7. Jul 25, 2021
  8. Jul 23, 2021
  9. Jul 22, 2021
    • update oac algorithm with latest alf updates (#943) · 7acb5d56
      Qinxun Bai authored
      * update oac algorithm with latest alf updates
      
      * update oac test hyperparameters to pass the test more stably
    • Extract worker logic out of ProcessEnvironment #913 (#939) · f2d378ad
      Break Yang authored
      This is part of the effort to unblock #913. Two reasons for this change
      
      1. `worker` does not rely on `ProcessEnvironment` at all, so it is cleaner to make it independent of `ProcessEnvironment`.
      2. If it stays a member method of `ProcessEnvironment`, `multiprocessing.Process` gets stuck on `start()` when the parent process is also a `multiprocessing.Process`. The cause is still unknown (it was investigated but not yet identified).
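The pattern described in the two points above, a worker defined as a plain module-level function and not as a method, might look roughly like the sketch below. The names `start_worker` and `make_doubler`, and the tuple-based message format, are hypothetical and only stand in for the real environment code.

```python
import multiprocessing


def worker(conn, env_constructor):
    """Serve requests on ``conn`` until a "close" message arrives.

    Defined at module level, so it holds no reference to any wrapper class
    and can be pickled by name when a child process is started.
    """
    env = env_constructor()
    while True:
        message, payload = conn.recv()
        if message == "call":
            conn.send(env(payload))
        elif message == "close":
            conn.close()
            break


def make_doubler():
    """Hypothetical stand-in for an environment constructor."""
    return lambda x: 2 * x


def start_worker(env_constructor, runner=multiprocessing.Process):
    """Start ``worker`` with ``runner`` and return the parent-side connection.

    ``runner`` only needs a ``target``/``args`` constructor and ``start()``,
    so ``threading.Thread`` can be substituted for quick local checks.
    """
    parent_conn, child_conn = multiprocessing.Pipe()
    job = runner(target=worker, args=(child_conn, env_constructor))
    job.start()
    return parent_conn, job
```

A caller would send `("call", value)` over the returned connection, read the reply with `recv()`, and finish with `("close", None)` followed by `job.join()`.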
    • first section of ALF tutorial (#935) · 5b277769
      Haonan Yu authored
      * first section of ALF tutorial
      
      * address comments
    • Extract message types as MessageType Enum in ProcessEnvironment (#938) · d1d1cc61
      Break Yang authored
      This is part of the effort to address #913. A sub-task requires extracting the worker logic out of the class (for some reason, keeping it inside prevents `multiprocessing` from working correctly). Without such a change, `multiprocessing.Process` will just be stuck on `start()`.
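For readers unfamiliar with the refactoring named in the commit title, a minimal sketch of message types expressed as an `Enum` follows. The member names and the `describe` helper are hypothetical; the commit does not list the actual members of `MessageType`.

```python
from enum import Enum


class MessageType(Enum):
    """Hypothetical message kinds exchanged between parent and worker."""
    READY = 1
    CALL = 2
    RESULT = 3
    EXCEPTION = 4
    CLOSE = 5


def describe(message):
    """Dispatch on the enum member instead of comparing raw integers."""
    kind, payload = message
    if kind is MessageType.CALL:
        return f"call {payload}"
    if kind is MessageType.CLOSE:
        return "close"
    raise ValueError(f"unexpected message type: {kind!r}")
```

Since the enum lives at module level, code outside the class, such as a standalone worker function, can refer to the same constants.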
  10. Jul 20, 2021
  11. Jul 18, 2021
    • Load BSuite Environment (#933) · 25b23774
      pd-perry authored
      * added suite load for bsuite environment
      
      * fixed observation shape bug in suite_bsuite and added alf_bsuite_wrapper
      
      * Revert "Correct relative path import in py configurations (#928)"
      
      This reverts commit 33c0ba21.
      
      * removed alf_bsuite_wrapper and added method to suite_bsuite instead
      
      * removed alf_bsuite_wrapper import from suite_bsuite file
      
      * change loadbsuite to bsuitewrapper and edit docstrings
      
      * edited description
      
      * fixed PR review changes
      
      * added check for max steps and change copyright year to this year
  12. Jul 13, 2021
  13. Jul 12, 2021
  14. Jul 10, 2021
  15. Jul 08, 2021
  16. Jun 28, 2021
  17. Jun 19, 2021
  18. Jun 18, 2021
    • Optimistic actor critic (#899) · 61b31de4
      Qinxun Bai authored
      * add OacAlgorithm and replicate paper results on HalfCheetah
      
      * add oac_humannoid_conf and oac_algorithm_test
      
      * minor updates in oac_algorithm_test
      
      * address code reviews
      
      * add oac halfcheetah and humanoid result figures
      
      * update oac_algorithm test
      
      * address further code reviews
      
      * address further code reviews
      
      * remove the unroll_with_grad
      
      * update oac_halfcheetah_conf for better performance
      
      * minor update of oac_halfcheetah_conf
    • config_util improvement (#909) · addcaab7
      emailweixu authored
      * Minor improvement to config_util
      
      1. Environment variable for not using gin. gin's wrapper is very complicated, which can make debugging harder and slow down execution.
      2. Error report for misuse of alf.config()
      
      * comment
      
      * Fix message
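The first item above, an environment variable that turns the wrapper off, can be sketched as follows. The variable name `EXAMPLE_USE_GIN` and the helper `maybe_wrap` are invented for this illustration; the commit message does not give the real variable name or show ALF's wrapping code.

```python
import functools
import os


def maybe_wrap(fn, environ=None, var_name="EXAMPLE_USE_GIN"):
    """Return ``fn`` itself when wrapping is disabled, else a wrapper.

    ``var_name`` is a hypothetical variable name; setting it to "0"
    disables the wrapper.
    """
    if environ is None:
        environ = os.environ
    if environ.get(var_name, "1") == "0":
        # No wrapper: shorter stack traces and no per-call overhead.
        return fn

    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        # Stand-in for the extra work a configuration wrapper would do.
        return fn(*args, **kwargs)

    return wrapper
```

Returning the original function untouched keeps debugger steps and tracebacks inside user code only.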
  19. Jun 17, 2021
    • Compositional FC (#907) · 50b3286c
      Haichao Zhang authored
      * Compositional FC
      
      * Use bmm for weighted combination
      
      * Better support to layer chaining
    • lean_function (#908) · b2efe2fc
      emailweixu authored
      Wrap a function to save memory for backward. The returned function performs the same
      computation as ``func``, but saves memory by discarding intermediate results.
      It calculates the gradient by recomputing ``func`` using the same input during backward.
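The recompute-on-backward idea described above (often called gradient checkpointing) can be shown without any autograd library. The sketch below is not ALF's `lean_function`: it is a hand-written toy for a chain of scalar functions, with a hypothetical `lean` helper that keeps only the input and rebuilds the intermediates when the gradient is requested.

```python
import math


def lean(forward_steps, derivative_steps):
    """Wrap a chain of scalar functions so that only the input is kept.

    ``forward_steps[i]`` maps a float to a float and ``derivative_steps[i]``
    is its derivative. This is a toy model of the idea, not a real autograd
    integration.
    """
    def run(x):
        y = x
        for f in forward_steps:
            y = f(y)  # intermediates are overwritten, not stored

        def backward(grad_output=1.0):
            # Recompute the inputs of every step from the saved input x.
            values = [x]
            for f in forward_steps[:-1]:
                values.append(f(values[-1]))
            # Chain rule, applied from the last step back to the first.
            grad = grad_output
            for df, v in zip(reversed(derivative_steps), reversed(values)):
                grad *= df(v)
            return grad

        return y, backward

    return run


# f(x) = sin(x ** 2), so f'(x) = cos(x ** 2) * 2 * x.
func = lean([lambda t: t * t, math.sin], [lambda t: 2 * t, math.cos])
value, backward = func(0.5)
grad = backward()
```

The trade-off is extra computation during backward in exchange for not holding intermediate results in memory between the two passes.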
  20. Jun 16, 2021
  21. Jun 15, 2021
  22. Jun 12, 2021
  23. Jun 10, 2021