This repository provides an implementation of Goal-Conditioned Supervised Learning (GCSL), as proposed in Learning to Reach Goals via Iterated Supervised Learning The manuscript is available on arXiv
If you use this codebase, please cite
Dibya Ghosh, Abhishek Gupta, Justin Fu, Ashwin Reddy, Coline Devin, Benjamin Eysenbach, Sergey Levine. Learning to Reach Goals via Iterated Supervised Learning
Bibtex source is provided at the bottom of the Readme.
Conda
conda env create -f environment/environment.yml
Docker
docker pull dibyaghosh/gcsl:0.1
python experiments/gcsl_example.py
If you have, and would like to use, a GPU, you will need to additionally install a GPU-compiled version of PyTorch. To do so, simply run
pip uninstall torch && pip install torch==1.1.0
The directory structure currently looks like this:
gcsl (Contains all code)
experiments (Contains all launcher files)
Please file an issue if you have trouble running this code.
@article{Ghosh2019LearningTR,
title={Learning To Reach Goals Without Reinforcement Learning},
author={Dibya Ghosh and Abhishek Gupta and Justin Fu and Ashwin Reddy and Coline Devin and Benjamin Eysenbach and Sergey Levine},
journal={ArXiv},
year={2019},
volume={abs/1912.06088}
}
hello, I've read the source code of the paper, and I found that this source code discretes all the envs after creating the env. What should I do in order to support continous envs?
Hey,
I've added a barebone GCSL implementation to https://github.com/cheind/gcsl, featuring a new goal-reaching cartpole environment. I believe the code is quite easy to digest in its current form and might help users to get started more easily.
When I run your code, I encounter such an error:
Traceback (most recent call last):
File "/home/gcsl/scripts/run_experiment_lite_doodad.py", line 10, in torch.device
, optional):' in _value.doc:
TypeError: argument of type 'property' is not iterable
It seems that there may exist some error in the function _replace_funcs.
This is one of the best papers I've read this year. Thank you so much for open-sourcing it!
Do you still have the code for TD3-HER baseline? Would you mind open-sourcing that part as well? Thanks in advance for your help!
Please provide a example on how to use GCSL for TimeSeries analysis .
I was interested in using the environments you used in this project, and I wanted to be able to include the environments as a package in another project I'm developing. In order to do so, I wanted to make it possible to install your code with pip. With the changes I've made in this pull request, running this command in the root directory:
pip install .
leads to what appears to be a successful installation of the environments. Specifically, the following code runs successfully:
``` from gcsl import envs
for env_name in envs.env_names: envs.create_env(env_name) print(f"Successfully created env: {env_name}") ```
It's probably worth testing if the rest of the code, e.g., the training script, also works after pip install .