STAX

This is the code for the ECJ journal paper Discovering and Exploiting Sparse Rewards in a Learned Behavior Space.

It is based on my Novelty Search Gym library (https://github.com/GPaolo/novelty_search_gym).

To install run:

pipenv shell --three
python setup.py install

This code can be used in two ways:

By taking advantage of the run_experiment.py script in the script folder
By importing and using the Searcher class in the core/searcher.py file. This one can be easily used in the following way:

from parameters import params
from core.searcher import Searcher

searcher = Searcher(params)
for k in range(params.generations):
    searcher.generational_step()
searcher.close()

Performing an experiment

To run the algorithm you just need to launch:

ipython script/run_experiment.py

If you want to change the experiment parameters, go to: parameters.py

You can also specify some of the parameters on the command line. If you want to check which ones, launch:

ipython script/run_experiment.py -- -h

For each generation the script saves: population, offsprings, archive as pkl files in folders whose name is formatted as: <env_name>_<exp_type>/<year>_<month>_<day>_<hour>:<minute>_<random_seed_used>. The time corresponds to when the experiment has been launched.

In this folder you find the parameters, in a file called: _params.json, the archive for each generation, in files called: archive_gen_<generation>.pkl, the population for each generation, in files called: population_gen_<generation>.pkl, and the offsprings for each generation, in files called: offsprings_gen_<generation>.pkl.

Evaluating the archive

Once the experiment is finished, if you want to study the behavior descriptors of the agents in the archive you have to evaluate the archive first by running it in the environment and saving the trajectories of images and observations.

You can do that by launching:

ipython scripts/evaluate_archive.py -- -e EXPERIMENT_PATH

This script will evaluate the archive of the given generation (The default is the 500).

As for the run_experiment.py script, you can change the parameters or provide some on the command line.

Once the evaluation is done, the trajectories will be saved in the experiment folder, inside a folder called analyzed_data as:

archive_info_gen_<generation>.pkl: these are the infos given by the gym environment every time a step is performed;
archive_obs_gen_<generation>.pkl: these are the observations given by the gym environment every time a step is performed.
cvg_gen_<generation>.pkl: the coverage of the ground truth behavior description space obtained by the archive.
unif_gen_<generation>.pkl: the uniformity of the archive in the behavior descriptor space.
gt_bd_gen_<generation>.pkl: the behavior descriptor's points for every agent in the archive.

Plotting the results

Finally you can plot your results by using the jupyter notebook archive_analysis located in the analysis folder.

Extending it

As said, this code is fairly modular and can be extended easily, both by adding new kind of experiments/metrics or by adding new gym environments.

Adding environments

If you want to add an environment you have to do:

Add the gym environment in the environments/assets folder
Register the environment in the environments/environments.py file as an entry in the registered_envs dictionary
Add an input formatter (and in case also an output formatter) in the environments/io_formatters.py files. These formatters are used to interface the environment with the controllers.
- The input formatters prepares the observation to be fed to the controller.
- The output formatters takes the controller output and formats it as an action for the environment.
Add the ground truth behavior descriptor in the analysis/gt_behavior_descriptors.py and in the get_metrics function in analysis/evaluate_archive.py.
Add the observations extraction function from the trajectory in the core/behavior_descriptors/trajectory_to_observations.py.

Adding experiments

If you want to add an experiment type you have to:

The behavior descriptor in the core/behavior_descriptors/behavior_descriptor.py. You have to both add the option in the __init__ of the class and the actual descriptor function as a member function of the class.
If you want to define an evolution algorithm you can do it in core/evolvers Then you have to add it in the core/evolvers/__init__.py and in the core/searcher.py
Add the name you chose for the experiment in the list of possible choices in the parser in scripts/run_experiment.py

If you find this code useful and you use it in a project, please cite it:

@misc{NSPaolo2020,
  author = {Paolo, Giuseppe},
  title = {Novelty Search Gym},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/GPaolo/novelty_search_gym}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
analysis		analysis
cmaes_analysis		cmaes_analysis
core		core
environments		environments
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
Pipfile.lock		Pipfile.lock
README.md		README.md
__init__.py		__init__.py
code test.ipynb		code test.ipynb
code_test.py		code_test.py
parameters.py		parameters.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

STAX

Performing an experiment

Evaluating the archive

Plotting the results

Extending it

Adding environments

Adding experiments

About

Releases

Packages

Languages

License

GPaolo/STAX

Folders and files

Latest commit

History

Repository files navigation

STAX

Performing an experiment

Evaluating the archive

Plotting the results

Extending it

Adding environments

Adding experiments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages