Fix benchmarking configs + Add a minimal test suite for those configs. #653
Conversation
This PR patches in the changes from PR #650. It also fixes an error in the dagger configs by replacing "py/object" with "py/type": "py/object" failed the assertion in sacred.config.utils.assert_is_valid_key. Finally, it adds a minimal test suite for the benchmarking configs. This makes progress on testing, but still does not verify that every benchmarking config works: running all the configs, even with the fast configs applied, is too slow, and calling sacred's print_config command for all the configs is also too slow.
AdamGleave
left a comment
Thanks for adding this test! Some minor suggestions.
Avoid the overhead of a subprocess by running print_config through the experiment's run method. Also, derive BENCHMARKING_DIR from a path relative to the script, instead of constructing a path and then checking that it exists.
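A sketch of what those two suggestions could look like in a test module. This is a hedged illustration: `train_imitation_ex`, the module path, and the `benchmarking` directory name are assumptions, not the repository's actual identifiers.

```python
import pathlib


def benchmarking_dir() -> pathlib.Path:
    """Locate the benchmarking configs relative to this test file,
    so the test passes regardless of the working directory."""
    return pathlib.Path(__file__).resolve().parent / "benchmarking"


def list_benchmark_configs(directory: pathlib.Path) -> list:
    """Collect the JSON config files the smoke test should cover."""
    return sorted(directory.glob("*.json"))


def check_config(config_path: pathlib.Path) -> None:
    """Run sacred's built-in print_config command in-process via
    Experiment.run, avoiding the overhead of one subprocess per config.

    train_imitation_ex is a placeholder name; substitute the sacred
    Experiment object the script under test actually exposes.
    """
    from imitation.scripts.train_imitation import train_imitation_ex

    train_imitation_ex.add_config(str(config_path))
    train_imitation_ex.run(command_name="print_config")
```

In a pytest suite, `check_config` would typically be parametrized over `list_benchmark_configs(benchmarking_dir())` so each config shows up as its own test case.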
AdamGleave
left a comment
LGTM, 2 small suggestions
Codecov Report
@@           Coverage Diff           @@
##           master     #653   +/-   ##
=======================================
  Coverage   97.54%   97.54%
=======================================
  Files          86       87     +1
  Lines        8422     8443    +21
=======================================
+ Hits         8215     8236    +21
  Misses        207      207
This change just made the error messages about the missing imitation.algorithms.dagger.ExponentialBetaSchedule go away; it did not fix the root cause.
* Undo the changes from #653 to the dagger benchmark config files. This change just made some error messages go away indicating the missing imitation.algorithms.dagger.ExponentialBetaSchedule but it did not fix the root cause.
* Improve readability and interpretability of benchmarking tests.
* Add exponential beta scheduler for dagger
* Ignore coverage for unknown algorithms.
* Cleanup and extend tests for beta schedules in dagger.

---------

Co-authored-by: taufeeque9 <[email protected]>
* Merge py file changes from benchmark-algs
* Clean parallel script
* Undo the changes from #653 to the dagger benchmark config files. This change just made some error messages go away indicating the missing imitation.algorithms.dagger.ExponentialBetaSchedule but it did not fix the root cause.
* Improve readability and interpretability of benchmarking tests.
* Add exponential beta scheduler for dagger
* Ignore coverage for unknown algorithms.
* Cleanup and extend tests for beta schedules in dagger.
* Add optuna to dependencies
* Fix test case
* Clean up the scripts
* Remove reporter(done) since mean_return is reported by the runs
* Add beta_schedule parameter to dagger script
* Update config policy kwargs
* Changes from review
* Fix errors with some configs
* Updates based on review
* Change metric everywhere
* Separate tuning code from parallel.py
* Fix docstring
* Remove resume option as it is getting tricky to correctly implement
* Minor fixes
* Updates from review
* Fix lint error
* Add documentation for using the tuning script
* Fix lint error
* Updates from the review
* Fix file name test errors
* Add tune_run_kwargs in parallel script
* Fix test errors
* Fix test
* Fix lint
* Updates from review
* Simplify a few lines of code
* Updates from review
* Fix test
* Revert "Fix test" (this reverts commit 8b55134)
* Fix test
* Convert Dict to Mapping in input argument
* Ignore coverage in script configurations.
* Pin huggingface_sb3 version.
* Update to the newest seals environment versions.
* Push gymnasium dependency to 0.29 to ensure mujoco envs work.
* Incorporate review comments
* Fix test errors
* Move benchmarking/ to scripts/ and add named configs for tuned hyperparams
* Bump cache version & remove unnecessary files
* Include tuned hyperparam json files in package data
* Update storage hash
* Update search space of bc
* Update benchmark and hyperparameter tuning readme
* Update README.md
* Incorporate reviewer's comments in benchmarking readme
* Update gymnasium version and render mode in eval policy
* Fix error
* Update commands.py hex strings

---------

Co-authored-by: Maximilian Ernestus <[email protected]>
Co-authored-by: ZiyueWang25 <[email protected]>
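For reference, the "exponential beta scheduler for dagger" mentioned in the commits above can be sketched as follows. This is a minimal illustration of the idea (beta decays geometrically with the DAgger round number), not the actual imitation implementation:

```python
class ExponentialBetaSchedule:
    """DAgger beta schedule where beta_t = decay_probability ** round_num.

    beta is the probability of acting on the expert's action in a given
    DAgger round; it starts at 1 and decays toward 0 as training progresses.
    """

    def __init__(self, decay_probability: float):
        if not 0 < decay_probability <= 1:
            raise ValueError("decay_probability must lie in (0, 1].")
        self.decay_probability = decay_probability

    def __call__(self, round_num: int) -> float:
        # Round 0 gives beta = 1.0 (pure expert); later rounds decay it.
        assert round_num >= 0
        return self.decay_probability ** round_num
```

With `decay_probability=0.5`, rounds 0, 1, and 2 yield betas of 1.0, 0.5, and 0.25 respectively.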
Description
This PR patches in the changes from PR #650. It also fixes an error in the dagger configs by replacing "py/object" with "py/type": "py/object" failed the assertion in sacred.config.utils.assert_is_valid_key.
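Illustratively, the fix amounts to swapping the jsonpickle-style tag used in the config files. The surrounding key names below are hypothetical; only the "py/object" to "py/type" substitution and the class path are taken from this PR:

```python
# Hypothetical excerpt of a dagger benchmark config, shown as a Python dict.
# Before: sacred's assert_is_valid_key rejected the "py/object" key.
broken = {
    "beta_schedule": {
        "py/object": "imitation.algorithms.dagger.ExponentialBetaSchedule",
    },
}

# After: referencing the class via "py/type" passes config key validation
# (per this PR's description).
fixed = {
    "beta_schedule": {
        "py/type": "imitation.algorithms.dagger.ExponentialBetaSchedule",
    },
}
```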
Testing
The PR adds a minimal test suite for the benchmarking configs. Running all the configs with the fast configs applied is too slow and requires MuJoCo. Instead, we test that the print_config command works for all the configs.