Skip to content

Sneaky noop: action=-1 #14

Merged
taufeeque9 merged 4 commits intosokobanfrom
aga/noop
Jul 2, 2024
Merged

Sneaky noop: action=-1 #14
taufeeque9 merged 4 commits intosokobanfrom
aga/noop

Conversation

@rhaps0dy
Copy link

It is useful to have a NOOP action in the environment, for "thinking time" or other ablations. But we don't want to make the NNs that learn from it use NOOPs.

Here, we intentionally send -1 (or any other invalid action <0). We catch that case and make the environment not reset.

@rhaps0dy rhaps0dy requested a review from taufeeque9 June 29, 2024 08:22
Adrià Garriga-Alonso added 2 commits June 29, 2024 01:24
Copy link
Collaborator

@taufeeque9 taufeeque9 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! This should make many of our scripts simpler.

Base automatically changed from tf/loop-around to sokoban June 30, 2024 10:37
@taufeeque9 taufeeque9 merged commit 93bdd8c into sokoban Jul 2, 2024
@taufeeque9 taufeeque9 deleted the aga/noop branch July 2, 2024 08:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants