Have the agent perform "no action" to enter offline phase training #24

@thopkins32

As described in #22, there are two planned phases of learning: online and offline. It could be interesting to let the agent decide to enter offline training on its own, much like many animals choose to rest or go to sleep.

In this mode, the environment simulation should continue to run in the background, but the agent's action would be forced to a no-op. The switch could be triggered once the agent has chosen a certain number of consecutive no-ops, though I'm not sure what a "natural" threshold for this would be.
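
A minimal sketch of the consecutive no-op trigger, to make the idea concrete. Everything here is an assumption for illustration rather than anything that exists in the project: the `NO_OP` action id, the `RestTracker` class, the `effective_action` helper, and the `threshold` value are all hypothetical names.

```python
from dataclasses import dataclass

NO_OP = 0  # hypothetical action id meaning "do nothing"


@dataclass
class RestTracker:
    """Counts consecutive no-ops and flags when the offline phase should start."""

    threshold: int  # hypothetical: consecutive no-ops needed to enter the offline phase
    streak: int = 0
    offline: bool = False

    def observe(self, action: int) -> bool:
        """Record the agent's chosen action; return True while in the offline phase."""
        if self.offline:
            return True
        # Any action other than a no-op resets the streak.
        self.streak = self.streak + 1 if action == NO_OP else 0
        if self.streak >= self.threshold:
            self.offline = True
        return self.offline

    def wake(self) -> None:
        """Leave the offline phase and start counting from zero again."""
        self.streak = 0
        self.offline = False


def effective_action(tracker: RestTracker, chosen: int) -> int:
    """Action actually sent to the environment: forced to a no-op while offline."""
    return NO_OP if tracker.observe(chosen) else chosen
```

With `threshold=3`, three no-ops in a row switch the tracker to offline, after which every action is replaced by `NO_OP` until `wake()` is called. The environment keeps stepping as usual; only the action fed to it changes. How `wake()` gets triggered (for example, after a fixed number of offline training steps) is left open here, as is the choice of threshold.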

Metadata

Labels: idea (Something to consider)
