Clean Up and Deprecate PPO During Transformer Upgrade #61

@youngwanlim

Description

As part of upgrading the Transformers library to the latest version, we should clean up and deprecate the existing PPO (Proximal Policy Optimization) implementation and related code paths.

Goals

  • Remove or deprecate PPO-specific training code that is no longer actively used.
  • Eliminate obsolete dependencies introduced solely for PPO support.
  • Ensure compatibility with the latest Transformers release.
  • Update documentation to reflect the removal/deprecation.
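
If we deprecate before removing, one option is to keep the PPO entry points importable for a release and have them emit a warning. A minimal sketch, assuming a hypothetical `train_ppo` function (the actual PPO entry points in this repo may be named and structured differently) and a hypothetical `deprecated` helper built only on the standard library:

```python
import functools
import warnings


def deprecated(reason: str):
    """Mark a callable as deprecated; each call emits a DeprecationWarning."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            warnings.warn(
                f"{func.__name__} is deprecated: {reason}",
                category=DeprecationWarning,
                stacklevel=2,  # attribute the warning to the caller, not the wrapper
            )
            return func(*args, **kwargs)
        return wrapper
    return decorator


# Hypothetical PPO entry point, used only for illustration.
@deprecated("PPO support is being removed as part of the Transformers upgrade.")
def train_ppo(config: dict) -> str:
    # Placeholder body; the real training logic would stay here until removal.
    return "ppo-run"
```

Callers keep working during the deprecation window, and the warning gives them a chance to migrate before the code path is deleted outright.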
