flexs.baselines.explorers.environments.ppo

PPO environment module.

class flexs.baselines.explorers.environments.ppo.PPOEnvironment(alphabet, starting_seq, model, max_num_steps)[source]

Bases: tf_agents.environments.py_environment.PyEnvironment

PPO environment based on TF-Agents.

action_spec()[source]

Define agent actions.

get_state_string()[source]

Get sequence representing current state.

observation_spec()[source]

Define environment observations.