unilab.algos.mlx.ppo.ppo.PPOTrainer¶
- class unilab.algos.mlx.ppo.ppo.PPOTrainer[source]¶
Bases:
objectPPO update logic for MLPActorCritic and RolloutBuffer.
- Parameters:
model (
MLPActorCritic)cfg (
PPOConfig)
Methods
- __init__(model, cfg)[source]¶
- Parameters:
model (
MLPActorCritic)cfg (
PPOConfig)