unilab.algos.mlx.ppo.model.MLPActorCritic¶
- class unilab.algos.mlx.ppo.model.MLPActorCritic[source]¶
Bases:
ModuleShared utility class containing actor and critic MLPs.
- Parameters:
Methods
__init__(obs_dim, action_dim, ...[, ...])act(obs)Sample actions and return MLX tensors.
Clamp log-std to avoid numerical explosion.
current_action_std(action_shape)Return broadcasted std tensor for current policy.
distribution_params(obs)policy(obs)update_normalization(obs)value(obs)- __init__(obs_dim, action_dim, actor_hidden_dims, critic_hidden_dims, activation='tanh', init_log_std=0.0, min_log_std=-5.0, max_log_std=2.0, obs_normalization=False, noise_std_type='log', state_dependent_std=False, dtype=None)[source]¶