unilab.algos.mlx.common.rollout_storage.RolloutBuffer¶
- class unilab.algos.mlx.common.rollout_storage.RolloutBuffer[source]¶
Bases:
objectOn-policy rollout storage for vectorized environments.
- Parameters:
Methods
__init__(num_steps, num_envs, obs_dim, ...)add(obs, actions, log_probs, action_mean, ...)clear()compute_returns_and_advantages(last_values)mini_batch_generator(num_mini_batches, ...)Attributes
- add(obs, actions, log_probs, action_mean, action_std, rewards, dones, values)[source]¶
- Parameters:
obs (
array)actions (
array)log_probs (
array)action_mean (
array)action_std (
array)rewards (
array)dones (
array)values (
array)
- Return type: