unilab.algos.mlx.common.rollout_storage¶
Rollout buffer for on-policy algorithms.
Classes
On-policy rollout storage for vectorized environments. |
- class unilab.algos.mlx.common.rollout_storage.RolloutBuffer[source]¶
Bases:
objectOn-policy rollout storage for vectorized environments.
- Parameters:
- add(obs, actions, log_probs, action_mean, action_std, rewards, dones, values)[source]¶
- Parameters:
obs (
array)actions (
array)log_probs (
array)action_mean (
array)action_std (
array)rewards (
array)dones (
array)values (
array)
- Return type: