unilab.algos.mlx.common.distributions

Distribution utilities for RL policies.

Functions

diag_gaussian_entropy(log_std)

Entropy of a diagonal Gaussian.

diag_gaussian_log_prob(actions, mean, log_std)

Log-probability under a diagonal Gaussian.

unilab.algos.mlx.common.distributions.diag_gaussian_log_prob(actions, mean, log_std)[source]

Log-probability under a diagonal Gaussian.

Parameters:
  • actions (array)

  • mean (array)

  • log_std (array)

Return type:

array

unilab.algos.mlx.common.distributions.diag_gaussian_entropy(log_std)[source]

Entropy of a diagonal Gaussian.

Parameters:

log_std (array)

Return type:

array