unilab.algos.torch.hora.sac_learner

HORA-owned FastSAC learner.

Functions

derive_priv_info_from_critic_obs(actor_obs, ...)

Return the privileged tail encoded in the critic observation contract.

Classes

HoraSACLearner

FastSAC learner variant whose actor consumes HORA privileged info.

unilab.algos.torch.hora.sac_learner.derive_priv_info_from_critic_obs(actor_obs, critic_obs, *, context)[source]

Return the privileged tail encoded in the critic observation contract.

Parameters:
Return type:

Tensor

class unilab.algos.torch.hora.sac_learner.HoraSACLearner[source]

Bases: FastSACLearner

FastSAC learner variant whose actor consumes HORA privileged info.

Parameters:
__init__(*, obs_dim, critic_obs_dim, priv_info_dim, action_dim, device='cpu', actor_hidden_dim=512, priv_info_embed_dim=9, priv_mlp_hidden_dims=(256, 128, 9), log_std_max=0.0, log_std_min=-5.0, use_tanh=True, use_layer_norm=True, actor_lr=0.0003, weight_decay=0.001, use_symmetry=False, symmetry_augmentation=None, **kwargs)[source]
Parameters: