unilab.algos.torch.hora.sac_learner¶
HORA-owned FastSAC learner.
Functions
|
Return the privileged tail encoded in the critic observation contract. |
Classes
FastSAC learner variant whose actor consumes HORA privileged info. |
- unilab.algos.torch.hora.sac_learner.derive_priv_info_from_critic_obs(actor_obs, critic_obs, *, context)[source]¶
Return the privileged tail encoded in the critic observation contract.
- class unilab.algos.torch.hora.sac_learner.HoraSACLearner[source]¶
Bases:
FastSACLearnerFastSAC learner variant whose actor consumes HORA privileged info.
- Parameters:
- __init__(*, obs_dim, critic_obs_dim, priv_info_dim, action_dim, device='cpu', actor_hidden_dim=512, priv_info_embed_dim=9, priv_mlp_hidden_dims=(256, 128, 9), log_std_max=0.0, log_std_min=-5.0, use_tanh=True, use_layer_norm=True, actor_lr=0.0003, weight_decay=0.001, use_symmetry=False, symmetry_augmentation=None, **kwargs)[source]¶