Language

unilab.algos.torch.hora.rsl_rl.HoraRslRlVecEnvWrapper¶

class unilab.algos.torch.hora.rsl_rl.HoraRslRlVecEnvWrapper[source]¶

RSL-RL adapter that preserves HORA teacher-policy observation payloads.

Parameters:

Methods

`__init__`(env[, device, policy_obs_mode])
`close`()
`get_observations`()	Return the current HORA-aware observation TensorDict.
`get_privileged_observations`()
`reset`()	Reset the wrapped env and preserve HORA privileged reset payloads.
`step`(actions)	Step the wrapped env while keeping HORA bootstrap payloads intact.

step(actions)[source]¶

Step the wrapped env while keeping HORA bootstrap payloads intact.

Parameters:: actions (Tensor | ndarray) – Torch or numpy action batch with shape (num_envs, action_dim).
Return type:: tuple[TensorDict, Tensor, Tensor, dict]
Returns:: Tuple (obs_td, rewards, dones, infos) matching the RSL-RL VecEnv contract while preserving HORA privileged observations.

Reset the wrapped env and preserve HORA privileged reset payloads.

Parameters:: None.
Return type:: tuple[TensorDict, dict[str, Any]]
Returns:: Tuple (obs_td, info) where obs_td retains HORA privileged inputs.

get_observations()[source]¶

Return the current HORA-aware observation TensorDict.

Parameters:: None.
Return type:: TensorDict
Returns:: TensorDict containing the current observation batch with HORA extras.

__init__(env, device='cpu', policy_obs_mode='flat')¶

Parameters:

close()¶

get_privileged_observations()¶