unilab.algos.torch.hora.rsl_rl.HoraRslRlVecEnvWrapper
- class unilab.algos.torch.hora.rsl_rl.HoraRslRlVecEnvWrapper
Bases: RslRlVecEnvWrapper
RSL-RL adapter that preserves HORA teacher-policy observation payloads.
Methods
__init__(env[, device, policy_obs_mode])
close()
get_observations(): Return the current HORA-aware observation TensorDict.
reset(): Reset the wrapped env and preserve HORA privileged reset payloads.
step(actions): Step the wrapped env while keeping HORA bootstrap payloads intact.
- get_observations()
Return the current HORA-aware observation TensorDict.
- Parameters:
None.
- Return type:
TensorDict
- Returns:
TensorDict containing the current observation batch with HORA extras.
- __init__(env, device='cpu', policy_obs_mode='flat')
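The following is a minimal, dependency-free sketch of the adapter idea described on this page: a wrapper that forwards every observation key from reset() and step() rather than only the policy observation. It is not the unilab implementation. ToyEnv, PayloadPreservingWrapper, the "policy" and "priv_info" keys, and the 4-tuple returned by step() are all hypothetical, and plain dicts stand in for TensorDict. The constructor arguments mirror the signature above, but this page does not document what device or policy_obs_mode do, so the sketch only stores them.

```python
from typing import Any, Dict, List, Tuple

Obs = Dict[str, List[float]]


class ToyEnv:
    """Hypothetical stand-in for a wrapped env; not part of unilab."""

    def __init__(self) -> None:
        self.t = 0

    def _obs(self) -> Obs:
        # "policy" is the regular observation; "priv_info" stands in for
        # a privileged payload (both key names are assumptions).
        return {"policy": [float(self.t)], "priv_info": [0.5, 0.25]}

    def reset(self) -> Obs:
        self.t = 0
        return self._obs()

    def step(self, actions: List[float]) -> Tuple[Obs, float, bool, Dict[str, Any]]:
        self.t += 1
        return self._obs(), 1.0, self.t >= 3, {}

    def close(self) -> None:
        pass


class PayloadPreservingWrapper:
    """Toy adapter that keeps every observation key, not just "policy"."""

    def __init__(self, env: ToyEnv, device: str = "cpu", policy_obs_mode: str = "flat") -> None:
        self.env = env
        # Stored only; their real semantics are not documented on this page.
        self.device = device
        self.policy_obs_mode = policy_obs_mode
        self._last_obs: Obs = {}

    def get_observations(self) -> Obs:
        # Return a shallow copy of the most recent full observation dict.
        return dict(self._last_obs)

    def reset(self) -> Obs:
        self._last_obs = self.env.reset()
        return self.get_observations()

    def step(self, actions: List[float]) -> Tuple[Obs, float, bool, Dict[str, Any]]:
        obs, reward, done, info = self.env.step(actions)
        self._last_obs = obs  # cache so get_observations() stays in sync
        return self.get_observations(), reward, done, info

    def close(self) -> None:
        self.env.close()


# Usage: the privileged payload survives both reset() and step().
wrapper = PayloadPreservingWrapper(ToyEnv())
first = wrapper.reset()
obs, reward, done, info = wrapper.step([0.0])
wrapper.close()
```

In the sketch, get_observations() returns the same payload that the latest reset() or step() produced, which is the behavior the method summaries above describe at a high level.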