unilab.algos.torch.hora.observations.build_hora_obs_tensordict¶
- unilab.algos.torch.hora.observations.build_hora_obs_tensordict(obs, *, info, device, batch_size, policy_obs)[source]¶
Build the HORA PPO/APPO observation TensorDict for teacher-policy runtime.
- Parameters:
obs (
dict[str,ndarray]) – Environment observation dict following the UniLab env contract.info (
dict[str,Any] |None) – Optional env info dict containing HORA privileged payloads.device (
str) – Torch device string used for the returned tensors.batch_size (
int) – Number of vectorized environments represented by this batch.policy_obs (
ndarray) – Policy observation array already resolved by the caller.
- Return type:
TensorDict- Returns:
TensorDict with generic keys plus HORA-specific
priv_infoand optionalproprio_histwhen the environment provided them.