unilab.base.final_observation

Functions

patch_transition_next_obs(next_obs[, ...])

Patch transition next obs with final_observation without mutating actor inputs.

resolve_terminal_observation_contract(...[, ...])

Resolve terminal observation facts without constructing patched next obs.

resolve_transition_bootstrap_contract(next_obs)

Resolve actor/storage observations and timeout bootstrap masks for a step.

Classes

TerminalObservationContract

TerminalObservationContract(terminal_obs: 'np.ndarray | None', terminal_mask: 'np.ndarray', timeout_terminal_mask: 'np.ndarray', terminal_critic: 'np.ndarray | None' = None)

TransitionBootstrapContract

TransitionBootstrapContract(actor_next_obs: 'np.ndarray', transition_next_obs: 'np.ndarray', terminal_mask: 'np.ndarray', timeout_terminal_mask: 'np.ndarray', actor_next_critic: 'np.ndarray | None' = None, transition_next_critic: 'np.ndarray | None' = None)

class unilab.base.final_observation.TransitionBootstrapContract[source]

Bases: object

TransitionBootstrapContract(actor_next_obs: ‘np.ndarray’, transition_next_obs: ‘np.ndarray’, terminal_mask: ‘np.ndarray’, timeout_terminal_mask: ‘np.ndarray’, actor_next_critic: ‘np.ndarray | None’ = None, transition_next_critic: ‘np.ndarray | None’ = None)

Parameters:
actor_next_obs: ndarray
transition_next_obs: ndarray
terminal_mask: ndarray
timeout_terminal_mask: ndarray
actor_next_critic: ndarray | None = None
transition_next_critic: ndarray | None = None
__init__(actor_next_obs, transition_next_obs, terminal_mask, timeout_terminal_mask, actor_next_critic=None, transition_next_critic=None)
Parameters:
class unilab.base.final_observation.TerminalObservationContract[source]

Bases: object

TerminalObservationContract(terminal_obs: ‘np.ndarray | None’, terminal_mask: ‘np.ndarray’, timeout_terminal_mask: ‘np.ndarray’, terminal_critic: ‘np.ndarray | None’ = None)

Parameters:
terminal_obs: ndarray | None
terminal_mask: ndarray
timeout_terminal_mask: ndarray
terminal_critic: ndarray | None = None
__init__(terminal_obs, terminal_mask, timeout_terminal_mask, terminal_critic=None)
Parameters:
unilab.base.final_observation.patch_transition_next_obs(next_obs, final_observation=None, done=None, info=None, next_critic=None)[source]

Patch transition next obs with final_observation without mutating actor inputs.

Parameters:
Return type:

tuple[ndarray, ndarray | None, ndarray]

unilab.base.final_observation.resolve_transition_bootstrap_contract(next_obs, info=None, final_observation=None, done=None, truncated=None, next_critic=None)[source]

Resolve actor/storage observations and timeout bootstrap masks for a step.

Parameters:
Return type:

TransitionBootstrapContract

unilab.base.final_observation.resolve_terminal_observation_contract(next_obs_batch_size, final_observation=None, done=None, info=None, truncated=None)[source]

Resolve terminal observation facts without constructing patched next obs.

Parameters:
Return type:

TerminalObservationContract