unilab.algos.torch.offpolicy.runtime

Runtime resolution helpers for off-policy script assembly.

Functions

resolve_custom_offpolicy_runtime(rl_cfg): Resolve an optional custom off-policy runtime from owner config.

Classes

OffPolicyRuntime: Optional runtime overrides for the generic off-policy SAC path.

class unilab.algos.torch.offpolicy.runtime.OffPolicyRuntime

Optional runtime overrides for the generic off-policy SAC path.

All fields are optional so custom runtimes only declare the behaviour they need to change from standard SAC.

Parameters: learner_cls, algo_type, actor_kwargs, supports_symmetry

build_model_kwargs(*, obs_dim, critic_obs_dim)

Build kwargs shared by learner construction and collector actor construction.

Parameters: obs_dim, critic_obs_dim (both keyword-only)

Return type:

dict[str, Any]
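The idea of building one kwargs dict and handing it to both the learner and the collector actor can be sketched as below. This is a minimal illustration, not the unilab implementation: the function `build_model_kwargs_sketch`, the merge with an `actor_kwargs` argument, and the `hidden_dim` key are all assumptions made for the example. Only the keyword-only `obs_dim` and `critic_obs_dim` parameters and the `dict[str, Any]` return type come from the documented signature.

```python
from typing import Any, Optional


def build_model_kwargs_sketch(
    *,
    obs_dim: int,
    critic_obs_dim: int,
    actor_kwargs: Optional[dict[str, Any]] = None,  # hypothetical extra argument
) -> dict[str, Any]:
    """Hypothetical stand-in; the real method's body is not documented here."""
    # Start from the two documented dimensions.
    kwargs: dict[str, Any] = {"obs_dim": obs_dim, "critic_obs_dim": critic_obs_dim}
    # Assumption: runtime-specific actor kwargs are layered on top.
    kwargs.update(actor_kwargs or {})
    return kwargs


# The bare `*` makes both dimensions keyword-only, so they cannot be swapped by accident.
shared = build_model_kwargs_sketch(
    obs_dim=48, critic_obs_dim=60, actor_kwargs={"hidden_dim": 256}
)

# Both consumers receive a copy of the same kwargs, so their models agree on shapes.
learner_kwargs = dict(shared)
collector_actor_kwargs = dict(shared)
```

Building the dict once keeps the learner's actor and the collector's actor structurally identical, which matters when weights are copied from one to the other.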

__init__(learner_cls=None, algo_type=None, actor_kwargs=<factory>, supports_symmetry=True)

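Parameters:

learner_cls (default: None)
algo_type (default: None)
actor_kwargs (default: <factory>)
supports_symmetry (default: True)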
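The `<factory>` default in the signature above is what Python dataclasses display for a field declared with `default_factory`. The sketch below mirrors the documented defaults in a stand-in class to show how a custom runtime might override a single field. The class name `RuntimeSketch` and all type annotations are assumptions; the actual field types are not given in this reference.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class RuntimeSketch:
    """Hypothetical stand-in mirroring the documented __init__ defaults."""

    learner_cls: Optional[type] = None  # type is an assumption
    algo_type: Optional[str] = None  # type is an assumption
    # default_factory gives every instance its own dict (shown as <factory>).
    actor_kwargs: dict[str, Any] = field(default_factory=dict)
    supports_symmetry: bool = True


# No overrides: every field keeps its default.
default_runtime = RuntimeSketch()

# Override only the behaviour that differs; everything else stays at its default.
custom_runtime = RuntimeSketch(supports_symmetry=False)

# Mutating one instance's kwargs does not leak into the other.
custom_runtime.actor_kwargs["hidden_dim"] = 256  # hypothetical key
```

Using a factory rather than a literal `{}` default avoids the shared-mutable-default pitfall, so two runtimes never edit the same dict.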

unilab.algos.torch.offpolicy.runtime.resolve_custom_offpolicy_runtime(rl_cfg)

Resolve an optional custom off-policy runtime from owner config.
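How the runtime is looked up from `rl_cfg` is not described in this reference. As a rough sketch of the "optional resolution" pattern only, the example below assumes a hypothetical registry keyed by a hypothetical `owner` attribute on the config, and returns `None` when nothing custom is registered. The names `_RUNTIME_FACTORIES`, `resolve_runtime_sketch`, `owner`, and `my_task` are all invented for illustration, and a plain dict stands in for the runtime object.

```python
from types import SimpleNamespace
from typing import Any, Callable, Optional

# Hypothetical registry: owner name -> factory for runtime overrides.
_RUNTIME_FACTORIES: dict[str, Callable[[], dict[str, Any]]] = {
    "my_task": lambda: {"supports_symmetry": False},
}


def resolve_runtime_sketch(rl_cfg: Any) -> Optional[dict[str, Any]]:
    """Hypothetical stand-in; returns overrides, or None for the standard path."""
    # Assumption: the config may carry an `owner` attribute; a missing one yields None.
    owner = getattr(rl_cfg, "owner", None)
    factory = _RUNTIME_FACTORIES.get(owner)
    # No registered factory means the caller falls back to the default behaviour.
    return factory() if factory is not None else None


custom = resolve_runtime_sketch(SimpleNamespace(owner="my_task"))
standard = resolve_runtime_sketch(SimpleNamespace())
```

Returning `None` rather than raising lets the calling script treat the custom runtime as purely opt-in.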