unilab.algos.torch.offpolicy.runtime.OffPolicyRuntime

class unilab.algos.torch.offpolicy.runtime.OffPolicyRuntime[source]

Bases: object

Optional runtime overrides for the generic off-policy SAC path.

All fields are optional so custom runtimes only declare the behaviour they need to change from standard SAC.

Parameters:

Methods

__init__([learner_cls, algo_type, ...])

build_model_kwargs(*, obs_dim, critic_obs_dim)

Build kwargs shared by learner construction and collector actor construction.

Attributes

learner_cls: type[Any] | None = None
algo_type: str | None = None
actor_kwargs: dict[str, Any]
supports_symmetry: bool = True
build_model_kwargs(*, obs_dim, critic_obs_dim)[source]

Build kwargs shared by learner construction and collector actor construction.

Parameters:
  • obs_dim (int)

  • critic_obs_dim (int)

Return type:

dict[str, Any]

__init__(learner_cls=None, algo_type=None, actor_kwargs=<factory>, supports_symmetry=True)
Parameters: