unilab.algos.torch.offpolicy.runtime.OffPolicyRuntime

class unilab.algos.torch.offpolicy.runtime.OffPolicyRuntime

Optional runtime overrides for the generic off-policy SAC path.

All fields are optional so custom runtimes only declare the behaviour they need to change from standard SAC.
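The "declare only what you change" pattern can be illustrated with a stand-in dataclass. This is a minimal sketch, not the library's implementation: `RuntimeSketch` and `MyLearner` are hypothetical names, the field types are assumptions, and only the field names and defaults are taken from the documented `__init__` signature.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class RuntimeSketch:
    """Hypothetical stand-in mirroring the documented field names and defaults."""

    learner_cls: Optional[type] = None   # assumed type; None means "use the standard learner"
    algo_type: Optional[str] = None      # assumed type; None means "use the standard algorithm"
    actor_kwargs: dict[str, Any] = field(default_factory=dict)  # fresh dict per instance
    supports_symmetry: bool = True


class MyLearner:
    """Hypothetical custom learner class, used only for illustration."""


# No overrides: every field keeps its default.
default_runtime = RuntimeSketch()

# Override only the fields that differ; everything else stays at its default.
custom_runtime = RuntimeSketch(learner_cls=MyLearner, supports_symmetry=False)
```

Because `actor_kwargs` is built by a factory, each instance gets its own dictionary, so mutating one runtime's kwargs does not leak into another.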

Methods

`__init__`([learner_cls, algo_type, ...])
`build_model_kwargs`(*, obs_dim, critic_obs_dim)	Build kwargs shared by learner construction and collector actor construction.

Attributes

build_model_kwargs(*, obs_dim, critic_obs_dim)

Build kwargs shared by learner construction and collector actor construction.

Parameters:

obs_dim (keyword-only)
critic_obs_dim (keyword-only)

Return type:

dict[str, Any]
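The idea of building one kwargs dictionary and handing it to both consumers might look like the sketch below. The function body is an assumption: the documentation only states the keyword-only signature and the `dict[str, Any]` return type, so `build_model_kwargs_sketch`, the merge behaviour, and the `hidden_dim` key are all hypothetical.

```python
from typing import Any


def build_model_kwargs_sketch(
    actor_kwargs: dict[str, Any], *, obs_dim: int, critic_obs_dim: int
) -> dict[str, Any]:
    """Hypothetical helper: combine observation sizes with actor overrides."""
    # Assumed behaviour: start from the dimensions, then layer on the overrides.
    kwargs: dict[str, Any] = {"obs_dim": obs_dim, "critic_obs_dim": critic_obs_dim}
    kwargs.update(actor_kwargs)
    return kwargs


# Build the kwargs once, then give each consumer its own copy so that the
# learner and the collector actor are constructed from identical settings.
shared = build_model_kwargs_sketch({"hidden_dim": 256}, obs_dim=48, critic_obs_dim=64)
learner_kwargs = dict(shared)
collector_kwargs = dict(shared)
```

Making the arguments keyword-only, as the documented signature does, keeps `obs_dim` and `critic_obs_dim` from being swapped by accident at the call site.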

__init__(learner_cls=None, algo_type=None, actor_kwargs=<factory>, supports_symmetry=True)

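Parameters:

learner_cls (default: None)
algo_type (default: None)
actor_kwargs (default: built by a factory)
supports_symmetry (default: True)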