unilab.base.curriculum.PenaltyCurriculum

class unilab.base.curriculum.PenaltyCurriculum[source]

Bases: object

Adaptive penalty scaling based on episode length.

Parameters:

Methods

__init__(env[, enabled, initial_scale, ...])

update(average_episode_length)

Update penalty scale based on average episode length.

__init__(env, enabled=True, initial_scale=0.5, min_scale=0.5, max_scale=1.0, level_down_threshold=150.0, level_up_threshold=750.0, degree=0.001)[source]
Parameters:
update(average_episode_length)[source]

Update penalty scale based on average episode length.

Parameters:

average_episode_length (float)

Return type:

None