unilab.training.reward¶
Utility functions for reward config handling.
Functions
Extract and validate reward config from Hydra config. |
|
|
Resolve the reward config from the final composed config. |
- unilab.training.reward.resolve_reward_dict(cfg)[source]¶
Resolve the reward config from the final composed config.