unilab.training.reward.resolve_reward_dict

unilab.training.reward.resolve_reward_dict(cfg)

Resolve the reward config from the final composed config.

Parameters:

cfg (DictConfig): the final composed config from which the reward config is resolved.

Return type:

dict[str, Any]
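
The entry above gives only the signature, so the following is a minimal sketch of the general idea, not unilab's implementation. It assumes the reward settings sit under a top-level `reward` key, and it uses a plain mapping in place of a `DictConfig` so that it runs without extra dependencies. The names `resolve_reward_dict_sketch` and `_to_plain` are hypothetical.

```python
from collections.abc import Mapping
from typing import Any


def _to_plain(node: Any) -> Any:
    """Recursively copy nested mappings and sequences into plain dicts and lists."""
    if isinstance(node, Mapping):
        return {str(key): _to_plain(value) for key, value in node.items()}
    if isinstance(node, (list, tuple)):
        return [_to_plain(value) for value in node]
    return node


def resolve_reward_dict_sketch(cfg: Mapping[str, Any]) -> dict[str, Any]:
    """Hypothetical stand-in: return the reward section of a composed config."""
    # Assumption: the reward config lives under a top-level "reward" key.
    reward = cfg.get("reward")
    if reward is None:
        return {}
    if not isinstance(reward, Mapping):
        raise TypeError("expected the 'reward' node to be a mapping")
    return _to_plain(reward)


# Illustrative composed config; the keys and values are made up for this example.
composed_cfg = {
    "task": "walk",
    "reward": {
        "tracking": {"weight": 1.0},
        "energy": {"weight": -0.01},
    },
}
reward_cfg = resolve_reward_dict_sketch(composed_cfg)
```

Returning a plain `dict[str, Any]` rather than a config node matches the documented return type and lets downstream code treat the result as ordinary Python data.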