Language

unilab.training.reward.resolve_reward_dict¶

unilab.training.reward.resolve_reward_dict(cfg)[source]¶

Resolve the reward config from the final composed config.