unilab.training.reward.resolve_reward_dict

unilab.training.reward.resolve_reward_dict(cfg)

Resolve the reward config from the final composed config.

Parameters: cfg (DictConfig)
Return type: dict[str, Any]
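
The reference entry does not show how the reward config is resolved. The following is only a minimal sketch of what such a resolution step might look like, not the library's implementation. It assumes the composed config keeps reward settings under a top-level `reward` key and that "resolve" means converting that node to a plain dict with interpolations evaluated. The function name `resolve_reward_dict_sketch`, the `reward` key, and the example reward terms are all hypothetical.

```python
from typing import Any, Mapping

try:
    # omegaconf provides DictConfig; the import is guarded so the sketch
    # also runs on plain dicts when omegaconf is not installed.
    from omegaconf import OmegaConf
except ImportError:
    OmegaConf = None


def resolve_reward_dict_sketch(cfg: Mapping[str, Any]) -> dict[str, Any]:
    """Return the reward section of a composed config as a plain dict.

    Assumption: the reward settings live under a top-level "reward" key.
    """
    reward = cfg["reward"]  # hypothetical key, not confirmed by the source

    if OmegaConf is not None and OmegaConf.is_config(reward):
        # Convert the OmegaConf node to built-in containers and
        # evaluate any ${...} interpolations along the way.
        return OmegaConf.to_container(reward, resolve=True)

    # Plain mapping: a shallow copy is enough for this illustration.
    return dict(reward)


# Usage with a stand-in for the composed config (values are made up).
composed = {"seed": 0, "reward": {"alive_bonus": 1.0, "energy_penalty": -0.01}}
reward_cfg = resolve_reward_dict_sketch(composed)
```

Returning built-in containers rather than a `DictConfig` node matches the documented return type `dict[str, Any]` and keeps downstream reward code independent of the config library.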