Language Language English 简体中文 unilab.training.reward.extract_reward_config¶ unilab.training.reward.extract_reward_config(cfg)[source]¶ Extract and validate reward config from Hydra config. Parameters: cfg (DictConfig) – Hydra DictConfig containing reward section Return type: dict[str, dict[str, Any]] Returns: Dictionary with reward_config key for env_cfg_override Raises: ValueError – If reward config is missing