unilab.training.reward.extract_reward_config

unilab.training.reward.extract_reward_config(cfg)

Extract and validate the reward configuration from a Hydra config.

Parameters:

cfg (DictConfig) – Hydra DictConfig containing a reward section

Return type:

dict[str, dict[str, Any]]

Returns:

Dictionary with a reward_config key, for use as env_cfg_override

Raises:

ValueError – If the reward config is missing
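
Example:

The contract described above (take a config, return a dictionary with a reward_config key, raise ValueError when the reward config is missing) can be sketched as follows. This is a minimal illustration and not the unilab implementation: the function name extract_reward_config_sketch, the top-level "reward" key, and the reward term names are all assumptions, and a plain mapping stands in for the Hydra DictConfig so the snippet runs without extra dependencies.

```python
from collections.abc import Mapping
from typing import Any


def extract_reward_config_sketch(cfg: Mapping[str, Any]) -> dict[str, dict[str, Any]]:
    """Hypothetical stand-in that mirrors the documented contract."""
    # Assumption: the reward section lives under a top-level "reward" key.
    reward = cfg.get("reward")
    if reward is None:
        raise ValueError("reward config is missing")
    # Copy into a plain dict so the result matches dict[str, dict[str, Any]].
    return {"reward_config": dict(reward)}


# Hypothetical reward terms, used only for illustration.
cfg = {"reward": {"alive_bonus": 1.0, "energy_penalty": -0.01}}
override = extract_reward_config_sketch(cfg)
```

In this sketch, override holds the reward terms under the reward_config key, and passing a config without a reward section raises ValueError. Which validation checks the real function performs beyond the missing-section case is not stated in this page.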