unilab.logging.offpolicy.OffPolicyLogger¶
- class unilab.logging.offpolicy.OffPolicyLogger[source]¶
Bases:
BaseTrainingLoggerRich logger for off-policy RL algorithms (SAC, TD3, etc).
- Parameters:
Methods
__init__([algo_name, max_iterations, ...])close()Release live terminal state and backend handles without printing a summary.
finish(*[, title, extra_summary])log_buffer_fill(current, target)log_collector(total_steps, buffer_size[, ...])log_save(path)log_status(status)log_step(iteration[, metrics, reward, ...])set_collection_sync(enabled[, ...])start(*[, status])update_buffer_utilization(utilization)update_collector_timing(timing_ms)update_done_rates(timeout_rate, terminated_rate)update_ep_length(length)update_replay_queue(current_len, max_size)update_staging_pool(current_len, max_size)- __init__(algo_name='RL', max_iterations=1500, num_envs=4096, env_name='', obs_dim=0, action_dim=0, refresh_per_second=4, log_dir='', log_backend='tensorboard', wandb_project='unilab', wandb_entity=None, wandb_name='', wandb_group=None, wandb_job_type=None, wandb_tags=None, wandb_notes=None)[source]¶
- log_step(iteration, metrics=None, reward=None, reward_metrics=None, reward_components=None, train_time=0.0, wait_time=0.0, learner_incremental_h2d_time=0.0, weight_sync_time=0.0, extra_info=None)[source]¶
- close()¶
Release live terminal state and backend handles without printing a summary.
- Return type: