unilab.algos.mlx.ppo.runner
Runner-style utilities for MLX PPO.
This module keeps training-script entry points thin, similar to how the rsl-rl runner is used.
Functions
get_latest_checkpoint(run_dir) | Find the latest model_*.safetensors checkpoint in a run dir.
get_latest_run(log_dir) | Find latest run directory under a task log root.
Classes
MLXPPOAgent | High-level PPO wrapper to keep train script lightweight.
TensorboardScalarWriter | Minimal scalar writer based on tensorboard event files.
- unilab.algos.mlx.ppo.runner.tree_map(fn, tree)
- class unilab.algos.mlx.ppo.runner.TensorboardScalarWriter
Bases: object
Minimal scalar writer based on tensorboard event files.
- Parameters: log_dir (Path)
- unilab.algos.mlx.ppo.runner.get_latest_run(log_dir)
Find latest run directory under a task log root.
- Parameters: log_dir (Path)
- Return type: Path | None
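To make the lookup concrete, here is a minimal sketch of one way to pick the latest run directory. It is not the module's implementation: the helper name `latest_run_sketch` is hypothetical, and it assumes run directories are named so that lexicographic order matches chronological order (for example, timestamped names). The real function may use a different criterion, such as modification time.

```python
from pathlib import Path
from typing import Optional


def latest_run_sketch(log_dir: Path) -> Optional[Path]:
    """Hypothetical stand-in for get_latest_run; not the library's code."""
    if not log_dir.is_dir():
        # Mirror the documented Path | None return type for a missing root.
        return None
    # Assumption: each run is a subdirectory whose name sorts chronologically.
    runs = sorted(
        (p for p in log_dir.iterdir() if p.is_dir()),
        key=lambda p: p.name,
    )
    return runs[-1] if runs else None
```

Returning `None` for a missing or empty log root matches the documented return type and leaves the caller to decide whether to start a fresh run.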
- unilab.algos.mlx.ppo.runner.get_latest_checkpoint(run_dir)
Find the latest model_*.safetensors checkpoint in a run dir.
- Parameters: run_dir (Path)
- Return type: Path | None
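The checkpoint lookup can be sketched in a similar way. This is an illustration under stated assumptions rather than the module's code: the helper name `latest_checkpoint_sketch` is hypothetical, the `*` in `model_*.safetensors` is assumed to be an integer iteration count, and "latest" is assumed to mean the largest such number.

```python
import re
from pathlib import Path
from typing import Optional

# Assumption: the wildcard in model_*.safetensors is an integer iteration count.
_CKPT_PATTERN = re.compile(r"model_(\d+)\.safetensors")


def latest_checkpoint_sketch(run_dir: Path) -> Optional[Path]:
    """Hypothetical stand-in for get_latest_checkpoint; not the library's code."""
    if not run_dir.is_dir():
        return None
    best_step, best_path = -1, None
    for path in run_dir.glob("model_*.safetensors"):
        match = _CKPT_PATTERN.fullmatch(path.name)
        if match is None:
            continue  # skip non-numeric names such as model_final.safetensors
        step = int(match.group(1))
        if step > best_step:
            best_step, best_path = step, path
    return best_path
```

Comparing the parsed integers rather than the file names matters here, because a plain string sort would rank `model_900.safetensors` after `model_1000.safetensors`. Since both lookups are documented as returning `Path | None`, resume logic that chains them should check for `None` at each step.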
- class unilab.algos.mlx.ppo.runner.MLXPPOAgent
Bases: object
High-level PPO wrapper to keep train script lightweight.
- update(buffer, last_obs)
- Parameters: buffer (RolloutBuffer), last_obs (array)