Algorithms
Algorithm pages describe what each checked-in entrypoint runs, where its config lives, and which command shape selects it. For general flags, see CLI Reference.
| Algorithm | Style | Entrypoint | Config Evidence |
|---|---|---|---|
| PPO | synchronous on-policy | | |
| APPO | async on-policy | | |
| SAC | off-policy | | |
| TD3 | off-policy | | |
| FlashSAC | off-policy | | |
| HIM-PPO | height-estimator PPO path | | |
| HORA | teacher/student distillation path | | |
| MLX PPO | synchronous on-policy for Apple Silicon | | |