unilab.algos.torch.appo.runner
APPO Runner — Asynchronous PPO with native multiprocessing.
- Pipeline:
  1. The collector subprocess publishes rollout payloads to the RolloutRingBuffer.
  2. The learner reads rollouts from the buffer and computes V-trace-corrected updates.
  3. Updated weights are synced back to the collector via SharedWeightSync.
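
The V-trace correction in the learner step can be illustrated with a minimal, standard-library-only sketch. It follows the usual V-trace recursion with clipped importance weights and is not taken from the unilab source: the function name `vtrace_targets`, its parameters, and the defaults for `gamma`, `rho_bar`, and `c_bar` are all assumptions made for illustration. It also assumes a single unterminated trajectory, so episode boundaries are not handled.

```python
import math


def vtrace_targets(behaviour_logp, target_logp, rewards, values,
                   bootstrap_value, gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """Return V-trace value targets for one trajectory (hypothetical helper).

    behaviour_logp: log-probs of the taken actions under the collector's policy.
    target_logp:    log-probs of the same actions under the learner's policy.
    values:         learner value estimates V(x_t) for t = 0..T-1.
    bootstrap_value: V(x_T), the estimate for the state after the last step.
    """
    horizon = len(rewards)
    # Importance ratios pi(a|x) / mu(a|x), clipped separately for rho and c.
    ratios = [math.exp(t - b) for t, b in zip(target_logp, behaviour_logp)]
    rhos = [min(rho_bar, r) for r in ratios]
    cs = [min(c_bar, r) for r in ratios]

    # Clipped temporal-difference errors: rho_t * (r_t + gamma * V(x_{t+1}) - V(x_t)).
    next_values = list(values[1:]) + [bootstrap_value]
    deltas = [rhos[t] * (rewards[t] + gamma * next_values[t] - values[t])
              for t in range(horizon)]

    # Backward recursion: v_t - V(x_t) = delta_t + gamma * c_t * (v_{t+1} - V(x_{t+1})).
    targets = [0.0] * horizon
    correction = 0.0
    for t in reversed(range(horizon)):
        correction = deltas[t] + gamma * cs[t] * correction
        targets[t] = values[t] + correction
    return targets
```

When the collector and learner policies agree (all ratios equal 1), the targets reduce to ordinary bootstrapped n-step returns. The clipping only takes effect once the collector's weights lag behind the learner's, which is the situation an asynchronous pipeline like the one above creates.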
Classes
APPORunner | APPO async runner using shared memory.
- class unilab.algos.torch.appo.runner.APPORunner
Bases: AsyncRunner
APPO async runner using shared memory.
- Parameters: