unilab.algos.torch.common.stability¶
Numerical stability utilities for RL training.
Functions
|
Check if loss contains NaN or Inf values. |
|
Clip gradients by global norm. |
|
Make tensor numerically safe by clamping and replacing NaN values. |
- unilab.algos.torch.common.stability.check_nan_loss(loss, default_metrics)[source]¶
Check if loss contains NaN or Inf values.
- unilab.algos.torch.common.stability.clip_gradients(parameters, max_norm=10.0)[source]¶
Clip gradients by global norm.
- Parameters:
parameters – Model parameters
max_norm (
float) – Maximum gradient norm