Standard RL agents are vulnerable to "adversarial perturbations": small, calculated changes to their input that cause catastrophic failure.
A recent advancement uses a virtual trajectory method to predict an agent’s future unperturbed states. This allows the estimation of a Maximum Risk Value without needing to train a separate adversary.
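To make the virtual trajectory idea concrete, here is a minimal sketch. It assumes a known one-step dynamics model and a scalar risk function, neither of which the text specifies, and it takes the Maximum Risk Value to be simply the largest risk over the predicted unperturbed states. The names `virtual_trajectory`, `max_risk_value`, and the toy policy, dynamics, and risk functions are all hypothetical and chosen only for illustration; the published method may define and use these quantities differently.

```python
from typing import Callable, List

State = float
Action = float


def virtual_trajectory(
    state: State,
    policy: Callable[[State], Action],
    dynamics: Callable[[State, Action], State],
    horizon: int,
) -> List[State]:
    """Roll the policy forward through an assumed dynamics model.

    No perturbation is applied, so the result is the predicted
    sequence of future unperturbed states.
    """
    if horizon < 1:
        raise ValueError("horizon must be at least 1")
    states = []
    for _ in range(horizon):
        state = dynamics(state, policy(state))
        states.append(state)
    return states


def max_risk_value(
    state: State,
    policy: Callable[[State], Action],
    dynamics: Callable[[State, Action], State],
    risk: Callable[[State], float],
    horizon: int,
) -> float:
    """Assumed definition: the largest risk along the virtual trajectory."""
    return max(risk(s) for s in virtual_trajectory(state, policy, dynamics, horizon))


# Toy 1-D example (all three functions are illustrative assumptions).
def toy_policy(s: State) -> Action:
    return -0.5 * s  # push the state halfway toward zero


def toy_dynamics(s: State, a: Action) -> State:
    return s + a


def toy_risk(s: State) -> float:
    return abs(s)  # risk grows with distance from the origin


if __name__ == "__main__":
    print(virtual_trajectory(4.0, toy_policy, toy_dynamics, 3))
    print(max_risk_value(4.0, toy_policy, toy_dynamics, toy_risk, 3))
```

Note that nothing here trains or queries an adversary: the estimate comes entirely from rolling the agent's own policy forward, which is the property the passage highlights.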