(논문 요약) Soft Adaptive Policy Optimization (paper)

핵심 내용

  • objective 와 gradient weight 를 smooth 하게 변형.

Original formulations

Unified View