TRPO: Trust Region Policy Optimization
Paper review, 2022. 5. 11. 12:45
Trust Region Policy Optimization (mlr.press)
In this article, we describe a method for optimizing control policies, with guaranteed monotonic improvement. By making several approximations to the theoretically-justified scheme, we develop a pr...