OpenAI Spinning Up 번역] Part 3: 정책 최적화 소개(Intro to Policy Optimization)
Welcome to Spinning Up in Deep RL! 원본은 Part 3: Intro to Policy Optimization OpenAI Spinning Up 번역] Part 1: 강화학습 핵심 개념(Key Concepts in RL) OpenAI Spinning Up 번역] Part 2: 강화학습 알고리즘 종류(Kinds of RL Algorithms) OpenAI Spinning Up 번역] Part 3: 정책 최적화 소개(Intro to Policy Optimization) Table of Contents Part 3: Intro to Policy Optimization Deriving the Simplest Policy Gradient Implementing the Simplest Po..