Hun's Blog
    Younghun Lee


      Policy Gradient Method

      Tags: RL

      Categories: Reinforcement Learning

      Updated: December 1, 2022


      Policy Gradient Method

      Reference: Karpathy's blog

      Proximal Policy Optimization

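      • Noise reduction: sample more trajectories and average their gradient estimates, since the estimate from a single trajectory is noisy.
      • Rewards normalization: rescale the rewards within each batch (subtract the mean, divide by the standard deviation). Batch normalization is also often used.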
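
      The two points above can be illustrated with a minimal sketch in plain Python. It is not taken from the post or from any particular PPO implementation: the helper names `normalize_rewards`, `average_estimates` and `noisy_gradient_estimate` are hypothetical, and the "gradient" is a single number with Gaussian noise standing in for a real per-trajectory policy gradient.

```python
import random
import statistics


def normalize_rewards(rewards, eps=1e-10):
    """Rescale a batch of rewards to zero mean and (roughly) unit std."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # population std over the batch
    # eps avoids division by zero when every reward in the batch is equal
    return [(r - mean) / (std + eps) for r in rewards]


def average_estimates(estimates):
    """Average per-trajectory estimates to reduce their noise."""
    return statistics.fmean(estimates)


def noisy_gradient_estimate(rng, true_gradient=1.0, noise_std=1.0):
    """Hypothetical stand-in for the gradient estimate from one trajectory."""
    return rng.gauss(true_gradient, noise_std)


rng = random.Random(0)

# Noise reduction: the average over many trajectories lands much closer
# to the assumed true value (1.0) than an average over a handful would.
few = average_estimates([noisy_gradient_estimate(rng) for _ in range(4)])
many = average_estimates([noisy_gradient_estimate(rng) for _ in range(4000)])

# Rewards normalization on a small example batch.
normalized = normalize_rewards([1.0, 2.0, 3.0, 4.0])

print("average over 4 trajectories:   ", few)
print("average over 4000 trajectories:", many)
print("normalized rewards:            ", normalized)
```

      The noise in the average shrinks roughly with the square root of the number of trajectories, which is why sampling more of them helps. Normalizing the rewards keeps the scale of the update comparable from batch to batch.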
