Younghun Lee

Younghun Lee

Student

Toggle menu

📂 Total Posts # 23
Machine Learning
- IBM (16)
Statistics
- Time Series (1)
Projects
- Time Series (2)

Policy Gradient Method

Tags: RL

Categories: Reinforcement Learning

Updated: December 1, 2022

Policy Gradient Method

karpathy's blog

Proximal Policy Optimization

Noise Reduction: sample more trajectories
Rewards Normalization: batch normalization also used often.

Share on

Twitter Facebook LinkedIn

Leave a comment

You may also enjoy

Machine Learning on Apple stock daily return2

16 minute read

import numpy as np import pandas as pd import os import matplotlib.pyplot as plt %matplotlib inline import seaborn as sns

Machine Learning on Apple stock daily return

19 minute read

import numpy as np import pandas as pd import os import matplotlib.pyplot as plt %matplotlib inline import seaborn as sns

Reinforcement Learning

less than 1 minute read

Reinforcement Learning Agents interact with an Environment Choose from a set of available actions Actions impact the environment, which impacts agents...

Autoencoder and GAN

1 minute read

Autoencoder Several applications: Dimensional reduction Preprocessing for classification Identifying essential elements of the input data and filterin...