Reinforcement Learning
Tags: Deep Learning, IBM, Week6
Categories: IBM Machine Learning
Reinforcement Learning
- Agents interact with an Environment
- Choose from a set of available actions
- Actions impact the environment, which impacts agents via rewards
- Rewards are unknown and must be estimated by the agent
- Solutions represent a Policy by which Agents choose Actions in response to the State
- Agents typically aim to maximize expected cumulative reward over time
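
The points above can be illustrated with a minimal sketch, assuming the simplest possible setting: a multi-armed bandit, which has actions and rewards but only a single state. The function name `run_bandit`, the `true_means` values, and the epsilon-greedy rule are illustrative assumptions, not something taken from the course material. The agent never sees `true_means`; it only estimates each action's reward from the samples it receives.

```python
import random

def run_bandit(true_means, steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent on a hypothetical multi-armed bandit."""
    rng = random.Random(seed)
    n_actions = len(true_means)
    estimates = [0.0] * n_actions  # agent's running estimate of each action's reward
    counts = [0] * n_actions       # how many times each action was chosen
    total_reward = 0.0

    for _ in range(steps):
        # Policy: explore a random action with probability epsilon,
        # otherwise exploit the action with the highest current estimate.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: estimates[a])

        # Environment: responds with a noisy reward the agent cannot predict exactly.
        reward = rng.gauss(true_means[action], 1.0)

        # Update the estimate as an incremental average of observed rewards.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]
        total_reward += reward

    return estimates, counts, total_reward

estimates, counts, total_reward = run_bandit([0.2, 0.5, 0.9])
print("estimated rewards:", [round(e, 2) for e in estimates])
print("times each action was chosen:", counts)
```

With enough steps the estimates tend to approach the hidden means, and the agent tends to choose the highest-reward action most often. A full reinforcement learning problem would also make the policy depend on the State, which this sketch deliberately leaves out.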