In this course, you will learn how to solve problems with large, high-dimensional, and potentially infinite state spaces. You will see that estimating value functions can be cast as a supervised learning problem---function approximation---allowing you to build agents that carefully balance generalization and discrimination in order to maximize reward. We will begin this journey by investigating how our policy evaluation or prediction methods like Monte Carlo and TD can be extended to the function approximation setting. You will learn about feature construction techniques for RL, and representation learning via neural networks and backprop. We conclude this course with a deep-dive into policy gradient methods; a way to learn policies directly without learning a value function. In this course you will solve two continuous-state control tasks and investigate the benefits of policy gradient methods in a continuous-action environment.
Ce cours fait partie de la Spécialisation Apprentissage par renforcement
Offert par
À propos de ce cours
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode.
Compétences que vous acquerrez
- Artificial Intelligence (AI)
- Machine Learning
- Reinforcement Learning
- Function Approximation
- Intelligent Systems
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode.
Programme de cours : ce que vous apprendrez dans ce cours
Welcome to the Course!
On-policy Prediction with Approximation
Constructing Features for Prediction
Control with Approximation
Policy Gradient
Avis
- 5 stars84,21 %
- 4 stars12,86 %
- 3 stars1,98 %
- 2 stars0,66 %
- 1 star0,26 %
Meilleurs avis pour PREDICTION AND CONTROL WITH FUNCTION APPROXIMATION
Martha and Adam are excellent instructors. This course is so well organized and presented. I have learned a lot! Thanks very much!
Did a good job of attaching a programming assignment to each lesson and giving clear and detailed instructions throughout
This specialization is a gift to humanity. It should have been inscribed into the golden disc of the Voyager and shared with the aliens.
Adam & Martha really make the walk through Sutton & Barto's book a real pleasure and easy to understand. The notebooks and the practice quizzes greatly help to consolidate the material.
À propos du Spécialisation Apprentissage par renforcement

Foire Aux Questions
Quand aurai-je accès aux vidéos de cours et aux devoirs ?
À quoi ai-je droit si je m'abonne à cette Spécialisation ?
Une aide financière est-elle possible ?
D'autres questions ? Visitez le Centre d'Aide pour les Étudiants.