Policy and value iteration

From the course by National Research University Higher School of Economics
Practical Reinforcement Learning
53 ratings
Course 4 of 7 in the Advanced Machine Learning Specialization
From the lesson
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action, and how to pick the best actions based on that return.

Meet the instructors

  • Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab
    HSE Faculty of Computer Science
  • Alexander Panin
    Lecturer
    HSE Faculty of Computer Science
