Policy gradient formalism

From the course by National Research University Higher School of Economics
Practical Reinforcement Learning
39 ratings
Course 4 of 7 in the Advanced Machine Learning Specialization
From the lesson
Policy-based methods
We spent the previous three modules on value-based methods: learning state values, action values, and so on. Now it is time to look at an alternative approach that does not require you to predict all future rewards in order to learn something.

Meet the instructors

  • Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab
    HSE Faculty of Computer Science
  • Alexander Panin
    Lecturer
    HSE Faculty of Computer Science
