- Introduction, without any prior requirement except being able to code, to the wide world of Reinforcement Learning. This course teaches how to make artificial agents that learn by “trial and error”, suited for various kinds of simple to complicated tasks. It also covers the basics of Neural Networks, Deep Learning, Control Engineering, Stochastic Optimization and Planning.
- Introduction and course Overview
- Markov Decision Processes
- Dynamic Programming
- Value Function Approximation
- Policy Gradients
- Exploration Exploitation
- Detour: Deep Learning, Neural Nets and Convnets
- Deep RL: Value Based
- Deep RL: Policy Based
- The future and open problems
Basic Reinforcement Learning book: Reinforcement Learning: An Introduction 2nd edition
- Practical Reinforcement Learning Coursera
- John Schulman’s and Pieter Abeel’s class: Deep Reinforcement Learning, Fall 2015
- Deep Reinforcement Learning and Control, CMU Spring 2017
- David Silver’s class: Reinforcement learning
- For neural networks material
- Andrej Karpathy’s course
- Geoffrey Hinton on Coursera
- Goodfellow’s Deep Learning book
- The OpenAI Gym will be use as a testbed for learning algorithms (if you do not program your own environment, which is also allowed).
- Experiments will be coded in Python. We recommend the use of “pip”, available on any distribution, and with every Python library available with a single “pip3 install library“. For Windows users, pip is also available with recent versions of Python 3, but in case of any problem, here is a link to the Anaconda framework.
- User manual to use the project starter code on the hydra cluster.
The corresponding competences:
- Knowledge and Insight:
The student has knowledge and insight in the domain of learning systems which allows him to possibly provide an original contribution to the domain.
- The use of Knowledge and Insight:
The student can combine the ideas covered in the course to obtain a suitable approach for a new problem.
- Judgements Ability:
The student can judge autonomously the scientific papers in this domain.
The student can present the content of their final project to the other students and communicate his ideas on the solutions.
- The student can autonomously search, read and implement papers in this area of research.
All detailed and official information about the course here >