Tom Pecher | Portfolio

Reinforcement Learning: Course Introduction

Reinforcement learning (RL) is often considered the odd one out of the three subfields of machine learning (ML), as its principles and its way of using data are quite different from those of supervised and unsupervised learning. The goal of RL is to develop agents (algorithms that can act) that make decisions based on interactions with an environment; it is popularly defined as "a goal-driven approach to decision making problems". Theoretically speaking, however, RL is best described as a data-driven extension of Markov decision processes (MDPs) (see Traditional AI). In the RL paradigm, an agent moves between environmental states by taking actions and receives rewards based on the current state. The goal of designing an RL algorithm is to create agents that maximise the sum of these rewards over time. Despite the simple premise, this paradigm can be applied to a wide variety of problems using different algorithms and techniques. In many ways, RL is the most intuitive of the three subfields of ML, as we can generally make sense of the strategies produced by the algorithms, even as deep learning becomes more involved.
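
To make the agent-environment loop described above concrete, here is a minimal sketch in Python. Everything in it is an illustrative assumption rather than course material: the `ChainEnvironment` class, its reward values, and the two hand-written policies are hypothetical, and no learning takes place. It only shows how states, actions and rewards fit together, and how the sum of rewards over an episode is accumulated.

```python
import random


class ChainEnvironment:
    """Hypothetical corridor of states 0..n_states-1 (an assumption for illustration).

    The agent starts in state 0. Reaching the last state gives a reward of +1.0
    and ends the episode; every other state gives a small penalty of -0.1.
    """

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.state = 0

    def reset(self):
        """Put the agent back at the start and return the initial state."""
        self.state = 0
        return self.state

    def step(self, action):
        """Apply an action (-1 = left, +1 = right) and return (state, reward, done)."""
        # Clip the move so the agent cannot leave the corridor.
        self.state = min(max(self.state + action, 0), self.n_states - 1)
        done = self.state == self.n_states - 1
        # The reward depends only on the state the agent is now in.
        reward = 1.0 if done else -0.1
        return self.state, reward, done


def run_episode(env, policy, max_steps=100):
    """Run one episode and return the sum of rewards collected."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state)                  # the agent chooses an action
        state, reward, done = env.step(action)  # the environment responds
        total_reward += reward
        if done:
            break
    return total_reward


def always_right(state):
    """A fixed policy that always moves towards the goal."""
    return 1


def random_policy(state):
    """A policy that ignores the state and moves left or right at random."""
    return random.choice([-1, 1])


if __name__ == "__main__":
    env = ChainEnvironment()
    print("always_right return:", run_episode(env, always_right))
    print("random_policy return:", run_episode(env, random_policy))
```

In this toy setting the `always_right` policy reaches the goal in the fewest possible steps, so the random policy can never collect a higher total reward. An RL algorithm would aim to discover such a policy from interaction data instead of having it written by hand.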

Course Structure:

Section 1: Foundations and Prerequisites
Section 2: Markov Decision Processes
Section 3: Dynamic Programming
Section 4: Reinforcement Learning Paradigm
Section 5: Temporal Difference Learning
Section 6: Function Approximation
Section 7: Deep Reinforcement Learning
Section 8: Advanced Deep RL Architectures
Section 9: Model-Based Reinforcement Learning
Section 10: Multi-Agent Reinforcement Learning
Section 11: Hierarchical Reinforcement Learning
Section 12: Exploration-Exploitation Strategies
Section 13: Imitation and Inverse Reinforcement Learning
Section 14: Safe and Robust Reinforcement Learning
Section 15: Transfer and Meta Reinforcement Learning
Section 16: Offline Reinforcement Learning
Section 17: Distributed and Scalable Reinforcement Learning
Section 18: Special Applications and Domains
