Ementorhub

Skills you'll gain

Artificial Intelligence and Machine Learning (AI/ML)

Machine Learning

Reinforcement Learning

Sampling (Statistics)

Probability Distribution

Algorithms

Machine Learning Algorithms

Simulations

There are 5 modules in this course

By the end of this course you will be able to: - Understand Temporal-Difference learning and Monte Carlo as two strategies for estimating value functions from sampled experience - Understand the importance of exploration, when using sampled experience rather than dynamic programming sweeps within a model - Understand the connections between Monte Carlo and Dynamic Programming and TD. - Implement and apply the TD algorithm, for estimating value functions - Implement and apply Expected Sarsa and Q-learning (two TD methods for control) - Understand the difference between on-policy and off-policy control - Understand planning with simulated experience (as opposed to classic planning strategies) - Implement a model-based approach to RL, called Dyna, which uses simulated experience - Conduct an empirical study to see the improvements in sample efficiency when using Dyna

Monte Carlo Methods for Prediction & Control

Temporal Difference Learning Methods for Prediction

Temporal Difference Learning Methods for Control

Planning, Learning & Acting