Finite Markov Decision Process a high-level introduction

I wanted to avoid making this post as there will be zero code. But as I assumed my series will be stand-alone I have to write it. So to move further I have to first establish a definition of Finite Markov Decision Process. It is a crucial assumption. Solving the problem of Finite Markov Decision … Continue reading Finite Markov Decision Process a high-level introduction

Non stationary K-armed bandit problem in Python

Recently I described simple K-bandit problem and solution. I also did a little introduction to Reinforcement Learning problem. Today I am still going to focus on the same problem with a little bit more terminology and few different algorithms (or more like few different variants). I am not going to exhaust the topic as it's … Continue reading Non stationary K-armed bandit problem in Python

RL basics and simple K-armed bandit problem

Basics As I introduced very basic what Reinforcement Learning is in the series hub. There are 4 basic terms which are worth to know when reading around RL stuff, a policy, a reward signal, a value function and┬áenvironment model. I will skip the model as we will explore model-free learning for now. We will get … Continue reading RL basics and simple K-armed bandit problem