Carrot and Stick - Part 2 - Q Learning

01 Feb 2020 in Machine Learning

From Theory to Practice

When I think of Reinforcement Learning I usually think of an agent or robot traveling through a maze, avoiding traps, collecting supplies. In each step it observes its state, tries to estimate what will be the best action to take based on all the experience it gained. The way I visualize it, in each state, the robot scans through a database, looking for all the valid actions it can take in that state, and picks the one with the best chance of being the optimal action - Q Learning is a fundamental Reinforcement Learning algorithm that works similar to this. This post is dedicated to the Q Learning algorithm. By the end of this post you will be able to write your own Q Learning agent and test it in an interactive environment.

Carrot and Stick

01 Jan 2020 in Machine Learning

A Framework to Learn Reinforcement Learning

A while ago I went to a Meetup about Reinforcement Learning (RL), I got into a conversation with some one that sat next to me. He asked me several question about the subject - What is the difference between RL and supervised/unsupervised learning? What is the difference between several types of algorithms? When would you choose this framework over another one?

Not All Who Wander Are Lost

01 Mar 2019 in Machine Learning

A Quick Start Guide to Multi Armed Bandits

Decisions are hard, they have always been. And when you finally find something you like, there is always that thought, in the back of your head - “can I find something better?”.

A Means to an End

01 Feb 2019 in Data Science

Choosing the right mean for an estimator.

As data scientists we often need to estimate a value from a sample data set in order to answer a business question about the whole population. In most cases we get a sample data set, and we wish to estimate the mean of some value and so the sample mean is a good choice for an estimator.

Gouge Away

01 Feb 2019 in Big Data

Incorporating SQOOP in your Data Pipeline

You have just completed setting up your new and shiny EMR cluster, and want to unleash the full power of Spark on the nearest data-source.

Jewpyter

Carrot and Stick - Part 2 - Q Learning

Carrot and Stick

Not All Who Wander Are Lost

A Means to an End

Gouge Away

Error

Pagination

Templates (for web app):

Error