Published on September 11, 2018 by

What is the best way to route data in a network of routers spread out across the globe? This ‘internet of things’-based problem can be solved using reinforcement learning! In this video, i’ll explain the 2 types of policies, the bellman equation, and the value function. All of these concepts are crucial in the RL pipeline and using animations + code, i’ll break them down. Enjoy!

Code for this video:
https://github.com/llSourcell/Sensor_Networks

Please Subscribe! And like. And comment. That’s what keeps me going.

Want more education? Connect with me here:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
instagram: https://www.instagram.com/sirajraval

Github Syllabus:
https://github.com/llSourcell/Move_37_Syllabus

Take the full course at the School of AI:
https://www.theschool.ai

Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/

And please support me on Patreon:
https://www.patreon.com/user?u=3191693

Category Tag