Project author: camandmum22

Project description:
Reinforcement Learning Techniques
Language: Python
Repository: git://github.com/camandmum22/gridworld_mdp.git
Created: 2017-08-01T17:43:29Z
Project page: https://github.com/camandmum22/gridworld_mdp


Gridworld & CartPole Solver

This project implements and compares different Reinforcement Learning approaches to solving the Gridworld and CartPole problems.

Markov Decision Process (MDP)

Models the Gridworld board as an MDP and solves it with the Value Iteration algorithm.
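
As an illustration of the idea, here is a minimal Value Iteration sketch on a small Gridworld. It is not the repository's code: the 3x4 layout, the wall position, the rewards, the discount factor, and the names `step`, `value_iteration`, and `greedy_policy` are all assumptions made for this example, and transitions are assumed deterministic.

```python
import numpy as np

# Hypothetical 3x4 Gridworld. Layout, rewards, and discount are illustrative
# assumptions, not values taken from the project.
ROWS, COLS = 3, 4
WALL = (1, 1)
TERMINALS = {(0, 3): 1.0, (1, 3): -1.0}        # terminal cell -> reward on entry
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right
GAMMA = 0.9
STEP_REWARD = -0.04


def step(state, action):
    """Deterministic transition: move if the target cell is free, else stay."""
    row, col = state[0] + action[0], state[1] + action[1]
    if not (0 <= row < ROWS and 0 <= col < COLS) or (row, col) == WALL:
        row, col = state
    return (row, col), TERMINALS.get((row, col), STEP_REWARD)


def action_values(values, state):
    """One-step lookahead: reward plus discounted value of each successor."""
    return [reward + GAMMA * values[nxt]
            for nxt, reward in (step(state, a) for a in ACTIONS)]


def value_iteration(tol=1e-6):
    """Apply the Bellman optimality backup until the largest change is below tol."""
    values = np.zeros((ROWS, COLS))
    while True:
        delta = 0.0
        for row in range(ROWS):
            for col in range(COLS):
                state = (row, col)
                if state == WALL or state in TERMINALS:
                    continue  # wall and terminal cells keep value 0
                best = max(action_values(values, state))
                delta = max(delta, abs(best - values[state]))
                values[state] = best
        if delta < tol:
            return values


def greedy_policy(values):
    """Pick, for every free cell, the index of the action with the best lookahead."""
    return {(row, col): int(np.argmax(action_values(values, (row, col))))
            for row in range(ROWS) for col in range(COLS)
            if (row, col) != WALL and (row, col) not in TERMINALS}
```

Calling `greedy_policy(value_iteration())` returns a mapping from each free cell to an index into `ACTIONS`. A stochastic board would replace the single successor in `action_values` with an expectation over successors.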

Q-Learning

Uses Q-Learning together with the Value Iteration algorithm to solve the Gridworld board problem.
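
The Q-Learning part can be sketched as a plain tabular learner with an epsilon-greedy behaviour policy. This is an assumption-laden example rather than the project's implementation: the 4x4 board, the reward of -1 per step, the hyperparameters, and the names `step`, `train_q_table`, and `greedy_path` are hypothetical, and the sketch does not show how the repository combines Q-Learning with Value Iteration.

```python
import numpy as np

# Hypothetical 4x4 board: start in the top-left cell (state 0), goal in the
# bottom-right cell. All sizes, rewards, and hyperparameters are assumptions.
SIZE = 4
GOAL = SIZE * SIZE - 1
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Move inside the board (clamped at the edges); -1 per step, 0 on reaching the goal."""
    row, col = divmod(state, SIZE)
    d_row, d_col = MOVES[action]
    row = min(max(row + d_row, 0), SIZE - 1)
    col = min(max(col + d_col, 0), SIZE - 1)
    nxt = row * SIZE + col
    done = nxt == GOAL
    return nxt, (0.0 if done else -1.0), done


def train_q_table(episodes=2000, alpha=0.5, gamma=0.95, epsilon=0.1,
                  max_steps=200, seed=0):
    """Tabular Q-Learning with an epsilon-greedy behaviour policy."""
    rng = np.random.default_rng(seed)
    q = np.zeros((SIZE * SIZE, len(MOVES)))
    for _episode in range(episodes):
        state = 0
        for _t in range(max_steps):
            if rng.random() < epsilon:
                action = int(rng.integers(len(MOVES)))   # explore
            else:
                action = int(np.argmax(q[state]))        # exploit
            nxt, reward, done = step(state, action)
            # Off-policy target: bootstrap from the best action in the next state.
            target = reward if done else reward + gamma * np.max(q[nxt])
            q[state, action] += alpha * (target - q[state, action])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the greedy policy from the start state and record visited states."""
    state, path = 0, [0]
    for _t in range(max_steps):
        state, _reward, done = step(state, int(np.argmax(q[state])))
        path.append(state)
        if done:
            break
    return path
```

Unlike Value Iteration, the update never consults the transition model directly; it only uses sampled `(state, action, reward, next_state)` tuples returned by `step`.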

Deep Q Network

Four Deep Q Network variants are implemented to solve the CartPole problem:

  • Conventional Deep Q Network
  • Deep Q Network with Experience Replay
  • Deep Q Network with Target Network
  • Deep Q Network with Experience Replay and Target Network (best accuracy level)
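
The two additions named in the list, experience replay and a target network, can be sketched independently of any deep learning framework. The example below is a simplified illustration, not the project's code: it swaps the neural network for a linear Q-function in NumPy to stay short, and the names `ReplayBuffer`, `q_values`, `td_targets`, and `train_step`, as well as the hyperparameters, are assumptions.

```python
import random
from collections import deque

import numpy as np

STATE_DIM, N_ACTIONS = 4, 2  # CartPole: 4 observation values, 2 actions


class ReplayBuffer:
    """Fixed-size store of transitions, sampled uniformly at random."""

    def __init__(self, capacity=10_000, seed=0):
        self.buffer = deque(maxlen=capacity)  # oldest transitions drop out first
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        batch = self.rng.sample(list(self.buffer), batch_size)
        states, actions, rewards, next_states, dones = map(np.array, zip(*batch))
        return states, actions, rewards, next_states, dones.astype(float)

    def __len__(self):
        return len(self.buffer)


def q_values(weights, states):
    """Linear stand-in for the Q-network: (batch, 4) @ (4, 2) -> (batch, 2)."""
    return states @ weights


def td_targets(target_weights, rewards, next_states, dones, gamma=0.99):
    """Bootstrap from the frozen target weights; no bootstrap on terminal steps."""
    next_q = q_values(target_weights, next_states).max(axis=1)
    return rewards + gamma * next_q * (1.0 - dones)


def train_step(online_w, target_w, batch, lr=0.01, gamma=0.99):
    """One gradient step on the squared TD error, using only the online weights."""
    states, actions, rewards, next_states, dones = batch
    targets = td_targets(target_w, rewards, next_states, dones, gamma)
    preds = q_values(online_w, states)[np.arange(len(actions)), actions]
    errors = targets - preds
    grad = np.zeros((N_ACTIONS, STATE_DIM))
    # Accumulate each sample's contribution into the row of the action it took.
    np.add.at(grad, actions, errors[:, None] * states)
    return online_w + lr * grad.T / len(actions)
```

In a training loop, the target weights would be refreshed only occasionally, for example `target_w = online_w.copy()` every fixed number of steps, so that the targets stay stable between refreshes. Sampling from the buffer breaks the correlation between consecutive CartPole steps.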