项目作者: oiricaud

项目描述 :
Implement decision process for Monte-Carlo, Value Iteration & Q-Learning
高级语言: Java
项目地址: git://github.com/oiricaud/Markov-Decision-Process-AI.git
创建时间: 2017-04-21T22:52:00Z
项目社区:https://github.com/oiricaud/Markov-Decision-Process-AI

开源协议:

下载


Markov-Decision-Process: Artificial Intelligence

Objective

To expirement with some of the basic algorithms for solving MDPs on a simple domain.

Groups: You may optionally work in groups of 2 students.

Doomain: The domain is based on a simple MDP originally designed by Rich Sutton at the University of Alberta. The example describes a Markov Decision Porcess that models the life of a student and the decisions one must make to both have a good time and remain in good academic standing.

States

  1. R = Rested
  2. T = Tired
  3. D = Homework Done
  4. U = Homework Undone
  5. 8p = eight o'clock pm

Actions

  1. P = Party
  2. R = Rest
  3. S = Study

any means any action has the same effect

note: not all actions are possible in all states

Red numbers are rewards

Green numbers are transition probabilities (all those not labeled are probability 1.0)

The gray rectangle denotes a terminal state.

See below for the diagram of the MDP.

alt tag