Project author: krishnaw14

Project description:
Foundations of Intelligent and Learning Agents
Language: Python
Repository: git://github.com/krishnaw14/CS747-assignments.git
Created: 2019-08-26T10:19:04Z
Project page: https://github.com/krishnaw14/CS747-assignments

License:



CS747-assignments

Foundations of Intelligent and Learning Agents

Assignment 1 (Problem Statement)

Solves the multi-armed bandit problem and analyzes the performance of several sampling algorithms: round-robin, epsilon-greedy, UCB, KL-UCB, and Thompson sampling.
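As a rough illustration of how such a comparison can be set up, here is a minimal sketch. It is not the repository's code. It assumes Bernoulli arms, uses the UCB1 bonus for "UCB" and a Beta(1, 1) prior for Thompson sampling, and leaves out KL-UCB. The names `run_bandit` and `pull` are hypothetical.

```python
import math
import random


def pull(mean, rng):
    """Sample a Bernoulli reward with success probability `mean`."""
    return 1 if rng.random() < mean else 0


def run_bandit(means, horizon, algorithm="ucb", epsilon=0.1, seed=0):
    """Play a Bernoulli bandit for `horizon` pulls; return the pseudo-regret.

    Pseudo-regret sums (best mean - mean of the pulled arm) over all pulls.
    """
    rng = random.Random(seed)
    n_arms = len(means)
    counts = [0] * n_arms    # number of pulls per arm
    totals = [0.0] * n_arms  # summed reward per arm
    best_mean = max(means)
    regret = 0.0

    for t in range(horizon):
        if t < n_arms:
            arm = t  # pull every arm once so empirical means are defined
        elif algorithm == "round-robin":
            arm = t % n_arms
        elif algorithm == "epsilon-greedy":
            if rng.random() < epsilon:
                arm = rng.randrange(n_arms)  # explore uniformly
            else:
                arm = max(range(n_arms), key=lambda a: totals[a] / counts[a])
        elif algorithm == "ucb":
            # UCB1 (assumed variant): empirical mean plus exploration bonus.
            arm = max(
                range(n_arms),
                key=lambda a: totals[a] / counts[a]
                + math.sqrt(2 * math.log(t) / counts[a]),
            )
        elif algorithm == "thompson":
            # Sample from each arm's Beta posterior, play the largest sample.
            arm = max(
                range(n_arms),
                key=lambda a: rng.betavariate(
                    totals[a] + 1, counts[a] - totals[a] + 1
                ),
            )
        else:
            raise ValueError(f"unknown algorithm: {algorithm}")

        reward = pull(means[arm], rng)
        counts[arm] += 1
        totals[arm] += reward
        regret += best_mean - means[arm]

    return regret
```

Calling `run_bandit([0.2, 0.5, 0.8], 5000, "thompson")` and repeating it for the other algorithm names gives one regret figure per algorithm. Averaging over several seeds would give a steadier comparison.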

Assignment 2 (Problem Statement)

Solves MDPs using linear programming and Howard's policy iteration. Also constructs a family of MDPs, differing in discount factor, that share the same value function over a certain range of discount factors.
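A minimal sketch of Howard's policy iteration follows, under stated assumptions. The MDP encoding (`T[s][a]` as a list of `(probability, next_state, reward)` tuples) and all function names are hypothetical and not taken from the assignment. Policy evaluation is done by iterative sweeps rather than by solving the linear system exactly, which keeps the example free of dependencies.

```python
def q_value(T, V, s, a, gamma):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + gamma * V[s2]) for p, s2, r in T[s][a])


def evaluate_policy(T, policy, gamma, tol=1e-10):
    """Iterative policy evaluation (in-place sweeps until convergence)."""
    V = [0.0] * len(T)
    while True:
        delta = 0.0
        for s in range(len(T)):
            v = q_value(T, V, s, policy[s], gamma)
            delta = max(delta, abs(v - V[s]))
            V[s] = v
        if delta < tol:
            return V


def howard_policy_iteration(T, gamma):
    """Switch every improvable state to a greedy action until none remain."""
    policy = [0] * len(T)
    while True:
        V = evaluate_policy(T, policy, gamma)
        changed = False
        for s in range(len(T)):
            best = max(range(len(T[s])), key=lambda a: q_value(T, V, s, a, gamma))
            # The small margin guards against switching on numerical noise.
            if q_value(T, V, s, best, gamma) > V[s] + 1e-8:
                policy[s] = best
                changed = True
        if not changed:
            return policy, V


# Hypothetical two-state MDP: action 0 stays put, action 1 moves to the other state.
EXAMPLE_MDP = [
    [[(1.0, 0, 1.0)], [(1.0, 1, 0.0)]],  # state 0: stay (reward 1) or move (reward 0)
    [[(1.0, 1, 2.0)], [(1.0, 0, 0.0)]],  # state 1: stay (reward 2) or move (reward 0)
]

if __name__ == "__main__":
    print(howard_policy_iteration(EXAMPLE_MDP, gamma=0.9))
```

In the example MDP with discount factor 0.9, staying in state 1 is worth 2 / (1 - 0.9) = 20, so moving from state 0 (worth 0.9 × 20 = 18) beats staying there (worth 10).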

Assignment 3 (Problem Statement)

Estimates the value function of a policy for a given MDP from a trajectory of the form state, action, reward, state, action, reward, …
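One common way to do this is TD(0), sketched below. The source does not say which estimator the assignment uses, so treat this as one possible approach. The flat-list trajectory layout, the function names, and the default step size are all assumptions made for illustration.

```python
def to_transitions(flat):
    """Split [s0, a0, r0, s1, a1, r1, ..., sT] into (s, a, r, s_next) tuples."""
    return [
        (flat[i], flat[i + 1], flat[i + 2], flat[i + 3])
        for i in range(0, len(flat) - 3, 3)
    ]


def td0_estimate(n_states, transitions, gamma, alpha=0.1, sweeps=1):
    """Estimate V for the policy that generated `transitions` using TD(0).

    The action is ignored: TD(0) evaluates the behaviour policy itself.
    """
    V = [0.0] * n_states
    for _ in range(sweeps):
        for s, _action, r, s_next in transitions:
            # Move V[s] toward the one-step bootstrapped target.
            V[s] += alpha * (r + gamma * V[s_next] - V[s])
    return V
```

With a long trajectory, a small `alpha` and several `sweeps` over the same data usually give a smoother estimate than a single pass.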

Assignment 4 (Problem Statement)

Simulates the “Windy Gridworld” environment as an episodic MDP, solves it with a SARSA(0) agent, and provides further analysis.
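The sketch below shows what such a setup can look like. It assumes the standard Windy Gridworld from Sutton and Barto (7×10 grid, four moves, reward of -1 per step, start (3, 0), goal (3, 7)). The hyperparameters and the names `step`, `choose_action`, and `sarsa` are illustrative and are not taken from the assignment.

```python
import random

ROWS, COLS = 7, 10
WIND = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]  # upward push for each column
START, GOAL = (3, 0), (3, 7)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply the move plus the wind of the current column; reward is -1."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = min(max(row + d_row - WIND[col], 0), ROWS - 1)
    new_col = min(max(col + d_col, 0), COLS - 1)
    next_state = (new_row, new_col)
    return next_state, -1, next_state == GOAL


def choose_action(Q, state, epsilon, rng):
    """Epsilon-greedy selection (ties go to the lowest action index)."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = Q[state]
    return max(range(len(ACTIONS)), key=lambda a: values[a])


def sarsa(episodes=300, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Run SARSA(0); return the Q-table and the length of every episode."""
    rng = random.Random(seed)
    Q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    lengths = []
    for _ in range(episodes):
        state = START
        action = choose_action(Q, state, epsilon, rng)
        steps, done = 0, False
        while not done:
            next_state, reward, done = step(state, action)
            next_action = choose_action(Q, next_state, epsilon, rng)
            # On-policy target: bootstrap from the action actually chosen next.
            bootstrap = 0.0 if done else gamma * Q[next_state][next_action]
            Q[state][action] += alpha * (reward + bootstrap - Q[state][action])
            state, action = next_state, next_action
            steps += 1
        lengths.append(steps)
    return Q, lengths
```

Plotting the cumulative sum of `lengths` against the episode index gives the usual learning curve for this environment: episodes become shorter as the agent learns to exploit the wind.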