CS747-assignments

Foundations of Intelligent and Learning Agents

Assignment 1 (Problem Statement)

Solution of Multi-arm Bandit Problem and analysis of performance of different sampling algorithms such as Round-Robin, epsilon-greedy, UCB, KL-UCB and Thompson Sampling

Assignment 2 (Problem Statement)

Solution of MDPs using Linear Programming and Howard Policy Iteration. Reconstruction of a family of MDPs (differing in discount factor) based on same value function for a certain range of discount factors

Assignment 3 (Problem Statement)

Estimation of value function of a policy for a given MDP from a trajectory of the form state, action, reward, state, action, reward…

Assignment 4 (Problem Statement)

Simulating “Windy Gridworld” environment (as an episodic MDP) and solving the environment using SARSA(0) agent and further analysis