项目作者: kubistmi

项目描述 :
Review of the topic of the contextual multi-armed bandit. Includes implementation of easy-to-extend building blocks that form the contextual-bandit problem - e.g. agent, oracle, policy, environment.
高级语言: Jupyter Notebook
项目地址: git://github.com/kubistmi/contextual_MAB.git
创建时间: 2021-06-19T20:42:45Z
项目社区:https://github.com/kubistmi/contextual_MAB

开源协议:

下载


Contextual multi-armed bandit

This repo contains a review of the contextual multi-armed bandits.
Includes proposed framework for extendible building blocks that form the contextual bandit problem.
You can find the overview of the contextual bandits, dataset, and the framework in the presentation

Structure

  • src/ contains the modules of the framework
    • agent, environment, oracle, policy, data provider
  • train_manual demonstrates how to combine these modules to form a contextual bandit
  • train_grid allows you to run the model for a specified parameter grid