项目作者: johntelforduk

项目描述 :
Explore, analyse and visualise Betfair Historical Data Feed using PySpark.
高级语言: Jupyter Notebook
项目地址: git://github.com/johntelforduk/betfair-data-analysis.git
创建时间: 2019-06-14T07:44:28Z
项目社区:https://github.com/johntelforduk/betfair-data-analysis

开源协议:MIT License

下载


Betfair Historical Data

A Jupyter notebook to explore, analyse and visualise Betfair historic data using PySpark.

Screenshot

Installation

The following packages should be installed.

  1. pip install pyspark==2.3.3
  2. pip install bz2file
  3. pip install glob2
  4. pip install jupyter
  5. pip install matplotlib
  6. pip install findspark
  7. pip install boto3

See requirements.txt for list of installed packages.

Running The Notebook

cd to the folder that contains the project. Then,

  1. (betfair) C:\betfair> jupyter notebook betfair_analysis.ipynb

Obtaining Data

Historical data may be downloaded from the Betfair website as follows.

  1. Request and downloaded the data you want to analyse from this area of Betfair website,
    https://historicdata.betfair.com/#/mydata
  2. Create a folder data in the project folder.
  3. The data.tar file should be opened using your choice of file compression tool - for example Z-Zip. Using the tool, navigate to data.tar\C:\data\xds\historic\BASIC\, and then extract to the folders to the project’s data folder.

Useful Resources

https://historicdata.betfair.com/Betfair-Historical-Data-Feed-Specification.pdf