Explore, analyse and visualise Betfair Historical Data Feed using PySpark.
A Jupyter notebook to explore, analyse and visualise Betfair historic data using PySpark.
The following packages should be installed.
pip install pyspark==2.3.3
pip install bz2file
pip install glob2
pip install jupyter
pip install matplotlib
pip install findspark
pip install boto3
See requirements.txt
for list of installed packages.
cd
to the folder that contains the project. Then,
(betfair) C:\betfair> jupyter notebook betfair_analysis.ipynb
Historical data may be downloaded from the Betfair website as follows.
data
in the project folder.data.tar
file should be opened using your choice of file compression tool - for example Z-Zip. Using the tool, navigate to data.tar\C:\data\xds\historic\BASIC\
, and then extract to the folders to the project’s data
folder.https://historicdata.betfair.com/Betfair-Historical-Data-Feed-Specification.pdf