football (soccer) datasets
This repository is made available under the Open Data Commons Attribution License
This repository contains some datasets for football (soccer).
(11/2024)
data/goals_time
now includes 23 competitions data/results
contains results of 1,237,935 football games in 207 top-tier domestic leagues and
20 international tournaments (UEFA EuroLeague/ChampionsLeague,etc.) from
1888-2023. The data is provided as a parquet file.
variable | description |
---|---|
home | home team name (not necessarily unique) |
away | away team name (not necessarily unique) |
date | date of match |
gh | goals for home team (including extra time and penalties) |
ga | goals for away team (including extra time and penalties) |
full_time | “F”=game ended in 90’, “E”=extra time, “P”=penalty shoot-out |
competition | country name of league or name of international competition |
home_ident | unique identifier of home team |
away_ident | unique identifier of away team |
home_country | country of home team |
away_country | country of away team |
home_code | country code of home team |
away_code | country code of away team |
home_continent | continent of home team |
away_continent | continent of away team |
continent | continent of competition |
level | “national”= domestic league, “international”= international cup |
This dataset contains errors for older games where sources are not as reliable.
A big issue are teams that merge/split/dissolve over time, which (I think) I did
not resolve consistently over time.
I have gathered this dataset over the course of 10 years and put a lot of effort in
it (see worldclubratings.com). If you use the data for any kind of project, please drop me a line
or open an issue in this repository. If you have any questions or requests,
please open an issue too.
The folder data/formations
contains lineups and formations datasets
The folder data/goals_time
contains details about each scored goal (scorer,
time) in the Top 15 European Leagues (according to the UEFA 5 year rating 2004), the Champions League and Euro League, and
a few national cups.
Similar datasets: