Looking at the fatality rates of traffic accidents in the US and which factors might impact these rates, leveraging several big data tools: AWS EMR cluster, HDFS, Hive, Spark, Hbase.