Spark hands-on exercise for the lecture Distributed Data Analytics
sbt assembly
--path <path to folder>
- Path to the folder containing the dataset CSV files. Optional, defaults to ./TPCH.
--paths <fileA,fileB,fileC>
- Direct paths to the dataset files, separated by commas. Optional, defaults to the --path argument.
--cores <number of cores>
- Number of local cores to use. Optional, defaults to 4.
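
The options above could be read along the following lines. This is only a minimal sketch, not the project's actual parser: the names `CliSketch`, `Config`, `parseArgs`, and `master` are hypothetical, and mapping `--cores` to a Spark master URL of the form `local[N]` is an assumption about how the core count is used.

```scala
// Hypothetical sketch; none of these names are taken from the project itself.
object CliSketch {

  // Defaults mirror the option list: ./TPCH for --path, 4 for --cores.
  // paths = None means "no explicit files given, fall back to the --path folder".
  final case class Config(
      path: String = "./TPCH",
      paths: Option[List[String]] = None,
      cores: Int = 4
  )

  // Consume the arguments as flag/value pairs, overriding the defaults.
  @scala.annotation.tailrec
  def parseArgs(args: List[String], config: Config = Config()): Config =
    args match {
      case "--path" :: value :: rest =>
        parseArgs(rest, config.copy(path = value))
      case "--paths" :: value :: rest =>
        parseArgs(rest, config.copy(paths = Some(value.split(",").toList)))
      case "--cores" :: value :: rest =>
        parseArgs(rest, config.copy(cores = value.toInt))
      case Nil =>
        config
      case unknown :: _ =>
        throw new IllegalArgumentException(s"Unknown or incomplete argument: $unknown")
    }

  // Assumption: the core count selects a local Spark master such as local[4].
  def master(config: Config): String = s"local[${config.cores}]"

  def main(args: Array[String]): Unit = {
    val config = parseArgs(args.toList)
    println(s"path=${config.path} paths=${config.paths} master=${master(config)}")
  }
}
```

Keeping `paths` as an `Option` makes the fallback explicit: when it is `None`, the caller would look for the dataset files inside the `--path` folder instead.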