项目作者: WistariaDing

项目描述 :
Here are some cases about Spark Python API, pyspark.
高级语言: Jupyter Notebook
项目地址: git://github.com/WistariaDing/SparkWithPython.git
创建时间: 2017-09-20T08:45:42Z
项目社区:https://github.com/WistariaDing/SparkWithPython

开源协议:

下载


SparkWithPython

Here are some cases about Spark Python API, pyspark.

How to install Spark on Windows 10?

1. Install Java 8

  1. Download Java 8 or later version from the official website.Then set system environment variabls
  2. variable: JAVA_HOME
  3. value: C:\Program Files\Java\jdk1.8.0_144 (Java Installation folder)
  4. variable: Path
  5. value: %JAVA_HOME%\bin
  6. Run command window, input java -version, check if it is installed successfully.

2. Install Scala:

  1. Download Scala from the website and install. Then set system environment variables.
  2. variable: Path
  3. value: C:\Program Files (x86)\scala\bin (Scala Installation folder)
  4. Run command window, input scala, check if the result is the version number of scala.

3. Install Spark:

  1. Download spark from https://spark.apache.org/downloads.html into a certain folder and extract.For example, D:\Spark\spark-2.2.0-bin-hadoop2.7. Then set system environmental variables:
  2. variable: Path
  3. value: D:\Spark\spark-2.2.0-bin-hadoop2.7\bin
  4. Run command window, cd to the folder 'D:\Spark\spark-2.2.0-bin-hadoop2.7\bin', then input spark-shell, check.