Project author: lynnlangit

Project description:
Companion to Learning Hadoop and Learning Spark courses on LinkedIn Learning
Primary language: HTML
Repository URL: git://github.com/lynnlangit/learning-hadoop-and-spark.git
Created: 2019-06-22T15:20:09Z
Project community: https://github.com/lynnlangit/learning-hadoop-and-spark

License: Apache License 2.0


Learning Hadoop and Spark

Contents

This is the companion repo to my LinkedIn Learning courses on Apache Hadoop and Apache Spark.

🐘 1. Learning Hadoop - link

  - In this course's demos I mostly use GCP Dataproc
  - for running Hadoop and associated libraries (e.g. Hive, Pig, Spark...) workloads

🌩️ 2. Cloud Hadoop: Scaling Apache Spark - link & link to content area in this repo

  - In this course's demos I use GCP Dataproc or AWS EMR, or
  - I use Databricks on AWS or on GCP

⛈️ 3. Azure Databricks Spark Essential Training - link & link to content area in this repo

  - In this course's demos I use Azure with Databricks
  - for scaling Apache Spark workloads

Other LinkedIn Learning Courses on Hadoop or Spark

There are about 10 courses on Hadoop/Spark topics on LinkedIn Learning. See the graphic below.
Learning Paths

  • Hadoop for Data Science Tips and Tricks - link
    • Set up Cloudera Environment
    • Working with Files in HDFS
    • Connecting to Hadoop Hive
    • Complex Data Structures in Hive
  • Spark courses - link
    • Various Topics - see screenshot below

[Image: LinkedInLearningSpark]