Project author: HadesArchitect

Project description:
Cassandra + Spark = ❤️ Machine Learning with Apache Spark & Cassandra
Primary language: Jupyter Notebook
Repository URL: git://github.com/HadesArchitect/CaSpark.git
Created: 2019-10-07T11:59:52Z
Project community: https://github.com/HadesArchitect/CaSpark

License:



Machine Learning with Apache Spark & Cassandra

Cassandra + Spark = ❤️

A hands-on lab delivered by the DataStax Developer Advocates team. Want to learn the awesomeness of distributed databases and computational systems? Jump in, watch the slides, and do the practical steps!

Slides

Labs

Reqs

  • git
  • docker
  • docker-compose
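
Before installing, it can help to confirm that the three tools above are on your PATH. The snippet below is a minimal sketch, not part of the lab itself; `check_tools` is a hypothetical helper name introduced here for illustration. It prints one line for each tool it cannot find and prints nothing when everything is available.

```shell
#!/bin/sh
# Hypothetical helper (not part of CaSpark): print one line per tool
# that cannot be found on PATH.
check_tools() {
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
        fi
    done
}

# Check the three requirements listed above.
check_tools git docker docker-compose
```

If the script prints nothing, all three requirements are installed and you can move on to the installation steps.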

Installation

  1. git clone https://github.com/HadesArchitect/CaSpark.git
  2. cd CaSpark
  3. docker-compose up -d

Usage

If you run Docker for Mac, Docker for Windows, or a similar installation, you may need to use a custom IP address instead of localhost.

Known Issues

In some cases, running the exercises may lead to memory issues, especially on weaker or non-Linux machines, because of Docker's memory limits. If you run into problems after the first few exercises, try cleaning up and starting again: docker-compose kill && docker-compose down && docker-compose up -d. You may need to repeat the steps of the notebook you were working on.
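
The clean-up sequence above can be wrapped in a small function so it is easy to rerun. This is a sketch under stated assumptions: `reset_lab` and the `COMPOSE` variable are hypothetical names introduced here, not something the lab ships. `COMPOSE` defaults to `docker-compose`; on installations that only provide Compose as a Docker plugin, setting it to `docker compose` should work the same way.

```shell
#!/bin/sh
# COMPOSE is a hypothetical override for the compose command;
# it defaults to the docker-compose binary used throughout this README.
COMPOSE="${COMPOSE:-docker-compose}"

# Hypothetical helper: stop the containers, remove them, and start the
# lab again in the background. Each step runs only if the previous one
# succeeded, mirroring the && chain in the text above.
reset_lab() {
    $COMPOSE kill && $COMPOSE down && $COMPOSE up -d
}
```

Run `reset_lab` from the CaSpark directory, then reopen the notebook and repeat the steps you had already completed.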