Project author: mraduldubey

Project description:
Getting started with Twitter data analysis.
Language: Jupyter Notebook
Project URL: git://github.com/mraduldubey/bostonbombing.git
Created: 2017-04-11T21:54:57Z
Project community: https://github.com/mraduldubey/bostonbombing

License: GNU General Public License v3.0



twitter-data-analysis

The dataset of tweets about the Boston Marathon bombing is cleaned and vectorized, features are extracted from it, and the tweets are clustered using KMeans. The clusters are then shown on a scatter graph produced by Multidimensional Scaling (MDS) of the cosine distances between tweet vectors. See the Jupyter Notebook “TweetClustering.ipynb”.
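
As a rough illustration of the cleaning, vectorizing and cosine-distance steps described above, here is a minimal standard-library sketch. It is not taken from the notebook: the sample tweets, the cleaning rules, the use of raw term counts as features, and the function names (clean, vectorize, cosine_distance, distance_matrix) are all assumptions made for the example. The notebook may weight terms differently and relies on its own tooling for KMeans and MDS, which this sketch does not cover.

```python
import math
import re
from collections import Counter

# Hypothetical sample tweets; the real dataset is not reproduced here.
TWEETS = [
    "Praying for Boston #bostonmarathon http://example.com/a",
    "Praying for everyone in Boston tonight",
    "Traffic update: road closed near the finish line",
]


def clean(text):
    """Lowercase, drop URLs and @mentions, keep alphabetic tokens only."""
    text = re.sub(r"https?://\S+|@\w+", " ", text.lower())
    return re.findall(r"[a-z]+", text)


def vectorize(tokens_list):
    """Build raw term-count vectors over a shared, sorted vocabulary."""
    vocab = sorted({tok for tokens in tokens_list for tok in tokens})
    vectors = []
    for tokens in tokens_list:
        counts = Counter(tokens)
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def cosine_distance(u, v):
    """Return 1 - cosine similarity; 1.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm if norm else 1.0


def distance_matrix(vectors):
    """Pairwise cosine distances; this is the kind of input MDS works from."""
    return [[cosine_distance(u, v) for v in vectors] for u in vectors]


vocab, vectors = vectorize([clean(t) for t in TWEETS])
dist = distance_matrix(vectors)
```

In this sketch the first two tweets share several words, so their distance is well below 1, while the third shares none with the first and sits at the maximum distance of 1. A pairwise matrix like dist is what a clustering or MDS step would then consume.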

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.


  • Extract the downloaded repository.

  • Run “jupyter notebook” in the terminal.

  • Open “TweetClustering.ipynb”.

KMeans (Clusters=25)

KMeans Result

Prerequisites