Project author: RameshBattu

Project description:
Exploratory Data Analysis with t-SNE on the DonorsChoose data set
Primary language: Jupyter Notebook
Repository: git://github.com/RameshBattu/EDA_TSNE_On_-DonorsChoose.git
Created: 2020-06-26T10:23:16Z
Project page: https://github.com/RameshBattu/EDA_TSNE_On_-DonorsChoose

License:



EDA_TSNE_On_-DonorsChoose

Exploratory data analysis (EDA) of the DonorsChoose data set, followed by t-SNE applied to its features.

Pipeline of the EDA:

  1. Reading the data
  2. Data analysis (univariate analysis of each feature, with plots and a summary)
  3. Text preprocessing of the text data
  4. Preparing data for models: vectorizing categorical data (one-hot encoding)
  5. Vectorizing text data (bag of words, TF-IDF, average W2V, TF-IDF weighted W2V)
  6. Vectorizing numerical features (StandardScaler)
  7. Merging all of the above features (categorical + text + numerical) using the hstack method
  8. Applying t-SNE
  9. Overall summary
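
The vectorizing, merging, and t-SNE steps can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: it assumes scikit-learn and SciPy, uses a few invented rows in place of the real DonorsChoose data, and the variable names (`states`, `essays`, `prices`) are hypothetical. Only TF-IDF is shown for the text features; the other text vectorizers would be stacked the same way.

```python
import numpy as np
from scipy.sparse import hstack
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.manifold import TSNE
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Toy stand-in rows (invented for illustration, not real DonorsChoose data).
states = np.array([["CA"], ["NY"], ["TX"], ["CA"], ["NY"], ["TX"], ["CA"], ["NY"]])
essays = [
    "students need books for reading",
    "our class needs new tablets",
    "help us buy science lab kits",
    "books and reading corner supplies",
    "tablets for math practice",
    "science kits for curious students",
    "art supplies for creative students",
    "music instruments for our class",
]
prices = np.array([[120.0], [450.5], [310.0], [89.9], [520.0], [275.25], [60.0], [799.0]])

# One-hot encode the categorical feature (returns a sparse matrix by default).
state_ohe = OneHotEncoder(handle_unknown="ignore").fit_transform(states)

# Vectorize the text feature with TF-IDF (also sparse).
essay_tfidf = TfidfVectorizer().fit_transform(essays)

# Standardize the numerical feature to zero mean and unit variance.
price_std = StandardScaler().fit_transform(prices)

# Merge all feature blocks column-wise into one sparse matrix.
features = hstack((state_ohe, essay_tfidf, price_std)).tocsr()

# Project to 2-D with t-SNE. Perplexity must be smaller than the number of
# samples, so it is set very low for this toy data. The matrix is densified
# here only because it is tiny.
embedding = TSNE(
    n_components=2, perplexity=3, init="random", random_state=0
).fit_transform(features.toarray())

print(features.shape, embedding.shape)
```

On the full data set, t-SNE is usually run on a sample of the rows, since its cost grows quickly with the number of points, and the resulting 2-D points are typically plotted and colored by the class label.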

Thank you.