Project author: harmanpreet93

Project description:
Representation Learning - RNN, GRU and Multi-headed attention transformers
Language: Jupyter Notebook
Project URL: git://github.com/harmanpreet93/recurrent_neural_networks.git
Created: 2020-03-05T21:49:09Z
Project community: https://github.com/harmanpreet93/recurrent_neural_networks

License:


Sequential Networks

Representation Learning - RNN, GRU and Multi-headed attention transformers

This repository contains code to understand the following concepts:

  • Implementations of Transformers, GRU and RNN from scratch
  • Evaluation of the different models by comparing metrics such as perplexity and loss
  • Hyperparameter search for the various experiments, with plots included in the notebook
  • Demonstration of the vanishing gradient problem in RNNs, and how the GRU mitigates it
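
Perplexity and loss, the two metrics mentioned above, are closely related: when the loss is the mean cross-entropy in nats, perplexity is its exponential. The snippet below is a minimal sketch of that relationship, not code taken from the notebooks. The function name `perplexity` and the sample probabilities are hypothetical and chosen only for illustration.

```python
import math


def perplexity(token_probs):
    """Perplexity from the probabilities a model assigned to the correct tokens.

    `token_probs` is a hypothetical input: one probability per position,
    each in (0, 1].
    """
    if not token_probs:
        raise ValueError("need at least one token probability")
    # Mean negative log-likelihood in nats, i.e. the cross-entropy loss.
    mean_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    # Perplexity is the exponential of that loss.
    return math.exp(mean_nll)


# Made-up example: the model gives the correct token probability 0.25
# at each of four positions, so it is as uncertain as a uniform choice
# among four tokens and the perplexity is approximately 4.
probs = [0.25, 0.25, 0.25, 0.25]
print(perplexity(probs))
```

Because the mapping is monotonic, ranking models by loss or by perplexity gives the same order. Perplexity is often easier to read, since it can be interpreted as the effective number of equally likely tokens the model is choosing between.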