My solution to an assignment on neural network initializers and optimizers. It covers some of the most popular approaches: Xavier and He initialization, and the SGD, Momentum, AdaGrad, AdaDelta, and Adam optimizers.
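
For orientation, here is a minimal sketch of a few of the techniques named above. It is not the code from this repository: it uses only the Python standard library, and every name in it (`xavier_std`, `he_std`, `init_weights`, `sgd_step`, `Adam`) is hypothetical. The hyperparameter defaults for Adam are the commonly cited ones and are an assumption here, as is the choice of the normal (rather than uniform) variant of each initializer.

```python
import math
import random


def xavier_std(fan_in, fan_out):
    """Xavier/Glorot normal: std = sqrt(2 / (fan_in + fan_out))."""
    return math.sqrt(2.0 / (fan_in + fan_out))


def he_std(fan_in):
    """He normal: std = sqrt(2 / fan_in), usually paired with ReLU."""
    return math.sqrt(2.0 / fan_in)


def init_weights(fan_in, fan_out, std, rng):
    """Draw a fan_in x fan_out weight matrix from N(0, std^2)."""
    return [[rng.gauss(0.0, std) for _ in range(fan_out)] for _ in range(fan_in)]


def sgd_step(params, grads, lr=0.1):
    """Plain SGD: p <- p - lr * g."""
    return [p - lr * g for p, g in zip(params, grads)]


class Adam:
    """Adam with bias-corrected first and second moment estimates."""

    def __init__(self, n_params, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.m = [0.0] * n_params  # running mean of gradients
        self.v = [0.0] * n_params  # running mean of squared gradients
        self.t = 0                 # step counter used for bias correction

    def step(self, params, grads):
        self.t += 1
        new_params = []
        for i, (p, g) in enumerate(zip(params, grads)):
            self.m[i] = self.beta1 * self.m[i] + (1 - self.beta1) * g
            self.v[i] = self.beta2 * self.v[i] + (1 - self.beta2) * g * g
            m_hat = self.m[i] / (1 - self.beta1 ** self.t)
            v_hat = self.v[i] / (1 - self.beta2 ** self.t)
            new_params.append(p - self.lr * m_hat / (math.sqrt(v_hat) + self.eps))
        return new_params
```

A layer could then be initialized with something like `init_weights(784, 128, he_std(784), random.Random(0))`, and a flat list of parameters updated with `Adam(len(params)).step(params, grads)`. Momentum, AdaGrad, and AdaDelta follow the same pattern of keeping per-parameter running state and are omitted to keep the sketch short.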