Project author: sachin-kmr

Project description:
Implementation of a Neural Image Captioning model using Keras with the Theano backend
Primary language: Jupyter Notebook
Project URL: git://github.com/sachin-kmr/Neural-Image-Captioning.git
Created: 2017-07-20T18:24:10Z
Project community: https://github.com/sachin-kmr/Neural-Image-Captioning

License:


Image Captioning

This repository contains an implementation of image captioning based on neural networks (i.e. CNN + RNN). The model first extracts image features with a CNN and then generates captions with an RNN. The CNN is VGG16 and the RNN is a standard LSTM.

Normal sampling and beam search are used to predict image captions.
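The difference between the two decoding strategies can be illustrated with a minimal sketch. It is not taken from the repository: it assumes that "normal sampling" means picking the most probable word at each step (greedy decoding), and it replaces the trained CNN + LSTM with a hypothetical lookup table, `toy_next_token_probs`. The function names, the `<start>`/`<end>` tokens, and the probabilities are all made up for illustration.

```python
import math


def toy_next_token_probs(prefix):
    """Hypothetical stand-in for the trained model: maps a partial caption
    to next-word probabilities. The real model would also condition on the
    image feature extracted by the CNN."""
    table = {
        ("<start>",): {"a": 0.6, "the": 0.4},
        ("<start>", "a"): {"dog": 0.5, "cat": 0.5},
        ("<start>", "the"): {"dog": 0.9, "cat": 0.1},
    }
    # Any prefix not in the table ends the caption.
    return table.get(tuple(prefix), {"<end>": 1.0})


def greedy_decode(next_probs, max_len=10):
    """Assumed meaning of "normal sampling": take the most probable word
    at every step and never revisit the choice."""
    seq = ["<start>"]
    for _ in range(max_len):
        probs = next_probs(seq)
        token = max(probs, key=probs.get)
        seq.append(token)
        if token == "<end>":
            break
    return seq


def beam_search(next_probs, beam_width=2, max_len=10):
    """Keep the beam_width best partial captions (by summed log-probability)
    at every step and return the best caption found."""
    beams = [(0.0, ["<start>"])]
    finished = []
    for _ in range(max_len):
        candidates = []
        for logp, seq in beams:
            for token, p in next_probs(seq).items():
                if p > 0:
                    candidates.append((logp + math.log(p), seq + [token]))
        # Sort by score, best first, and keep only the top beam_width.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for logp, seq in candidates[:beam_width]:
            if seq[-1] == "<end>":
                finished.append((logp, seq))
            else:
                beams.append((logp, seq))
        if not beams:
            break
    finished.extend(beams)  # captions cut off at max_len still count
    return max(finished, key=lambda c: c[0])[1]


greedy_caption = greedy_decode(toy_next_token_probs)
beam_caption = beam_search(toy_next_token_probs, beam_width=2)
```

In this toy table the greedy decoder commits to "a" because it is the most probable first word, and ends with a caption of total probability 0.30. Beam search keeps "the" alive as a second hypothesis and finds "the dog" with probability 0.36, which is why a wider beam can produce better captions at the cost of more model evaluations.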

The model was trained on the Flickr8k dataset.

Dependencies

  • Keras 2.0.7
  • Theano 0.9.0
  • NumPy
  • Pandas 0.20.3
  • Matplotlib
  • Pickle (part of the Python standard library)

References

[1] Karpathy et al. Deep Visual-Semantic Alignments for Generating Image Descriptions. CVPR 2015.

[2] Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan. Show and Tell: A Neural Image Caption Generator.

[3] CS231n: Convolutional Neural Networks for Visual Recognition. Instructors: Fei-Fei Li, Andrej Karpathy, Justin Johnson.