Used deep learning to train a CNN + RNN/LSTM on the MS-COCO dataset to automatically generate captions.