Caffe
DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.
It combines (1) atrous convolution to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks, (2) atrous spatial pyramid pooling to robustly segment objects at multiple scales with filters at multiple sampling rates and effective fields-of-views, and (3) densely connected conditional random fields (CRF) as post processing.
This distribution provides a publicly available implementation for the key model ingredients reported in our latest arXiv paper.
It also contains implementations for all methods reported in all our previous papers.
Please consult and consider citing the following papers:
@article{CP2016Deeplab,
title={DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs},
author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
journal={arXiv:1606.00915},
year={2016}
}
@inproceedings{CY2016Attention,
title={Attention to Scale: Scale-aware Semantic Image Segmentation},
author={Liang-Chieh Chen and Yi Yang and Jiang Wang and Wei Xu and Alan L Yuille},
booktitle={CVPR},
year={2016}
}
@inproceedings{CB2016Semantic,
title={Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform},
author={Liang-Chieh Chen and Jonathan T Barron and George Papandreou and Kevin Murphy and Alan L Yuille},
booktitle={CVPR},
year={2016}
}
@inproceedings{PC2015Weak,
title={Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation},
author={George Papandreou and Liang-Chieh Chen and Kevin Murphy and Alan L Yuille},
booktitle={ICCV},
year={2015}
}
@inproceedings{CP2015Semantic,
title={Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs},
author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
booktitle={ICLR},
year={2015}
}
Note that if you use the densecrf implementation, please consult and cite the following paper:
@inproceedings{KrahenbuhlK11,
title={Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials},
author={Philipp Kr{\"{a}}henb{\"{u}}hl and Vladlen Koltun},
booktitle={NIPS},
year={2011}
}
DeepLabv2 currently achieves 79.7% on the challenging PASCAL VOC 2012 semantic image segmentation task — see the leaderboard.
Please refer to our project website for details.
We have released several trained models and corresponding prototxt files at here. Please check it for more model details.
Check FAQ if you have some problems while using the code.
There are several variants of DeepLab. To begin with, we suggest DeepLab-LargeFOV, which has good performance and faster training time.
Suppose the codes are located at deeplab/code
Seyed Ali Mousavi has implemented a python version of run_pascal.sh (Thanks, Ali!). If you are more familiar with Python, you may want to take a look at this.