TY - STD TI - Yang Y, Li Z, Zhang L, Murphy C, Hoeve JV, Jiang H (2012) Local label descriptor for example based semantic image labeling In: Proc. of European Converence on Computer Vision (ECCV), 361–375. ID - ref1 ER - TY - STD TI - Sturgess P, Alahari K, Ladicky L, H.S.Torr P (2009) Combining appearance and structure from motion features for road scene understanding In: Proc. of British Machine Vision Conferenve (BMVC). ID - ref2 ER - TY - STD TI - Ladicky L, Sturgess P, Alahari K, Russell C, Torr PHS (2010) What, where and how many? Combining object detectors and CRFs In: Proc. of European Converence on Computer Vision (ECCV), 424–437. ID - ref3 ER - TY - STD TI - Zhang C, Wang L, Yang R (2010) Semantic segmentation of urban scenes using dense depth maps In: Proc. of European Converence on Computer Vision (ECCV), 708–721. ID - ref4 ER - TY - JOUR AU - Tighe, J. AU - Lazebnik, S. PY - 2013 DA - 2013// TI - Superparsing JO - Int J Comput Vision (IJCV) VL - 101 UR - https://doi.org/10.1007/s11263-012-0574-z DO - 10.1007/s11263-012-0574-z ID - Tighe2013 ER - TY - STD TI - Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks In: Proc. of NIPS, 1097–1105. ID - ref6 ER - TY - STD TI - Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation In: Proc. of IEEE Computer Vision and Pattern Recognition (CVPR), 3431–3440. ID - ref7 ER - TY - STD TI - Badrinarayanan V, Kendall A, Cipolla R (2015) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. arXiv: 1511.00561. ID - ref8 ER - TY - STD TI - Paszke A, Chaurasia A, Kim S, Culurciello E (2016) Enet: a deep neural network architecture for real-time semantic segmentation. arXiv: 1606.02147v1. ID - ref9 ER - TY - STD TI - Brust CA, Sickert S, Simon M, Rodner E, Denzler J (2015) Convolutional patch networks with spatial prior for road detection and urban scene understanding In: Proc. of VISAPP. ID - ref10 ER - TY - STD TI - He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. arXiv: 1512.03385. ID - ref11 ER - TY - STD TI - Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2015) Rethinking the inception architecture for computer vision. arXiv: 1512.00567. ID - ref12 ER - TY - STD TI - Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions In: Proc. of IEEE Computer Vision and Pattern Recognition (CVPR), 1–9. ID - ref13 ER - TY - STD TI - Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions. arXiv: 1511.07122. ID - ref14 ER - TY - STD TI - Kingma D, Ba J (2014) Adam: a method for stochastic optimization. arXiv: 1412.6980. ID - ref15 ER - TY - STD TI - Brostow GJ, Shotton J, Fauqueur J, Cipolla R (2008) Segmentation and recognition using structure from motion point clouds In: Proc. of European Converence on Computer Vision (ECCV), 44–57. ID - ref16 ER -