Joint training of generic CNN-CRF models with stochastic optimization

TitleJoint training of generic CNN-CRF models with stochastic optimization
Publication TypeConference Paper
Year of Publication2017
AuthorsKirillov, A, Schlesinger, D, Zheng, S, Savchynskyy, B, Torr, PHS, Rother, C
Conference NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISBN Number9783319541839

We propose a new CNN-CRF end-to-end learning framework, which is based on joint stochastic optimization with respect to both Convolutional Neural Network (CNN) and Conditional Random Field (CRF) parameters. While stochastic gradient descent is a standard technique for CNN training, it was not used for joint models so far. We show that our learning method is (i) general, i.e. it applies to arbitrary CNN and CRF architectures and potential functions; (ii) scalable, i.e. it has a low memory footprint and straightforwardly parallelizes on GPUs; (iii) easy in implementation. Additionally, the unified CNN-CRF optimization approach simplifies a potential hardware implementation. We empirically evaluate our method on the task of semantic labeling of body parts in depth images and show that it compares favorably to competing techniques.

Citation KeyKirillov2017