{"id":2968,"date":"2016-10-18T09:42:09","date_gmt":"2016-10-18T09:42:09","guid":{"rendered":"http:\/\/cvlab-dresden.de\/?page_id=2968"},"modified":"2016-10-18T10:58:36","modified_gmt":"2016-10-18T10:58:36","slug":"diverse-solutions-to-graphical-models","status":"publish","type":"page","link":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/","title":{"rendered":"Diverse Solutions to Graphical Models"},"content":{"rendered":"<p><strong>Motivation:<\/strong> In many application it is not sufficient to only compute the (approximate) MAP solution of a graphical model. Hence, Batra et al. proposed in ECCV 2012 the idea of computing multiple solutions with two criteria in mind: a) the solutions should have low energy; b) the solutions should be as different as possible (w.r.t. some diversity measure). Computing such \u201cM-best diverse solutions\u201d has since 2012 been used in many practical applications, e.g. to improve semantic segmentation. In our line of work on this topic we have improved inference techniques for computing \u201cM-best diverse solutions\u201d for different types of energies and different diversity measures. Our key insight has been that it is advantageous to compute the \u201cM-best diverse solutions\u201d jointly (see figures below).<\/p>\n<h3><span style=\"text-decoration: underline;\">Baseline Approach <\/span><\/h3>\n<p><strong><em>Sequential<\/em> inference of diverse labelings: <\/strong>D. Batra, P. Yadollahpour, A. Guzman-Rivera, and G. Shakhnarovich. Diverse M-Best Solutions in Markov Random Fields. ECCV, 2012. <a href=\"http:\/\/filebox.ece.vt.edu\/%7Edbatra\/papers\/MBestModes.pdf\">[paper]<\/a><\/p>\n<h3><a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2970 size-full\" src=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png\" alt=\"pic-sequential-segs\" width=\"1783\" height=\"252\" srcset=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png 1783w, https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs-300x42.png 300w, https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs-1024x145.png 1024w\" sizes=\"auto, (max-width: 1783px) 100vw, 1783px\" \/><\/a><\/h3>\n<h3><span style=\"text-decoration: underline;\">Our Approach<\/span><\/h3>\n<p><strong><em>Joint<\/em> inference of diverse labelings as MAP-inference problem:\u00a0[<a href=\"https:\/\/youtu.be\/rWBRrUM4oLA\">video spotlight<\/a>]: <\/strong><\/p>\n<p style=\"text-align: center;\"><a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-joint-segs.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft wp-image-2969 size-full\" src=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-joint-segs.png\" alt=\"pic-joint-segs\" width=\"1732\" height=\"318\" srcset=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-joint-segs.png 1732w, https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-joint-segs-300x55.png 300w, https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-joint-segs-1024x188.png 1024w\" sizes=\"auto, (max-width: 1732px) 100vw, 1732px\" \/><\/a><\/p>\n<h4><span style=\"text-decoration: underline;\">Our main results for node-wise diversities:<\/span><\/h4>\n<ul>\n<li><strong>Problem formulation and general algorithm:<\/strong> A. Kirillov,\u00a0B. Savchynskyy, D. Schlesinger, D. Vetrov,\u00a0\u00a0C. Rother. Inferring M-Best Diverse Labelings in a Single One. [<a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/HTML\/people\/alexander_kirillov\/publications\/mbest_iccv15.pdf\">pdf with supplementary material<\/a>][<a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/HTML\/people\/alexander_kirillov\/publications\/bib\/kirillov-iccv2015.bib\">bib<\/a>] [<a href=\"https:\/\/youtu.be\/rWBRrUM4oLA\">video spotlight<\/a>] <em>ICCV\u00a0<\/em>2015<\/li>\n<li><strong>Efficient algorithm for submodular energies:<\/strong> A. Kirillov, D. Schlesinger, D. Vetrov,\u00a0\u00a0C. Rother, B. Savchynskyy. M-Best-Diverse Labelings for Submodular Energies and Beyond. [<a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/HTML\/people\/alexander_kirillov\/publications\/mbest_submodular_nips15.pdf\">pdf with supplementary material<\/a>][<a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/HTML\/people\/alexander_kirillov\/publications\/bib\/kirillov-nips2015.bib\">bib<\/a>] <em>NIPS\u00a0<\/em>2015<\/li>\n<li><strong>Superefficient algorithm for <em>binary<\/em> submodular energies:<\/strong> A. Kirillov, A. Shekhovtsov, C. Rother, B. Savchynskyy. Joint M-Best-Diverse Labelings as a Parametric Submodular Minimization. [<a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/HTML\/people\/alexander_kirillov\/publications\/mbest-parametric.pdf\">pdf<\/a>][<a href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/HTML\/people\/alexander_kirillov\/publications\/mbest-parametric-sup.pdf\">supplementary material<\/a>] <em>NIPS <\/em>2016<\/li>\n<\/ul>\n<h3>A summary of these works, and other existing approaches can be found here: [<a href=\"https:\/\/computing.ece.vt.edu\/~cvpr16diversitytutorial\/CVPR_2016_Diversity_files\/CVPR16-tutorial-single-model.pdf\">Tutorial slides (CVPR 2016)<\/a>]<\/h3>\n<h4>See also: <a href=\"https:\/\/computing.ece.vt.edu\/%7Ecvpr16diversitytutorial\/\">Diversity meets Deep Networks \u2014 Inference, Ensemble Learning, and Applications<\/a> &#8211; CVPR&#8217;16 tutorial web-page.<\/h4>\n","protected":false},"excerpt":{"rendered":"<p>Motivation: In many application it is not sufficient to only compute the (approximate) MAP solution of a graphical model. Hence, Batra et al. proposed in ECCV 2012 the idea of computing multiple solutions with two criteria in mind: a) the solutions should have low energy; b) the solutions should be as different as possible (w.r.t. [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":688,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"inline_featured_image":false,"footnotes":""},"class_list":["post-2968","page","type-page","status-publish","hentry","post"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Diverse Solutions to Graphical Models - Computer Vision and Learning Lab Heidelberg<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Diverse Solutions to Graphical Models - Computer Vision and Learning Lab Heidelberg\" \/>\n<meta property=\"og:description\" content=\"Motivation: In many application it is not sufficient to only compute the (approximate) MAP solution of a graphical model. Hence, Batra et al. proposed in ECCV 2012 the idea of computing multiple solutions with two criteria in mind: a) the solutions should have low energy; b) the solutions should be as different as possible (w.r.t. [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/\" \/>\n<meta property=\"og:site_name\" content=\"Computer Vision and Learning Lab Heidelberg\" \/>\n<meta property=\"article:modified_time\" content=\"2016-10-18T10:58:36+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/\",\"url\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/\",\"name\":\"Diverse Solutions to Graphical Models - Computer Vision and Learning Lab Heidelberg\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/wp-content\\\/uploads\\\/2016\\\/10\\\/pic-sequential-segs.png\",\"datePublished\":\"2016-10-18T09:42:09+00:00\",\"dateModified\":\"2016-10-18T10:58:36+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/#primaryimage\",\"url\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/wp-content\\\/uploads\\\/2016\\\/10\\\/pic-sequential-segs.png\",\"contentUrl\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/wp-content\\\/uploads\\\/2016\\\/10\\\/pic-sequential-segs.png\",\"width\":1783,\"height\":252},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/diverse-solutions-to-graphical-models\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Research\",\"item\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Combinatorial Optimization\",\"item\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/research\\\/optimization\\\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"Diverse Solutions to Graphical Models\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/#website\",\"url\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/\",\"name\":\"Computer Vision and Learning Lab Heidelberg\",\"description\":\"Computer Vision and Learning Lab Heidelberg\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/hci.iwr.uni-heidelberg.de\\\/vislearn\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Diverse Solutions to Graphical Models - Computer Vision and Learning Lab Heidelberg","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/","og_locale":"en_US","og_type":"article","og_title":"Diverse Solutions to Graphical Models - Computer Vision and Learning Lab Heidelberg","og_description":"Motivation: In many application it is not sufficient to only compute the (approximate) MAP solution of a graphical model. Hence, Batra et al. proposed in ECCV 2012 the idea of computing multiple solutions with two criteria in mind: a) the solutions should have low energy; b) the solutions should be as different as possible (w.r.t. [&hellip;]","og_url":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/","og_site_name":"Computer Vision and Learning Lab Heidelberg","article_modified_time":"2016-10-18T10:58:36+00:00","og_image":[{"url":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png","type":"","width":"","height":""}],"twitter_misc":{"Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/","url":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/","name":"Diverse Solutions to Graphical Models - Computer Vision and Learning Lab Heidelberg","isPartOf":{"@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/#website"},"primaryImageOfPage":{"@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/#primaryimage"},"image":{"@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/#primaryimage"},"thumbnailUrl":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png","datePublished":"2016-10-18T09:42:09+00:00","dateModified":"2016-10-18T10:58:36+00:00","breadcrumb":{"@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/#primaryimage","url":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png","contentUrl":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-content\/uploads\/2016\/10\/pic-sequential-segs.png","width":1783,"height":252},{"@type":"BreadcrumbList","@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/diverse-solutions-to-graphical-models\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/"},{"@type":"ListItem","position":2,"name":"Research","item":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/"},{"@type":"ListItem","position":3,"name":"Combinatorial Optimization","item":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/research\/optimization\/"},{"@type":"ListItem","position":4,"name":"Diverse Solutions to Graphical Models"}]},{"@type":"WebSite","@id":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/#website","url":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/","name":"Computer Vision and Learning Lab Heidelberg","description":"Computer Vision and Learning Lab Heidelberg","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/pages\/2968","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/comments?post=2968"}],"version-history":[{"count":7,"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/pages\/2968\/revisions"}],"predecessor-version":[{"id":2979,"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/pages\/2968\/revisions\/2979"}],"up":[{"embeddable":true,"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/pages\/688"}],"wp:attachment":[{"href":"https:\/\/hci.iwr.uni-heidelberg.de\/vislearn\/wp-json\/wp\/v2\/media?parent=2968"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}