Network Flow Algorithms for Structured Sparsity

pefav - Julien Mairal

Découvre YouScribe en t'inscrivant gratuitement

Je m'inscris

Obtenez un accès à la bibliothèque pour le consulter en ligne
En savoir plus

9 pages

English

Obtenez un accès à la bibliothèque pour le consulter en ligne
En savoir plus

A propos
Informations
Extrait

Description

Network Flow Algorithms for Structured Sparsity Julien Mairal? INRIA - Willow Project-Team† Rodolphe Jenatton? INRIA - Willow Project-Team† Guillaume Obozinski INRIA - Willow Project-Team† Francis Bach INRIA - Willow Project-Team† Abstract We consider a class of learning problems that involve a structured sparsity- inducing norm defined as the sum of ∞-norms over groups of variables. Whereas a lot of effort has been put in developing fast optimization methods when the groups are disjoint or embedded in a specific hierarchical structure, we address here the case of general overlapping groups. To this end, we show that the cor- responding optimization problem is related to network flow optimization. More precisely, the proximal problem associated with the norm we consider is dual to a quadratic min-cost flow problem. We propose an efficient procedure which com- putes its solution exactly in polynomial time. Our algorithm scales up to millions of variables, and opens up a whole new range of applications for structured sparse models. We present several experiments on image and video data, demonstrating the applicability and scalability of our approach for various problems. 1 Introduction Sparse linear models have become a popular framework for dealing with various unsupervised and supervised tasks in machine learning and signal processing.

optimization problem

when restricted

flow problem

can still

problem

sparsity- inducing norms

structured sparse

has recently

Sujets

Optimization problem

Flow network

Problem

Informations

Publié par	pefav
Nombre de lectures	22
Langue	English

Extrait

Network Flow Algorithms for Structured Sparsity

∗ Julien Mairal † INRIA  Willow ProjectTeam julien.mairal@inria.fr

Guillaume Obozinski † INRIA  Willow ProjectTeam guillaume.obozinski@inria.fr

∗ Rodolphe Jenatton † INRIA  Willow ProjectTeam rodolphe.jenatton@inria.fr

Abstract

Francis Bach † INRIA  Willow ProjectTeam francis.bach@inria.fr

We consider a class of learning problems that involve a structured sparsity inducing norm deﬁned as the sum ofℓ∞norms over groups of variables. Whereas a lot of effort has been put in developing fast optimization methods when the groups are disjoint or embedded in a speciﬁc hierarchical structure, we address here the case of general overlapping groups. To this end, we show that the cor responding optimization problem is related to network ﬂow optimization. More precisely, the proximal problem associated with the norm we consider is dual to a quadratic mincost ﬂow problempropose an efﬁcient procedure which com. We putes its solution exactly in polynomial time. Our algorithm scales up to millions of variables, and opens up a whole new range of applications for structured sparse models. We present several experiments on image and video data, demonstrating the applicability and scalability of our approach for various problems.

Introduction

Sparse linear models have become a popular framework for dealing with various unsupervised and supervised tasks in machine learning and signal processing. In such models, linear combinations of small sets of variables are selected to describe the data. Regularization by theℓ1norm has emerged as a powerful tool for addressing this combinatorial variable selection problem, relying on both a welldeveloped theory (see [1] and references therein) and efﬁcient algorithms [2, 3, 4].

Theℓ1norm primarily encourages sparse solutions, regardless of the potential structural relation ships (e.g., spatial, temporal or hierarchical) existing between the variables. Much effort has recently been devoted to designing sparsityinducing regularizations capable of encoding higherorder infor mation about allowed patterns of nonzero coefﬁcients [5, 6, 7, 8, 9, 10], with successful applications in bioinformatics [6, 11], topic modeling [12] and computer vision [9, 10].

By considering sums of norms of appropriate subsets, orgroups, of variables, these regulariza tions control the sparsity patterns of the solutions. The underlying optimization problem is usually difﬁcult, in part because it involves nonsmooth components. Proximal methods have proven to be effective in this context, essentially because of their fast convergence rates and their scalability [3, 4]. While the settings where the penalized groups of variables do not overlap or are embedded in a tree shaped hierarchy [12] have already been studied, regularizations with general overlapping groups have, to the best of our knowledge, never been addressed with proximal methods. This paper makes the following contributions: −It shows that theproximal operatorassociated with the structured norm we consider can be ∗ Contributed equally. † Laboratoire d’Informatique de l’Ecole Normale Supe´rieure (INRIA/ENS/CNRS UMR 8548)