Network Flow Algorithms for Structured Sparsity
9 pages
English

Découvre YouScribe en t'inscrivant gratuitement

Je m'inscris

Network Flow Algorithms for Structured Sparsity

-

Découvre YouScribe en t'inscrivant gratuitement

Je m'inscris
Obtenez un accès à la bibliothèque pour le consulter en ligne
En savoir plus
9 pages
English
Obtenez un accès à la bibliothèque pour le consulter en ligne
En savoir plus

Description

Network Flow Algorithms for Structured Sparsity Julien Mairal? INRIA - Willow Project-Team† Rodolphe Jenatton? INRIA - Willow Project-Team† Guillaume Obozinski INRIA - Willow Project-Team† Francis Bach INRIA - Willow Project-Team† Abstract We consider a class of learning problems that involve a structured sparsity- inducing norm defined as the sum of ∞-norms over groups of variables. Whereas a lot of effort has been put in developing fast optimization methods when the groups are disjoint or embedded in a specific hierarchical structure, we address here the case of general overlapping groups. To this end, we show that the cor- responding optimization problem is related to network flow optimization. More precisely, the proximal problem associated with the norm we consider is dual to a quadratic min-cost flow problem. We propose an efficient procedure which com- putes its solution exactly in polynomial time. Our algorithm scales up to millions of variables, and opens up a whole new range of applications for structured sparse models. We present several experiments on image and video data, demonstrating the applicability and scalability of our approach for various problems. 1 Introduction Sparse linear models have become a popular framework for dealing with various unsupervised and supervised tasks in machine learning and signal processing.

  • optimization problem

  • when restricted

  • flow problem

  • can still

  • arc

  • problem

  • sparsity- inducing norms

  • structured sparse

  • has recently


Sujets

Informations

Publié par
Nombre de lectures 22
Langue English

Extrait

1
Network Flow Algorithms for Structured Sparsity
Julien Mairal INRIA  Willow ProjectTeam julien.mairal@inria.fr
Guillaume Obozinski INRIA  Willow ProjectTeam guillaume.obozinski@inria.fr
Rodolphe Jenatton INRIA  Willow ProjectTeam rodolphe.jenatton@inria.fr
Abstract
Francis Bach INRIA  Willow ProjectTeam francis.bach@inria.fr
We consider a class of learning problems that involve a structured sparsity inducing norm defined as the sum ofnorms over groups of variables. Whereas a lot of effort has been put in developing fast optimization methods when the groups are disjoint or embedded in a specific hierarchical structure, we address here the case of general overlapping groups. To this end, we show that the cor responding optimization problem is related to network flow optimization. More precisely, the proximal problem associated with the norm we consider is dual to a quadratic mincost flow problempropose an efficient procedure which com. We putes its solution exactly in polynomial time. Our algorithm scales up to millions of variables, and opens up a whole new range of applications for structured sparse models. We present several experiments on image and video data, demonstrating the applicability and scalability of our approach for various problems.
Introduction
Sparse linear models have become a popular framework for dealing with various unsupervised and supervised tasks in machine learning and signal processing. In such models, linear combinations of small sets of variables are selected to describe the data. Regularization by the1norm has emerged as a powerful tool for addressing this combinatorial variable selection problem, relying on both a welldeveloped theory (see [1] and references therein) and efficient algorithms [2, 3, 4].
The1norm primarily encourages sparse solutions, regardless of the potential structural relation ships (e.g., spatial, temporal or hierarchical) existing between the variables. Much effort has recently been devoted to designing sparsityinducing regularizations capable of encoding higherorder infor mation about allowed patterns of nonzero coefficients [5, 6, 7, 8, 9, 10], with successful applications in bioinformatics [6, 11], topic modeling [12] and computer vision [9, 10].
By considering sums of norms of appropriate subsets, orgroups, of variables, these regulariza tions control the sparsity patterns of the solutions. The underlying optimization problem is usually difficult, in part because it involves nonsmooth components. Proximal methods have proven to be effective in this context, essentially because of their fast convergence rates and their scalability [3, 4]. While the settings where the penalized groups of variables do not overlap or are embedded in a tree shaped hierarchy [12] have already been studied, regularizations with general overlapping groups have, to the best of our knowledge, never been addressed with proximal methods. This paper makes the following contributions: It shows that theproximal operatorassociated with the structured norm we consider can be Contributed equally. Laboratoire d’Informatique de l’Ecole Normale Supe´rieure (INRIA/ENS/CNRS UMR 8548)
1
  • Univers Univers
  • Ebooks Ebooks
  • Livres audio Livres audio
  • Presse Presse
  • Podcasts Podcasts
  • BD BD
  • Documents Documents