Interactive and life-long learning for identification and categorization tasks [Elektronische Ressource] / Stephan Kirstein. [Technische Universität Ilmenau]

technische_universitat_ilmenau

Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres

163 pages

English

Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres

A propos
Informations
Extrait

Description

Sujets

Informatik

Informations

Publié par	technische_universitat_ilmenau
Publié le	01 janvier 2010
Nombre de lectures	11
Langue	English
Poids de l'ouvrage	5 Mo

Extrait

Stephan Kirstein
Interactive and life-long learning for identification
and categorization tasks Interactive and life-long learning
for identification and
categorization tasks
Stephan Kirstein
Universitätsverlag Ilmenau
2010Impressum
Bibliografische Information der Deutschen Nationalbibliothek
Die Deutsche Nationalbibliothek verzeichnet diese Publikation in der
Deutschen Nationalbibliografie; detaillierte bibliografische Angaben sind
im Internet über http://dnb.d-nb.de abrufbar.
Diese Arbeit hat der Fakultät für Informatik und Automatisierung der
Technischen Universität Ilmenau als Dissertation vorgelegen.
Tag der Einreichung: 24. September 2009
1. Gutachter: Prof. Dr. Horst-Michael Groß
(Technische Universität Ilmenau)
2. Gutachter: Prof. Dr. Edgar Körner
(Honda Research Institute Europe GmbH)
3. Gutachter: Prof. Dr. Martin Riedmiller
(Albert-Ludwigs-Universität Freiburg)
Tag der Verteidigung: 15. April 2010
Technische Universität Ilmenau/Universitätsbibliothek
Universitätsverlag Ilmenau
Postfach 10 05 65
98684 Ilmenau
www.tu-ilmenau.de/universitaetsverlag
Herstellung und Auslieferung
Verlagshaus Monsenstein und Vannerdat OHG
Am Hawerkamp 31
48155 Münster
www.mv-verlag.de
ISBN 978-3-939473-80-0 (Druckausgabe)
urn:nbn:de:gbv:ilm1-2010000131
Titelfoto: photocase.com v
Abstract
The presented thesis focuses on life-long and interactive learning for
identiﬁcationandcategorizationtasks. Thefundamentalandstilllargely
unsolved problem of life-long learning with artiﬁcial neural networks is
the so-called “stability-plasticity dilemma”. To achieve plasticity the
learningapproachmustbeabletocontinuouslyintegratenewlyacquired
knowledge into its internal representation, while for the stability the
conservation of this knowledge is required.
To achieve interactive learning for diﬃcult recognition problems the
separation into an intermediate and fast learning short-term memory
(STM) and a slower learning long-term memory (LTM) is proposed.
For the incremental build up of the STM a similarity-based one-shot
learning method was developed. Furthermore two diﬀerent memory
consolidation algorithms were proposed enabling the incremental learn-
ing of LTM representations for various identiﬁcation and categorization
tasks. For identiﬁcation tasks diﬀerent modiﬁcations to the Learning
Vector Quantization (LVQ) network architecture are proposed. The
major changes of the LVQ approach are an error-based node insertion
rule and a node dependent learning rate. Both extensions enable incre-
mental and life-long learning for identiﬁcation problems. For life-long
learning of categories this extended LVQ model was combined with a
forward-feature selection method. This selection method incrementally
extracts small sets of category-speciﬁc features and therefore allows for
a compact representation of categories.
In several interactive and oﬄine recognition experiments the eﬃciency
and performance of the proposed learning methods could be shown for
diﬃcult visual recognition problems. Additionally an active vision sys-
tem was developed that utilizes the proposed learning methods. This
integrated system enables learning of visual representations based on
natural and complex-shaped objects presented in hand.vi
Zusammenfassung
Die vorgelegte Arbeit besch¨aftigt sich mit lebenslangem und interak-
tivem Lernen von Identiﬁkations- und Kategorisierungsaufgaben. Das
grundlegende Problem von lebenslangem Lernen mit Hilfe kunstli¨ cher
neuronalerNetzeistdassogenannte“Stabilit¨ats-Plastizit¨atsDilemma”.
Um die Plastizit¨at zu gew¨ahrleisten muss ein neuronales Netz in der
Lage sein, neues Wissen zu erwerben. Zus¨atzlich muss fur¨ die Stabilit¨at
dieses Wissen vor dem Verlust bewahrt werden.
Um die F¨ahigkeit des interaktiven Lernens zu gew¨ahrleisten wurde die
Aufteilung in eine tempor¨ares und schnell lernendes Kurzzeitged¨achtnis
und ein langsamer lernendes Langzeitged¨achtnis vorgeschlagen.
Zum inkrementellen Aufbau des Kurzzeitged¨achtnisses wurde ein
ahnlic¨ hkeitsbasiertes und sogenanntes “one-shot learning” Verfahren
vorgeschlagen. Fur¨ die Konsolidierung dieser Repr¨asentation in das
Langzeitged¨achtnis wurden zwei verschiedene Methoden betrachtet,
die fur¨ das L¨osen von nahezu beliebigen Identiﬁkations- und Kate-
gorisierungsaufgaben geeignet sind. Um diese Lernf¨ahigkeit fur¨ Iden-
tiﬁkationsprobleme zu erreichen, wurden verschiedenen Erweiterungen
fur¨ die sogenannte “Learning Vector Quantization (LVQ)” Netzwerk-
¨architektur vorgeschlagen. Die wichtigstens Anderungen sind die Ein-
fuhrung¨ einer fehlerbasierten Knoten-Einfugeregel¨ und die Verwendung
einer knotenspeziﬁschen Lernrate. Beide Erweiterungen erm¨oglichen
inkrementelles und lebenslanges Lernen fur¨ Identiﬁkationsaufgaben. Im
Gegensatzdazuwurdefur¨ dasLernenvonKategoriendiesesmodiﬁzierte
LVQ Netzwerkmodell mit einem Verfahren zur Merkmalsselektion kom-
biniert. Dieses Verfahren selektiert kategoriespeziﬁsche Merkmalsdi-
mensionen und ist daher geeignet um besonders kompakte Kategori-
erepr¨asentationen zu lernen.
In verschiedenen Erkennungsexperimenten konnte die Eﬃzienz und
die Leistungsf¨ahigkeit der vorgeschlagenen Verfahren belegt wer-
den. Zus¨atzlich wurde ein integriertes Erkennungssystem entwickelt,
welches die vorgeschlagenen Verfahren einsetzt. Dieses Gesamtsys-
tem erm¨oglicht das interactive Lernen visueller Repr¨asentationen von
beliebigen Gegenst¨anden, welche, in der Hand gehalten, dem System
gezeigt werden.vii
Acknowledgments
ThepresentedworkwasajointprojectbetweentheHondaResearchIn-
stituteEuropeGmbH(HRI-EU)inOﬀenbachandtheNeuroinformatics
and Cognitive Robotics Lab at the Ilmenau University of Technology. I
thank the heads of both institutes Prof. Dr. Edgar K¨orner and Prof.
Dr. Horst-Michael Gross for giving me the possibility to work on this
interesting research topic.
IalsowanttothankmyprojectleaderandsupervisorDr. HeikoWersing
for many fruitful discussions, valuable advise and help, but also for
his constructive criticism. Additionally I would like to thank him and
StephanHaslerforprovidingtheshapefeatureextractionmethodsused
in this dissertation.
Many thanks go also to my colleagues at both institutes. Speciﬁcally
I want to thank the team members of the Learning of Sensory Repre-
sentations group at the Honda Research Institute Europe GmbH: Dr.
Heiko Wersing, Dr. Mathias Franzius, Alexander Denecke, Stephan
Hasler and Samuel John for the productive collaboration and the good
working atmosphere.
Furthermore, I want to thank all readers of this manuscript, includ-
ing Prof. Dr. Horst-Michael Gross, Dr. Heiko Wersing, Dr. Martina
Hasenj¨agerandDr. TobiasRodemann,fortheircommentsandvaluable
hints for the improvement of this work.
Finally I would like to thank all my friends for their support in the
hot phases of this dissertation. I also want to thank them for moti-
vating me to continue with the work on this thesis. Many thanks go
also to my mother, my sister and my stepfather for their support and
encouragement.CONTENTS ix
Contents
1 Introduction 1
1.1 Problem Description . . . . . . . . . . . . . . . . . . . . 3
1.2 Deﬁnition of Common Terms . . . . . . . . . . . . . . . 5
1.3 Inspiration from Biology . . . . . . . . . . . . . . . . . . 7
1.3.1 Memory and Learning in the Human Brain . . . . 7
1.3.2 Consolidation . . . . . . . . . . . . . . . 8
1.3.3 The Cholinergic System . . . . . . . . . . . . . . 10
1.4 The Scope of this Dissertation . . . . . . . . . . . . . . . 11
1.5 Structure of this Thesis . . . . . . . . . . . . . . . . . . . 12
2 Life-Long Learning with Standard Neural Network Ar-
chitectures 15
2.1 Single Layer Perceptron . . . . . . . . . . . . . . . . . . 16
2.2 Multi Layer P . . . . . . . . . . . . . . . . . . . 17
2.3 Cascade Correlation . . . . . . . . . . . . . . . . . . . . 19
2.4 Vector Quantization Networks . . . . . . . . . . . . . . . 21
2.5 Radial Basis Function Networks . . . . . . . . . . . . . . 24
2.6 Fuzzy ARTMAP . . . . . . . . . . . . . . . . . . . . . . 25
2.7 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . 28x CONTENTS
3 Life-Long Learning for Identiﬁcation Tasks 31
3.1 Related Work . . . . . . . . . . . . . . . . . . . . . . . . 32
3.1.1 Online Learning and Man-machine Interaction . . 32
3.1.2 Network Architectures for Incremental and Life-
Long Learning . . . . . . . . . . . . . . . . . . . . 34
3.2 Life-Long Learning Model for Object Identiﬁcation . . . 35
3.2.1 Online Vector Quantization to Build a Short-
Term Memory . . . . . . . . . . . . . . . . . . . . 37
3.2.2 Incremental LVQ to Build a Long-Term Memory. 39
3.3 Experimental Results . . . . . . . . . . . . . . . . . . . . 44
3.3.1 Experimental Setup . . . . . . . . . . . . . . . . . 45
3.3.2 Feature Extraction . . . . . . . . . . . . . . . . . 46
3.3.3 Online Vector Quantization to Build a Short-
Term Memory . . . . . . . . . . . . . . . . . . . . 48
3.3.4 Incremental LVQ to Build a