La lecture à portée de main
Découvre YouScribe en t'inscrivant gratuitement
Je m'inscrisDécouvre YouScribe en t'inscrivant gratuitement
Je m'inscrisDescription
Informations
Publié par | Thesee |
Nombre de lectures | 41 |
Langue | English |
Extrait
AVERTISSEMENT
Ce document est le fruit d'un long travail approuvé par le
jury de soutenance et mis à disposition de l'ensemble de la
communauté universitaire élargie.
Il est soumis à la propriété intellectuelle de l'auteur. Ceci
implique une obligation de citation et de référencement lors
de l’utilisation de ce document.
Toute contrefaçon, plagiat, reproduction illicite encourt une
poursuite pénale.
➢ Contact SCD Nancy 1 : theses.sciences@scd.uhp-nancy.fr
LIENS
Code de la Propriété Intellectuelle. articles L 122. 4
Code de la Propriété Intellectuelle. articles L 335.2- L 335.10
http://www.cfcopies.com/V2/leg/leg_droi.php
http://www.culture.gouv.fr/culture/infos-pratiques/droits/protection.htm Henri Poincaré University – Nancy 1
University of West Bohemia in Pilsen
Doctoral Dissertation
under Joint Supervision
2007 Pavel KRÁL Université Henri Poincaré – Nancy 1
Département de formation doctoral en informatique
University of West Bohemia in Pilsen
Faculty of Applied Sciences
AUTOMATIC RECOGNITION
OF DIALOGUE ACTS
by
Pavel KRÁL
A dissertation under joint supervision submitted in partial
fulfillment of the requirements for the degree of Doctor of
Philosophy in “Computer Science” and “Computer Science
and Engineering”
Presented and defended publicly on November 12, 2007 before the board of examiners.
Régine ANDRÉ-OBRECHT reviewer Université Paul Sabatier
Lud ěk MÜLLER University of West Bohemia
Jean-Paul HATON examiner Université Henri Poincaré
Václav MATOUŠEK University
Jan NOUZA examiner Technical University of Liberec
Christophe CERISARA examiner CNRS Loria
Yves LAPRIE supervisor CNRS Lor
Jana KLE ČKOVÁ visor University of West Bohemia
Nancy / Pilsen 2007
Université Henri Poincaré – Nancy 1
Département de formation doctoral en informatique
Université de Bohême de l’Ouest à Plze ň
Faculté des Sciences Appliquées
RECONNAISSANCE AUTOMATIQUE
DES ACTES DE DIALOGUE
par
Pavel KRÁL
Thèse en cotutelle présentée pour l’obtention du grade de
Docteur de l’Université Henri Poincaré – Nancy 1 (spécialité
Informatique) et de l’Université de Bohême de l’Ouest
(spécialité Informatique et ingénierie)
Soutenue publiquement le 12 novembre 2007 devant la commision d’examen.
Régine ANDRÉ-OBRECHT reviewer Université Paul Sabatier
Lud ěk MÜLLER University of West Bohemia
Jean-Paul HATON examiner Université Henri Poincaré
Václav MATOUŠEK University
Jan NOUZA examiner Technical University of Liberec
Christophe CERISARA examiner CNRS Loria
Yves LAPRIE supervisor CNRS Lor
Jana KLE ČKOVÁ visor University of West Bohemia
Nancy / Plze ň 2007
Université Henri Poincaré – Nancy 1
Département de formation doctoral en informatique
Západočeská univerzita v Plzni
Fakulta aplikovaných v ěd
AUTOMATICKÉ ROZPOZNÁVÁNÍ
DIALOGOVÝCH AKT Ů
Ing. Pavel KRÁL
Diserta ční práce pod dvojím vedením k získání
akademického titulu doktor v oboru „Informatika“ a
„Informatika a výpo četní technika“
P ředneseno a obhájeno ve řejn ě před zkušební komisí dne 12. listopadu 2007.
Régine ANDRÉ-OBRECHT Université Paul Sabatier
Lud ěk MÜLLER KKY Z ČU v Plzni
Jean-Paul HATON Université Henri Poincaré
Václav MATOUŠEK KIV Z ČU v Plzni
Jan NOUZA Technická univerzita v Libereci
Christophe CERISARA CNRS Loria
Yves LAPRIE ia
Jana KLE ČKOVÁ KIV Z ČU v Plzni
Nancy / Plze ň 2007
Declaration
Isubmitthisdoctoralthesisforreviewanddefenseinpartialfulfillmentoftherequirements
for the degree of Doctor of Philosophy at the Henri Poincar´e University in Nancy, France
and at the University of West Bohemia in Pilsen, Czech Republic.
I declare that this doctoral thesis is completely my own work and that I used only the
cited sources.
Pilsen, September 4, 2007 Pavel Kr´al
iAcknowledgements
I wish to express my thanks to Mrs. Jana Kleˇckov´a and to Mr. Yves Laprie, my thesis
supervisors, for their leadership during my PhD studies.
Special thanks belong to Mr. Christophe Cerisara for his support and advices during my
studies and for his valuable remarks during writing this document.
I would like also thank to my family and to my partner Dana Stejskalov´a for their support
and understanding during this studies.
My thank belong also to Mr. Michel Orlhac for his corrections of English language.
Finally, I wish to thank my colleagues from the Parole team, particularly to Emmanuel
Didiot and to Joseph Razik, for their help to work in a friendly atmosphere.
iiThis work has been partly supported by the European integrated project Amigo (IST-
004182), a project partly funded by the European Commission, and by the Ministry
of Education, Youth and Sports of Czech republic grant (NPV II-2C06009).
iiiAbstract
This thesis deals with automatic Dialogue Act (DA) recognition in Czech and in French.
Dialogue acts are sentence-level labels that represent different states of a dialogue, such
as questions, statements, hesitations, etc.
The first main contribution of this work is to propose and compare several approaches
that recognize dialogue acts based on three types of information: lexical, prosodic and
word positions. These approaches are tested on the Czech Railways corpus that contains
human-human dialogues, which are transcribed both manually and with an automatic
speech recognizer for comparison. The experimental results confirm that every type of
feature (lexical, prosodic and word positions) bring relevant and somewhat complemen-
tary information. The proposed methods that take into account word positions are espe-
cially interesting, as they bring global information about the structure of a sentence, at
the opposite of traditional n-gram models that only capture local cues. We propose three
approaches to model this information: the first one, the multiscale position approach, ex-
ploits a description of the sentence at several levels and smoothes the probabilities across
these levels. The second one, the non-linear merging approach, models the dependency
between the words in the sentence and their position with a Multilayer Perceptron. The
third one, the best position approach, exploits the Bayesian framework and assumes con-
ditional independence between the words and their position to infer the probability of the
dialogue act. We also propose a solution to the lack of training data problem, which is a
common issue in DA recognition systems. We develop the clustered unigram model, which
clusters the words in the sentences into several groups by maximizing mutual information
between two neighbor word classes. We show that this method is especially efficient when
the DA corpus is small. When word sequences are estimated from a speech recognizer, the
resulting decrease of accuracy of all proposed approaches is very small (about 3 %), which
confirms the capability to perform well in real applications.
One of the main issue in the domain of automatic dialogue act recognition concerns the
design of a fast and cheap method to label new corpora. The next main contribution is to
applyageneralsemi-supervisedtrainingapproachbasedontheExpectationMaximization
algorithm to the task of labeling a new corpus with pre-defined DAs. We further propose
to filter out incorrect examples with two confidence measures, the maximum a posteriori
probability and the a posteriori probability difference methods. Experimental results show
that the proposed method is an interesting approach to create new dialogue act corpora
at low costs.Resum´e
Ce m´emoire concerne la reconnaissance automatique des Actes de Dialogues (AD) en
tch`eque et en fran¸cais. Les actes de dialogues sont des unit´es au niveau de la phrase
qui repr´esentent les diff´erents ´etats d’un dialogue, comme par exemple les questions, les
affirmations, les h´esitations, etc.
La premi`ere contribution de ce travail est de proposer et comparer plusieurs approches
de reconnaissance des actes de dialogues qui sont bas´ees sur trois types d’informations :
lexical, prosodique et relative a` la position des mots dans une phrase. Ces approches
ont