DRAGON SYSTEMS RESOURCE MANAGEMENT BENCHMARK RESULTS FEBRUARY 1991

Odwou - James Baker; Janet Baker; Pard Bamberg; Larry Gillick; Lori Lamel; Robert Roth; Francesco Scattone; Dean Sturtevant; Ousmane Ba; Richard Benedict

Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres

6 pages

English

Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres

A propos
Informations
Extrait

Description

Informations

Publié par	Odwou
Nombre de lectures	29
Langue	English

Extrait

preliminary
in
training
complete
necessary
a
the
which
been
are
Warfare
RESULTS
workshop.
energy
start
19911
system
overall
signal
Baker,
Research
computed
recognition
Baker,
slrategies,
Pard
system
Bamberg,
benchmark
Larry
characteristics
Gillick,
presenting
Lori
dependent
LameI,
detail
Robert
training.
Roth,
This
Francesco
Advanced
every
Projects
used
that
Sturtevant,
results
Ousmane
has
Ba,
system
Richard
Next,
Benedict
dependent,
3
a
input
Systems'
region
1.
the
~e-estimate
covering
results,
comlxments
test
spectral
test
7
speaker-
parameters
Section
eight
described
are
speaker-dependent
(617)
"context"
there
in
simple:
data,
(617)
sponsored
quite
modifications
representation
term
signal
representation
the
A
sampled
Space
is
continuous
speech
to
board.
Systems
results
different
obtained
task
TMS32010-based
this
PC.
a
486-based
algorithm
a
speech
running
Systems
(mammography
algorithm
Maaaagernent
post-processing
task
Resource
task.
Dragon
recognizer.
Conlract
performance
this
conceptual
transfer
real-time
evaluating
near
a
capable
these
to
material.
demonstrated
evaluation
speaker-dependent
data,
system
development
[1,2,3].
spellings
meeting
results
presented
fixed.
system
is
recognition
the
speech
is
continuous
re-estimated
Systems'
"phoneme-in-context"
Dragon
where
available.
processing
are
a
evaluation
areas
using
training
a
in
results
respall
comparative
Defense
Phonetic
a
thus
primary
(PELs).
aims.
time,
long
system
with
corresponding
line
through
differences.
a
are
was
monitored
material
system
constitute
speech
a
standard
lest
Naval
evaluation
changes
February
addition
curve,
been
learning
goal
representation
training
steep
described.
a
the
still
results.
befieve
evaluate
speaker-dependent
modifications
Since
modification
paper.
given.
this
is
configurations
recognition
system
continuous
training
speaker-
compare
1990
fundamental
Dragon
used
overview
concepaml
a
in
First,
speaker-dependent
Management
the
the
hardware,
algorithm
speaker,
work
the
a
changes
report
responding
paper
when
set
system
DRAGON SYSTEMS RESOURCE MANAGEMENT
BENCHMARK FEBRUARY
James Janet
Scattone, Dean
Dragon Systems, Inc.
320 Nevada Street
Newton, Massachusetts 02160
DRAGON@A.ISI.EDU
TEL: 965-5200
FAX: 527-0372
ABSTRACT
Recognition are given for the RM1 In this paper we present preliminary at Dragon Systems on
the Resource benchmark The basic units of data and for Feb91
our system are Phonemes-m-Context (PICs), which are represented as In we make at
Hidden Mmkov Models, each of which is eapressed as sequence of
the of our Elements The PELs to given phoneme
kind of alphabet for the of PICs. to in the and
signal processing algorithm. Our experimentation was
For the tests, two basic methods of the acoustic
performed using the development test models were investigated. 'nac first method of training the Resouro~
Managemera models is to the models for each test speaker from ~ta~ and dam
that speaker's training keeping the PEL of the PICs The in we that we are on
second approach is to use the models from the first melhod to
the 1991 derive segmentation of the then to the PICs in hrgely
speaker-depmdmt manner in order to improve the of speaker tun the only one and
full explanation of these methods is given, as are using the dam not yet
each method.
In to repotting on two we disoass N-
2. OVERVIEW OF THE DRAGON Best The N-Best algorithm is of the
proposed by Soong and Huang at the Jtme This CSR SYSTEM
runs as step and uses an A*-search (an also
known as decoder').
was at the June 1990 DARPA The
is and was be INTRODUCTION
of on an word
In we on some done at reports), when on
on The signal processing is performed by an additional
task. brief of The at kHz and
the is only
the to on -- up
RM are Our to make to kHz and an ~ameter -- of
the in ways 20 ms and as to
Dragon's The HMM-based
so far have the of
and speaker-dependent The The unit used the
in in 4. or PIC, the word
work was by the Agency and was by the and
Command under
59
N000-39-86-C-0307.
1.
12
844
'stack
to arc these Approximately
emulate
sentences
refers
general
labeling
alternatives.
semi-automatic
test
course
preprocessing
information
consonants,
about
speaker
selected
comparable
surrounding
Recognition
phonetic
the
environment
through
were
Dragon's
is
set
turn,
3
which
lexicon
determine
unstressed
the
suggests
acoustic
error
character
speaker's
PICs,
BEF,
sequence
selected
a
using
concatenating
question
question.
that
Several
board.
obtained
from
alternative
return
approaches
indicating
sentence)
a
appeared
model
(or
entries
the
multiple
literature
such
a
words,
Currently,
errl!dated
context
task.
model
finding
recognition
training
models
the
includes
the
available.
comes
identity
phrases,
training
the
relevant
PICs
which
speaker
cases
a
succeeding
signal
phonemes
emulate
rare
highly
well
possible
models
this
whether
signal
speaker's
above,
phoneme
a
reference
lexicon
speaker,
hand.
a
phonemes
prepausally
contains
lengthened
vowels
utterances
returns
training
in
are
syllabic
modeled
language
600
will
a
software
sequence
standard
the
stress
entirely
stressed
(phonetic
comparison
elements),
expected
each
39,000
based
that
which
speakers
represents
rates
a
recorded.
were
rate
estimates
average
duration
constraint
parameters
conform
acoustic
pair
the
reference
which
performed,
shared
reference
models
fi'om
build
words
was
texts
representing
phrases
the
training
goal
sentences
phoneme.
hardware,
A
using
detailed
reference
description
test
segmentation.
assess
models
order
initial
is
provide
benchmark
models
software
speaker's
order
reference
system
the
primarily
trained
acquisition
using
Prior
speakers,
the
the
been
each
obtained
models
acquisition
Modifications
processing,
dependent
score
speaker-
current
build
English
strategy
were
training
modified
procedure
word.
the
candidate
presented
score
version
English
Section
24
automatic
probability
a
(each
report[2].
which
earlier
system
detail
degrees
programming
stress),
described
3
extend
consonants.
mistakes.
22%
sentence
soon.
hypotheses
available
subject
SNOR
rapid
emulation
the
hardware
are
pronunciations.
pruning
pronunciations
used
reflect
errors
differences,
eliminate
reference
proportion
performance
paths.
versions
Another
explicit
important
well.
component
reasonably
small
signal
a
slandard
s

Univers
Ebooks
Livres audio
Presse
Podcasts
BD
Documents

DRAGON SYSTEMS RESOURCE MANAGEMENT BENCHMARK RESULTS FEBRUARY 1991

YouScribe

Le catalogue

Le service

Les conditions