NOTE: The following materials are presented for timely
dissemination of academic and technical work. Copyright and all other rights
therein are reserved by authors and/or other copyright holders. Persoanl
use of the following materials is permitted and, however, people using
the materials or information are expected to adhere to the terms and
constraints invoked by the related copyright.
Speaker Identification Using Time-Delay HMEs
ABSTRACT
In this paper, we extend the Hierarchical Mixtures of Experts (HME) to
temporal processing and explore it for a substantial problem, that of
text-dependent speaker identification. For a specific multiway classification,
we propose a generalized Bernoulli density instead of the multinomial logit
density to avoid the instability during training. Time-delay technique is
applied for spatio-temporal processing in the HME and a combining scheme is
presented for combining multiple time-delay HMEs in order to complete
multi-scale analysis for the temporal data. Using the time-delay HME along
with the EM algorithm as well as the combination of multiple time-delay HMEs,
the speaker identification system has a good performance and yields
significantly fast training. We have also addressed some issues about the
time-delay techniques in the HME.
Click
ijns96.pdf
for full text