Help Privacy Policy Disclaimer
  Advanced SearchBrowse




Conference Paper

Using Conditional Random Fields to Predict Pitch Accent in Conversational Speech

There are no MPG-Authors in the publication available
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available

Gregory, M., & Altun, Y. (2004). Using Conditional Random Fields to Predict Pitch Accent in Conversational Speech. In D. Scott, W. Daelemans, & M. Walker (Eds.), 42nd Annual Meeting of the Association for Computational Linguistics (ACL 2004) (pp. 677-684). East Stroudsburg, PA, USA: Association for Computational Linguistics.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-D8AD-F
The detection of prosodic characteristics is an important
aspect of both speech synthesis and speech
recognition. Correct placement of pitch accents aids
in more natural sounding speech, while automatic
detection of accents can contribute to better wordlevel
recognition and better textual understanding.
In this paper we investigate probabilistic, contextual,
and phonological factors that influence pitch
accent placement in natural, conversational speech
in a sequence labeling setting. We introduce Conditional
Random Fields (CRFs) to pitch accent prediction
task in order to incorporate these factors efficiently
in a sequence model. We demonstrate the
usefulness and the incremental effect of these factors
in a sequence model by performing experiments
on hand labeled data from the Switchboard Corpus.
Our model outperforms the baseline and previous
models of pitch accent prediction on the Switchboard