Shape matters: Machine classification and listeners’ perceptual discrimination of American English intonational tunes

Jennifer Cole, Sam Tilsen, Jeremy Steffman

Research output: Contribution to journalConference articlepeer-review

6 Scopus citations

Abstract

In Autosegmental-Metrical models of intonational phonology, pitch accents, phrase accents and boundary tones may combine freely to create a predicted set of phonologically distinct phrase-final “nuclear” tunes. In this study we ask if an 8-way distinction in nuclear tune shape in American English, predicted from combinations of 2 (monotonal) pitch accents, 2 phrase accents and 2 boundary tones, is manifest in speech production and in speech perception. F0 trajectories from an imitative speech production experiment were analyzed using (i) neural net classification, and (ii) human listeners’ perceptual discrimination of the model utterances. Pairwise classification accuracy of the imitative productions is highest for tune pairs that differ in holistic shape (high-rising vs. rise-fall), and poorest for tunes with the same shape that differ in (higher vs. lower) final f0. Perception results show a similar pattern, with poor pairwise discrimination for tunes that differ primarily, but by a small degree, in final f0. Together the results suggest a hierarchy of distinctiveness among nuclear tunes, with a robust distinction based on holistic tune shape, which only partly aligns with distinctions in tonal specification, and a weak/poorly differentiated distinction between tunes with the same holistic shape but small differences in final f0.

Original languageEnglish (US)
Pages (from-to)297-301
Number of pages5
JournalProceedings of the International Conference on Speech Prosody
Volume2022-May
DOIs
StatePublished - 2022
Event11th International Conference on Speech Prosody, Speech Prosody 2022 - Lisbon, Portugal
Duration: May 23 2022May 26 2022

Funding

We thank Lisa Cox, Chun Chan and Stefanie Shattuck-Hufnagel for technical and conceptual contributions. This project was supported by NSF BCS-1944773.

Keywords

  • deep learning
  • intonation perception
  • intonation production
  • neural net classification
  • nuclear tunes

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Shape matters: Machine classification and listeners’ perceptual discrimination of American English intonational tunes'. Together they form a unique fingerprint.

Cite this