Corpora, Databases, and Internet Resources: Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora

Jennifer Cole*, Mark Hasegawa-Johnson, Dan Loehr, Linda Van Guilder, Henning Reetz, Stefan A. Frisch

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This article introduces a wide range of approaches to using large bodies of data for linguistic research. Corpus analysis for phonological research involves the investigation of the phonetic, phonological, and lexical properties of speech for the purpose of understanding the patterns of variation in the phonetic expression of words, and the distributional patterns of sound elements in relation to the linguistic context. A speech corpus provides a basis for investigating variability in phonetic form and also provides a rich resource for studying the relationship between phonological form and other levels of linguistic structure. Linguistic metadata provides information about the speakers, such as sex, age, ethnicity, and region of residence. Metadata may also provide information about speaker recruitment and recording procedures. Forced alignment is done using algorithms from automatic speech recognition (ASR), and is most successful when each phone associated with the word in its dictionary form is actually fully pronounced. One of the easiest methods of manipulating natural speech is the splicing technique, where parts of a speech signal are cut out, repeated, or cross-spliced with another piece of the signal. The gating technique is another form of natural speech signal manipulation often applied in psycholinguistic experiments, where parts of a speech signal are cut off, and incrementally more of the signal is presented to a listener. Another speech signal manipulation is the mixing of two signals.

Original languageEnglish (US)
Title of host publicationThe Oxford Handbook of Laboratory Phonology
PublisherOxford University Press
ISBN (Electronic)9780191744068
ISBN (Print)9780199575039
DOIs
StatePublished - Sep 18 2012

Keywords

  • Automatic speech recognition
  • Corpus analysis
  • Gating technique
  • Lexical properties
  • Linguistic metadata
  • Phonology
  • Speech signal manipulation
  • Usage frequency

ASJC Scopus subject areas

  • Arts and Humanities(all)
  • Social Sciences(all)

Fingerprint Dive into the research topics of 'Corpora, Databases, and Internet Resources: Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora'. Together they form a unique fingerprint.

Cite this