VoiceAssist

Guiding users to high-quality voice recordings

Prem Seetharaman, Gautham Mysore, Bryan A Pardo, Paris Smaragdis, Celso Gomes

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Voice recording is a challenging task with many pitfalls due to sub-par recording environments, mistakes in recording setup, microphone quality, etc. Newcomers to voice recording often have difficulty recording their voice, leading to recordings with low sound quality. Many amateur recordings of poor quality have two key problems: too much reverberation (echo), and too much background noise (e.g. fans, electronics, street noise). We present VoiceAssist, a system that helps inexperienced users produce high quality recordings by providing real-time visual feedback on audio quality. We integrate modern audio quality measures into an interactive human-machine feedback loop, so that the audio quality can be maximized at capture-time. We demonstrate the utility of this feedback for improving the recording quality with a user study. When presented with visual feedback about recording quality, users produced recordings that were strongly preferred by third-party listeners, when compared to recordings made without this feedback.

Original languageEnglish (US)
Title of host publicationCHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450359702
DOIs
StatePublished - May 2 2019
Event2019 CHI Conference on Human Factors in Computing Systems, CHI 2019 - Glasgow, United Kingdom
Duration: May 4 2019May 9 2019

Publication series

NameConference on Human Factors in Computing Systems - Proceedings

Conference

Conference2019 CHI Conference on Human Factors in Computing Systems, CHI 2019
CountryUnited Kingdom
CityGlasgow
Period5/4/195/9/19

Fingerprint

Feedback
Reverberation
Microphones
Acoustic noise
Fans
Electronic equipment
Acoustic waves

Keywords

  • Active capture
  • Audio quality
  • Creativity support tools
  • Feedback
  • Interfaces
  • Narration
  • Speech
  • Voice recording

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Graphics and Computer-Aided Design
  • Software

Cite this

Seetharaman, P., Mysore, G., Pardo, B. A., Smaragdis, P., & Gomes, C. (2019). VoiceAssist: Guiding users to high-quality voice recordings. In CHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Conference on Human Factors in Computing Systems - Proceedings). Association for Computing Machinery. https://doi.org/10.1145/3290605.3300539
Seetharaman, Prem ; Mysore, Gautham ; Pardo, Bryan A ; Smaragdis, Paris ; Gomes, Celso. / VoiceAssist : Guiding users to high-quality voice recordings. CHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, 2019. (Conference on Human Factors in Computing Systems - Proceedings).
@inproceedings{a73cd37049254a5ba0bfd1894f9a4ed7,
title = "VoiceAssist: Guiding users to high-quality voice recordings",
abstract = "Voice recording is a challenging task with many pitfalls due to sub-par recording environments, mistakes in recording setup, microphone quality, etc. Newcomers to voice recording often have difficulty recording their voice, leading to recordings with low sound quality. Many amateur recordings of poor quality have two key problems: too much reverberation (echo), and too much background noise (e.g. fans, electronics, street noise). We present VoiceAssist, a system that helps inexperienced users produce high quality recordings by providing real-time visual feedback on audio quality. We integrate modern audio quality measures into an interactive human-machine feedback loop, so that the audio quality can be maximized at capture-time. We demonstrate the utility of this feedback for improving the recording quality with a user study. When presented with visual feedback about recording quality, users produced recordings that were strongly preferred by third-party listeners, when compared to recordings made without this feedback.",
keywords = "Active capture, Audio quality, Creativity support tools, Feedback, Interfaces, Narration, Speech, Voice recording",
author = "Prem Seetharaman and Gautham Mysore and Pardo, {Bryan A} and Paris Smaragdis and Celso Gomes",
year = "2019",
month = "5",
day = "2",
doi = "10.1145/3290605.3300539",
language = "English (US)",
series = "Conference on Human Factors in Computing Systems - Proceedings",
publisher = "Association for Computing Machinery",
booktitle = "CHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems",

}

Seetharaman, P, Mysore, G, Pardo, BA, Smaragdis, P & Gomes, C 2019, VoiceAssist: Guiding users to high-quality voice recordings. in CHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Conference on Human Factors in Computing Systems - Proceedings, Association for Computing Machinery, 2019 CHI Conference on Human Factors in Computing Systems, CHI 2019, Glasgow, United Kingdom, 5/4/19. https://doi.org/10.1145/3290605.3300539

VoiceAssist : Guiding users to high-quality voice recordings. / Seetharaman, Prem; Mysore, Gautham; Pardo, Bryan A; Smaragdis, Paris; Gomes, Celso.

CHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, 2019. (Conference on Human Factors in Computing Systems - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - VoiceAssist

T2 - Guiding users to high-quality voice recordings

AU - Seetharaman, Prem

AU - Mysore, Gautham

AU - Pardo, Bryan A

AU - Smaragdis, Paris

AU - Gomes, Celso

PY - 2019/5/2

Y1 - 2019/5/2

N2 - Voice recording is a challenging task with many pitfalls due to sub-par recording environments, mistakes in recording setup, microphone quality, etc. Newcomers to voice recording often have difficulty recording their voice, leading to recordings with low sound quality. Many amateur recordings of poor quality have two key problems: too much reverberation (echo), and too much background noise (e.g. fans, electronics, street noise). We present VoiceAssist, a system that helps inexperienced users produce high quality recordings by providing real-time visual feedback on audio quality. We integrate modern audio quality measures into an interactive human-machine feedback loop, so that the audio quality can be maximized at capture-time. We demonstrate the utility of this feedback for improving the recording quality with a user study. When presented with visual feedback about recording quality, users produced recordings that were strongly preferred by third-party listeners, when compared to recordings made without this feedback.

AB - Voice recording is a challenging task with many pitfalls due to sub-par recording environments, mistakes in recording setup, microphone quality, etc. Newcomers to voice recording often have difficulty recording their voice, leading to recordings with low sound quality. Many amateur recordings of poor quality have two key problems: too much reverberation (echo), and too much background noise (e.g. fans, electronics, street noise). We present VoiceAssist, a system that helps inexperienced users produce high quality recordings by providing real-time visual feedback on audio quality. We integrate modern audio quality measures into an interactive human-machine feedback loop, so that the audio quality can be maximized at capture-time. We demonstrate the utility of this feedback for improving the recording quality with a user study. When presented with visual feedback about recording quality, users produced recordings that were strongly preferred by third-party listeners, when compared to recordings made without this feedback.

KW - Active capture

KW - Audio quality

KW - Creativity support tools

KW - Feedback

KW - Interfaces

KW - Narration

KW - Speech

KW - Voice recording

UR - http://www.scopus.com/inward/record.url?scp=85067602101&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85067602101&partnerID=8YFLogxK

U2 - 10.1145/3290605.3300539

DO - 10.1145/3290605.3300539

M3 - Conference contribution

T3 - Conference on Human Factors in Computing Systems - Proceedings

BT - CHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

PB - Association for Computing Machinery

ER -

Seetharaman P, Mysore G, Pardo BA, Smaragdis P, Gomes C. VoiceAssist: Guiding users to high-quality voice recordings. In CHI 2019 - Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery. 2019. (Conference on Human Factors in Computing Systems - Proceedings). https://doi.org/10.1145/3290605.3300539