Towards a model of perceived quality of blind audio source separation

Brendan Fox*, Bryan A Pardo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

Existing perceptual models of audio quality, such as PEAQ, perform poorly when applied to blind audio source separation (BASS). We propose to create a perceptual model designed specifically for BASS algorithms. To create this model, we have designed a study to capture subjective human assessments of signal distortions resulting from BASS. In this study, humans rate the similarity between pairs of sounds. The first sound in each pair is a reference sound. The second sound is a distorted version of the reference, extracted from a multi-source mixture by a current BASS approach. We then correlate human similarity assessments with machine-measurable parameters. This paper describes preliminary results from a pilot study of three participants. Results indicate a strong correlation between human similarity assessments and the relative fraction of frames for which at least one frequency band in the distorted signal contains a significant noise component (RDF).
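
The two measurable quantities named in the abstract can be illustrated with a minimal sketch. It assumes the per-band noise decisions are already available as booleans; the abstract does not state how a "significant noise component" is detected, so that step is left out. The function names `distorted_frame_fraction` and `pearson`, and the use of Pearson correlation, are illustrative assumptions and are not taken from the paper.

```python
from math import sqrt


def distorted_frame_fraction(noise_flags):
    """Fraction of frames in which at least one frequency band is flagged.

    noise_flags is a list of frames; each frame is a list of booleans, one
    per frequency band. How a band gets flagged is an assumption left to
    the caller, since the abstract does not specify the criterion.
    """
    if not noise_flags:
        return 0.0
    flagged = sum(1 for frame in noise_flags if any(frame))
    return flagged / len(noise_flags)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)
```

In use, one would compute the frame fraction for each distorted sound and pass those values, together with the matching human similarity ratings, to `pearson`.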

Original language: English (US)
Title of host publication: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007
Publisher: IEEE Computer Society
Pages: 1898-1901
Number of pages: 4
ISBN (Print): 1424410177, 9781424410170
DOIs: https://doi.org/10.1109/icme.2007.4285046
State: Published - 2007
Event: IEEE International Conference on Multimedia and Expo, ICME 2007 - Beijing, China
Duration: Jul 2 2007 to Jul 5 2007

Publication series

Name: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007

Other

Other: IEEE International Conference on Multimedia and Expo, ICME 2007
Country: China
City: Beijing
Period: 7/2/07 to 7/5/07

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Software

  • Cite this

    Fox, B., & Pardo, B. A. (2007). Towards a model of perceived quality of blind audio source separation. In Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007 (pp. 1898-1901). [4285046] (Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007). IEEE Computer Society. https://doi.org/10.1109/icme.2007.4285046