Degenerate unmixing estimation technique using the constant Q transform

Zafar Rafii*, Bryan A Pardo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

The Degenerate Unmixing Estimation Technique (DUET) is a Blind Source Separation (BSS) algorithm for stereo audio. DUET depends on an amplitude-phase 2d histogram built from the differences between the two channels, where peaks in the histogram indicate sources in the mixture. If peaks overlap, separation becomes unfeasible. This is often the case for music mixtures. We propose to improve peak separation by building histograms from time-frequency representations based on the Constant Q Transform (CQT) instead of the Fourier Transform (FT). The CQT has a logarithmic frequency resolution matching the geometrically spaced notes of the Western music scale. We also adaptively resize histogram bins and use Wiener filtering to improve peak resolving and source reconstruction. Results on mixtures of harmonic musical instruments show improvement in separation, especially at low frequencies and for closely spaced sources.

Original languageEnglish (US)
Title of host publication2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
Pages217-220
Number of pages4
DOIs
StatePublished - Aug 18 2011
Event36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Prague, Czech Republic
Duration: May 22 2011May 27 2011

Other

Other36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
Country/TerritoryCzech Republic
CityPrague
Period5/22/115/27/11

Keywords

  • Blind Source Separation
  • Constant Q Transform
  • Degenerate Unmixing Estimation Technique

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Degenerate unmixing estimation technique using the constant Q transform'. Together they form a unique fingerprint.

Cite this