MUSIC SEPARATION ENHANCEMENT WITH GENERATIVE MODELING

Noah Schaffer, Boaz Cogan, Ethan Manilow, Max Morrison, Prem Seetharaman, Bryan Pardo

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Despite phenomenal progress in recent years, state-of-the-art music separation systems produce source estimates with significant perceptual shortcomings, such as adding extraneous noise or removing harmonics. We propose a post-processing model (the Make it Sound Good (MSG) post-processor) to enhance the output of music source separation systems. We apply our post-processing model to state-of-the-art waveform-based and spectrogram-based music source separators, including a separator unseen by MSG during training. Our analysis of the errors produced by source separators shows that waveform models tend to introduce more high-frequency noise, while spectrogram models tend to lose transients and high frequency content. We introduce objective measures to quantify both kinds of errors and show MSG improves the source reconstruction of both kinds of errors. Crowdsourced subjective evaluations demonstrate that human listeners prefer source estimates of bass and drums that have been post-processed by MSG.

Original languageEnglish (US)
Title of host publicationProceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR 2022
EditorsPreeti Rao, Hema Murthy, Ajay Srinivasamurthy, Rachel Bittner, Rafael Caro Repetto, Masataka Goto, Xavier Serra, Marius Miron
PublisherInternational Society for Music Information Retrieval
Pages772-780
Number of pages9
ISBN (Electronic)9781732729926
StatePublished - 2022
Event23rd International Society for Music Information Retrieval Conference, ISMIR 2022 - Hybrid, Bengaluru, India
Duration: Dec 4 2022Dec 8 2022

Publication series

NameProceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR 2022

Conference

Conference23rd International Society for Music Information Retrieval Conference, ISMIR 2022
Country/TerritoryIndia
CityHybrid, Bengaluru
Period12/4/2212/8/22

ASJC Scopus subject areas

  • Music
  • Information Systems
  • Artificial Intelligence
  • Human-Computer Interaction
  • Signal Processing

Fingerprint

Dive into the research topics of 'MUSIC SEPARATION ENHANCEMENT WITH GENERATIVE MODELING'. Together they form a unique fingerprint.

Cite this