LEVERAGING HIERARCHICAL STRUCTURES FOR FEW-SHOT MUSICAL INSTRUMENT RECOGNITION

Hugo Flores Garcia, Aldo Aguilar, Ethan Manilow, Bryan Pardo

Research output: Chapter in Book/Report/Conference proceedingConference contribution

27 Scopus citations

Abstract

Deep learning work on musical instrument recognition has generally focused on instrument classes for which we have abundant data. In this work, we exploit hierarchical relationships between instruments in a few-shot learning setup to enable classification of a wider set of musical instruments, given a few examples at inference. We apply a hierarchical loss function to the training of prototypical networks, combined with a method to aggregate prototypes hierarchically, mirroring the structure of a predefined musical instrument hierarchy. These extensions require no changes to the network architecture and new levels can be easily added or removed. Compared to a non-hierarchical few-shot baseline, our method leads to a significant increase in classification accuracy and significant decrease in mistake severity on instrument classes unseen in training.

Original languageEnglish (US)
Title of host publicationProceedings of the 22nd International Conference on Music Information Retrieval, ISMIR 2021
PublisherInternational Society for Music Information Retrieval
Pages220-228
Number of pages9
ISBN (Electronic)9781732729902
StatePublished - 2021
Event22nd International Conference on Music Information Retrieval, ISMIR 2021 - Virtual, Online
Duration: Nov 7 2021Nov 12 2021

Publication series

NameProceedings of the 22nd International Conference on Music Information Retrieval, ISMIR 2021

Conference

Conference22nd International Conference on Music Information Retrieval, ISMIR 2021
CityVirtual, Online
Period11/7/2111/12/21

Funding

This work was funded, in part, by USA National Science Foundation Award 1901456.

ASJC Scopus subject areas

  • Music
  • Information Systems

Fingerprint

Dive into the research topics of 'LEVERAGING HIERARCHICAL STRUCTURES FOR FEW-SHOT MUSICAL INSTRUMENT RECOGNITION'. Together they form a unique fingerprint.

Cite this