Abstract
The prevalence of digital cameras and video-capable mobile phones enables the common practice of audiences capturing recordings of live music performances. It is now increasingly common to find some of these personal recordings online, uploaded to popular video hosting websites. Recognizing the desire of music fans to obtain a recording of good audio quality, we offer a time-domain fusion technique for combining these samples to achieve higher signal-to-noise ratio (SNR) than the single best original sample. When no composite output can improve SNR, the best original sample recording is returned. The scenario is modeled as a single blind source with multiple diverse receivers. As every live performance is unique, we assume no prior knowledge of a reference signal, and no knowledge of the original recordings' SNRs. Using statistical characteristics among the samples, we infer relative SNR, rank samples by quality, and determine whether a composite delivers improvement. The technique can be applied in a variety of contexts where multiple receivers have opportunity to capture audio, speech, or other signals.
Original language | English (US) |
---|---|
Title of host publication | Fusion 2011 - 14th International Conference on Information Fusion |
State | Published - Sep 13 2011 |
Event | 14th International Conference on Information Fusion, Fusion 2011 - Chicago, IL, United States Duration: Jul 5 2011 → Jul 8 2011 |
Other
Other | 14th International Conference on Information Fusion, Fusion 2011 |
---|---|
Country/Territory | United States |
City | Chicago, IL |
Period | 7/5/11 → 7/8/11 |
Keywords
- Acoustic noise
- Audio
- Blind signal
- Estimation
ASJC Scopus subject areas
- Information Systems