Acoustic Event Detection Based on Feature-Level Fusion of Audio and Video Modalities. (13th February 2011)
- Record Type:
- Journal Article
- Title:
- Acoustic Event Detection Based on Feature-Level Fusion of Audio and Video Modalities. (13th February 2011)
- Main Title:
- Acoustic Event Detection Based on Feature-Level Fusion of Audio and Video Modalities
- Authors:
- Butko, Taras
Canton-Ferrer, Cristian
Segura, Carlos
Giró, Xavier
Nadeu, Climent
Hernando, Javier
Casas, Josep R.
- Other Names:
- Hong, Sangjin (Academic Editor)
- Abstract:
- Acoustic event detection (AED) aims at determining the identity of sounds and their temporal position in audio signals. When applied to spontaneously generated acoustic events, AED based only on audio information shows a large amount of errors, which are mostly due to temporal overlaps. Actually, temporal overlaps accounted for more than 70% of errors in the real-world interactive seminar recordings used in CLEAR 2007 evaluations. In this paper, we improve the recognition rate of acoustic events using information from both audio and video modalities. First, the acoustic data are processed to obtain both a set of spectrotemporal features and the 3D localization coordinates of the sound source. Second, a number of features are extracted from video recordings by means of object detection, motion analysis, and multicamera person tracking to represent the visual counterpart of several acoustic events. A feature-level fusion strategy is used, and a parallel structure of binary HMM-based detectors is employed in our work. The experimental results show that information from both the microphone array and video cameras is useful to improve the detection rate of isolated as well as spontaneously generated acoustic events.
- Is Part Of:
- EURASIP journal on advances in signal processing. Volume 2011(2011)
- Journal:
- EURASIP journal on advances in signal processing
- Issue:
- Volume 2011(2011)
- Issue Display:
- Volume 2011, Issue 2011 (2011)
- Year:
- 2011
- Volume:
- 2011
- Issue:
- 2011
- Issue Sort Value:
- 2011-2011-2011-0000
- Page Start:
- Page End:
- Publication Date:
- 2011-02-13
- Subjects:
- Signal processing -- Periodicals
Traitement du signal
Signal processing
Periodicals
621.3822
- Journal URLs:
- https://asp-eurasipjournals.springeropen.com/
http://link.springer.com/
http://www.hindawi.com/journals/asp/
- DOI:
- 10.1155/2011/485738
- Languages:
- English
- ISSNs:
- 1687-6172
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 25227.xml