Multiclass audio segmentation in broadcast environments

IT

Lecture given by Pablo Gimeno, University of Zaragoza, Spain

Abstract: Audio segmentation can be defined as the division of an audio signal into smaller fragments according to a predefined set of attributes. This wide definition could include several systems depending on the set of rules considered. In this talk, the focus will be set on multiclass audio segmentation tasks, aiming to obtain a set of labels describing several tipologies in an audio signal such as speech, music and noise. During the presentation, different approaches will be presented evaluating these kind of systems in broadcast domain data.

Biography: Pablo Gimeno is a Speech scientist at ViVoLab research group. He completed his thesis under the supervision of Dr. Alfonso Ortega. His research interests span the areas of speech processing, audio and speech segmentation, speech activity detection and automatic speech recognition.

Tags: ai audio segmentation automatic speech recognition speech processing

Added by: Emmanuelle Billard
Additional owner(s):
- Gregor Dupuy
Updated on: Sept. 26, 2023, 12:49 p.m.
Channel:
- IT
Type: Conférence
Main language: English
Discipline(s):
- Informatique

IT

Multiclass audio segmentation in broadcast environments

Lecture given by Pablo Gimeno, University of Zaragoza, Spain

Infos