Work place: Dept. of Electrical & Computer Engineering, University of Patras, 26500 Patras, Greece
E-mail: fakotaki@upatras.gr
Website:
Research Interests: Human-Computer Interaction, Computer systems and computational processes, Artificial Intelligence, Natural Language Processing, Pattern Recognition, Systems Architecture
Biography
Nikos Fakotakis is Professor at the University of Patras. His research interests include speech and audio signal processing, pattern recognition, natural language processing, dialogue systems, human-computer interaction, artificial intelligence, and bioacoustics.
By Theodoros Theodorou Iosif Mporas Nikos Fakotakis
DOI: https://doi.org/10.5815/ijitcs.2014.11.01, Pub. Date: 8 Oct. 2014
In this report we present an overview of the approaches and techniques that are used in the task of automatic audio segmentation. Audio segmentation aims to find changing points in the audio content of an audio stream. Initially, we present the basic steps in an automatic audio segmentation procedure. Afterwards, the basic categories of segmentation algorithms, and more specific the unsupervised, the data-driven and the mixed algorithms, are presented. For each of the categorizations the segmentation analysis is followed by details about proposed architectural parameters, such us the audio descriptor set, the mathematical functions in unsupervised algorithms and the machine learning algorithms of data-driven modules. Finally a review of proposed architectures in the automatic audio segmentation literature appears, along with details about the experimenting audio environment (heading of database and list of audio events of interest), the basic modules of the procedure (categorization of the algorithm, audio descriptor set, architectural parameters and potential optional modules) along with the maximum achieved accuracy.
[...] Read more.By Iosif Mporas Todor Ganchev Otilia Kocsis Nikos Fakotakis Olaf Jahn Klaus Riede
DOI: https://doi.org/10.5815/ijisa.2013.07.02, Pub. Date: 8 Jun. 2013
We report on the development of an automated acoustic bird recognizer with improved noise robustness, which is part of a long-term project, aiming at the establishment of an automated biodiversity monitoring system at the Hymettus Mountain near Athens, Greece. In particular, a typical audio processing strategy, which has been proved quite successful in various audio recognition applications, was amended with a simple and effective mechanism for integration of temporal contextual information in the decision-making process. In the present implementation, we consider integration of temporal contextual information by joint post-processing of the recognition results for a number of preceding and subsequent audio frames. In order to evaluate the usefulness of the proposed scheme on the task of acoustic bird recognition, we experimented with six widely used classifiers and a set of real-field audio recordings for two bird species which are present at the Hymettus Mountain. The highest achieved recognition accuracy obtained on the real-field data was approximately 93%, while experiments with additive noise showed significant robustness in low signal-to-noise ratio setups. In all cases, the integration of temporal contextual information was found to improve the overall accuracy of the recognizer.
[...] Read more.Subscribe to receive issue release notifications and newsletters from MECS Press journals