Feature generation has been proposed recently to generate feature sets automatically, as opposed to human-designed feature sets. This technique has shown promising results in many areas of supervised classification, in particular in the audio domain. However, feature generation is usually performed blindly, with genetic algorithms. As a result search performance is poor, thereby limiting its practical use. We propose a method to increase the search performance of feature generation systems. We focus on analytical features, i.e. features determined by their syntax. Our method consists in first extracting statistical properties of the feature space called spin patterns, by analogy with statistical physics. We show that spin patterns carry information about the topology of the feature space. We exploit these spin patterns to guide a simulated annealing algorithm specifically designed for feature generation. We evaluate our approach on three audio classification problems, and show that it increases performance by an order of magnitude. More generally this work is a first step in using tools from statistical physics for the supervised classification of complex audio signals.

G. Barbieri, M. Degli Esposti, F. Pachet, P. Roy (2010). Is There a Relation Between the Syntax and the Fitness of an Audio Feature?. UTRECHT : IGITUR.

Is There a Relation Between the Syntax and the Fitness of an Audio Feature?

BARBIERI, GABRIELE;DEGLI ESPOSTI, MIRKO;
2010

Abstract

Feature generation has been proposed recently to generate feature sets automatically, as opposed to human-designed feature sets. This technique has shown promising results in many areas of supervised classification, in particular in the audio domain. However, feature generation is usually performed blindly, with genetic algorithms. As a result search performance is poor, thereby limiting its practical use. We propose a method to increase the search performance of feature generation systems. We focus on analytical features, i.e. features determined by their syntax. Our method consists in first extracting statistical properties of the feature space called spin patterns, by analogy with statistical physics. We show that spin patterns carry information about the topology of the feature space. We exploit these spin patterns to guide a simulated annealing algorithm specifically designed for feature generation. We evaluate our approach on three audio classification problems, and show that it increases performance by an order of magnitude. More generally this work is a first step in using tools from statistical physics for the supervised classification of complex audio signals.
2010
ISMIR 2010 : Proceedings of the 11th International Society for Music Information
321
326
G. Barbieri, M. Degli Esposti, F. Pachet, P. Roy (2010). Is There a Relation Between the Syntax and the Fitness of an Audio Feature?. UTRECHT : IGITUR.
G. Barbieri; M. Degli Esposti; F. Pachet; P. Roy
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/91573
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact