The goal of this paper is to describe an optimization approach for selecting a reduced number of samples of the linear prediction residual. This can be extremely useful in pattern recognition tools. Sample determination is a combinatorial problem. Our approach addresses the combinatorial problem with simulated annealing based optimization. We show that better results than that obtained by a standard approximation approach, namely the multi-pulse algorithm, are obtained with our approach. Multi-pulse selects pulse locations by a sequential, suboptimal, algorithm and computes the pulses amplitudes according to an optimization criteria. Our approach finds the optimal residual samples by means of an optimization algorithm approach without amplitudes optimization. The compressed residual is fed to an all-pole model of speech obtaining better results than standard Multipulse modeling. We believe that this algorithm could be used as an alternative to other algorithms for medium-rate coding of speech in low complexity embedded devices. We also discuss performance and complexity issues of the described algorithm.
A novel approach for supporting approximate representation of linear prediction residuals in pattern recognition tools
Cuzzocrea, Alfredo;Mumolo, Enzo
2017-01-01
Abstract
The goal of this paper is to describe an optimization approach for selecting a reduced number of samples of the linear prediction residual. This can be extremely useful in pattern recognition tools. Sample determination is a combinatorial problem. Our approach addresses the combinatorial problem with simulated annealing based optimization. We show that better results than that obtained by a standard approximation approach, namely the multi-pulse algorithm, are obtained with our approach. Multi-pulse selects pulse locations by a sequential, suboptimal, algorithm and computes the pulses amplitudes according to an optimization criteria. Our approach finds the optimal residual samples by means of an optimization algorithm approach without amplitudes optimization. The compressed residual is fed to an all-pole model of speech obtaining better results than standard Multipulse modeling. We believe that this algorithm could be used as an alternative to other algorithms for medium-rate coding of speech in low complexity embedded devices. We also discuss performance and complexity issues of the described algorithm.File | Dimensione | Formato | |
---|---|---|---|
ICCSA2017_1.pdf
accesso aperto
Tipologia:
Bozza finale post-referaggio (post-print)
Licenza:
Copyright Editore
Dimensione
450.92 kB
Formato
Adobe PDF
|
450.92 kB | Adobe PDF | Visualizza/Apri |
publisher version.pdf
Accesso chiuso
Tipologia:
Documento in Versione Editoriale
Licenza:
Copyright Editore
Dimensione
824.75 kB
Formato
Adobe PDF
|
824.75 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.