Multi-resolution pre-processing for pattern recognition in images and audio signals
Abstract
With the rapid growth of technology and the proliferation of data in this “digital
age”, most current data (imaging and audio) applications require higher data resolution, higher data transmission rates, and better compression techniques to meet
the ever increasing demands placed on them. In the case of data compression, an
ideal system must produce good quality of data with high compression ratios while
minimising the computational overhead. In other words, bandwidth cost money,
therefore, the transmission and storage of information become costly. However, if
we can use less data, both transmission and storage become cheaper.
The Haar Wavelet Transform (HWT) has been chosen, here, to fulfil the data
compression requirement of this research. The HWT is a mathematical tool that is
ideally suited for hierarchically decomposing image information. For example, it
is the preferred method by the JPEG format to produce encoded images at higher
compression ratios. In addition, it can also be used to remove irrelevant/redundant
image data to facilitate the recording, transmission, and searching of data effectively.
HWT based image compression is computationally efficient and is symmetric in
nature (both the forward and the inverse transforms have the same complexity)
allowing for the building of fast compression and decompression routines.
The aim of this research is to determine optimum compression rates for data using Machine Learning methods; typically, Artificial Neural Network (ANN). The
MNIST dataset images and the TI46 database audio are subject to a HWT transformation and a signature of the information is constructed by removing irrelevant/redundant data by means of a cut-off function. Multiple signatures for images and audio are built for differing cut-offs are then input into a neural network.
Their performances are then compared to find the optimum HWT based compression. Which, it has made the input vectors of the network much more compact and
thus easier to establish relationships between them, leading to a rapid and effective
classification of the signal.
Collections
The following license files are associated with this item: