Show simple item record

dc.contributor.advisorDaly, Mark
dc.contributor.advisorFlynn, Ronan
dc.contributor.authorMansor, Noha
dc.date.accessioned2022-06-30T12:38:18Z
dc.date.available2022-06-30T12:38:18Z
dc.date.copyright2020
dc.date.issued2020-10-05
dc.identifier.citationMansor, N. (2020). Multi-resolution pre-processing for pattern recognition in images and audio signals. Thesis (Masters - Msc (Research) in Software Engineering. Software Research Institute. Department of Electronics & Informatics AIT.en_US
dc.identifier.urihttp://research.thea.ie/handle/20.500.12065/4007
dc.description.abstractWith the rapid growth of technology and the proliferation of data in this “digital age”, most current data (imaging and audio) applications require higher data resolution, higher data transmission rates, and better compression techniques to meet the ever increasing demands placed on them. In the case of data compression, an ideal system must produce good quality of data with high compression ratios while minimising the computational overhead. In other words, bandwidth cost money, therefore, the transmission and storage of information become costly. However, if we can use less data, both transmission and storage become cheaper. The Haar Wavelet Transform (HWT) has been chosen, here, to fulfil the data compression requirement of this research. The HWT is a mathematical tool that is ideally suited for hierarchically decomposing image information. For example, it is the preferred method by the JPEG format to produce encoded images at higher compression ratios. In addition, it can also be used to remove irrelevant/redundant image data to facilitate the recording, transmission, and searching of data effectively. HWT based image compression is computationally efficient and is symmetric in nature (both the forward and the inverse transforms have the same complexity) allowing for the building of fast compression and decompression routines. The aim of this research is to determine optimum compression rates for data using Machine Learning methods; typically, Artificial Neural Network (ANN). The MNIST dataset images and the TI46 database audio are subject to a HWT transformation and a signature of the information is constructed by removing irrelevant/redundant data by means of a cut-off function. Multiple signatures for images and audio are built for differing cut-offs are then input into a neural network. Their performances are then compared to find the optimum HWT based compression. Which, it has made the input vectors of the network much more compact and thus easier to establish relationships between them, leading to a rapid and effective classification of the signal.en_US
dc.formatPDFen_US
dc.language.isoengen_US
dc.publisherAthlone Institute of Technologyen_US
dc.rightsAttribution-Non-Commercial-Share-Alike-4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/*
dc.subjectPattern recognitionen_US
dc.subjectAudio signalsen_US
dc.subjectImageen_US
dc.titleMulti-resolution pre-processing for pattern recognition in images and audio signalsen_US
dc.typeinfo:eu-repo/semantics/masterThesisen_US
dc.contributor.affiliationAthlone Institute of Technologyen_US
dc.rights.accessrightsinfo:eu-repo/semantics/openAccessen_US
dc.subject.departmentFaculty of Engineering & Informatics AITen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-Non-Commercial-Share-Alike-4.0 International
Except where otherwise noted, this item's license is described as Attribution-Non-Commercial-Share-Alike-4.0 International