AN INTELLIGENT SPARSE VISUAL FEATURE DESCRIPTORS FOR CONTENT-BASED IMAGE RETRIEVAL

Mehmood, Zahid

Please use this identifier to cite or link to this item: http://localhost:80/xmlui/handle/123456789/2324

Full metadata record

DC Field	Value	Language
dc.contributor.author	Mehmood, Zahid	-
dc.date.accessioned	2018-04-16T07:04:41Z	-
dc.date.accessioned	2020-04-09T16:30:28Z	-
dc.date.available	2020-04-09T16:30:28Z	-
dc.date.issued	2017	-
dc.identifier.uri	http://142.54.178.187:9060/xmlui/handle/123456789/2324	-
dc.description.abstract	Content-based image retrieval (CBIR) techniques are used to retrieve similar images from image repositories by utilizing the visual contents of the images. From last few years, bagof- visual-words (BoVW) model is most commonly used for image retrieval and got promising results in terms of accuracy and effectiveness. However, BoVW model still has some problems, such as an image is represented as an orderless global histogram of visual words that neglects the spatial layout of the image. Spatial information is an important component that provides discriminating details for accurate retrieval of images. In this thesis, three novel approaches for image representations are presented by the selection of appropriate semantic regions of an image by constructing histograms of visual words. The standard image databases are used to determine the efficiency of proposed approaches. Following approaches are presented in this dissertation: A novel image representation is presented using the characteristics of local and global information in the form of histograms of visual words. The global information is obtained by constructing the histogram of visual words over the whole image, while the histogram of visual words for local information is constructed over the local rectangular region of the image. The local histogram represents the spatial information of salient objects. In order to verify the performance of the proposed approach, a number of experiments are conducted on the standard image databases (Corel-A, Caltech-256, and Ground truth). The results show that the proposed image representation significantly enhance the effectiveness of image retrieval. Based on the semantic similarity in an image, another image representation is proposed by constructing the histograms of visual words by splitting an image into two rectangular regions that add the spatial information to the inverted index of the BoVW based image representation. By utilizing this phenomenon of image representation, different visual words for upper and lower rectangular regions of an image are obtained for better image retrieval performance. For the verification of proposed approach, extensive experiments are conducted vii on Corel-A, and Ground truth image databases, proof the robustness of the proposed approach. In order to overcome the problems of overfitting on large dictionary sizes, lack of spatial information, and to reduce the computational cost, a new image representation based on the weighted average of triangular histograms (WATH) is also introduced. The image is divided into four triangular regions in order to incorporate the spatial information to the inverted index of the BoVW based image representation, and a histogram of visual words are computed from each triangular region. An appropriate weight is assigned to each histogram in order to eliminate the aforementioned problems. The assigned weight reduces; the size of the dictionary by reducing the non-salient visual words, and the computational cost. The proposed approach also provide the consistent performance on large dictionary sizes. The quantitative and qualitative analysis conducted on two image databases (Corel-A and Corel- 1500) shows the robustness of the proposed approach among the recent image retrieval approaches. Keywords: Content-based image retrieval (CBIR); Bag-of-visual-words (BoVW); Local and global histograms; Rectangular spatial histograms; Weighted triangular histograms.	en_US
dc.description.sponsorship	Higher Education Commission, Pakistan	en_US
dc.language.iso	en	en_US
dc.publisher	UNIVERSITY OF ENGINEERING AND TECHNOLOGY, TAXILA, PAKISTAN	en_US
dc.subject	Applied Sciences	en_US
dc.title	AN INTELLIGENT SPARSE VISUAL FEATURE DESCRIPTORS FOR CONTENT-BASED IMAGE RETRIEVAL	en_US
dc.type	Thesis	en_US
Appears in Collections:	Thesis

Files in This Item:

File	Description	Size	Format
8400.htm		128 B	HTML	View/Open

Show simple item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets