
Please use this identifier to cite or link to this item: http://142.54.178.187:9060/xmlui/handle/123456789/2750
Full metadata record
DC Field | Value | Language
dc.contributor.author | Tahir, Muhammad | -
dc.date.accessioned | 2017-12-15T07:17:14Z | -
dc.date.accessioned | 2020-04-09T16:36:09Z | -
dc.date.available | 2020-04-09T16:36:09Z | -
dc.date.issued | 2016 | -
dc.identifier.uri | http://142.54.178.187:9060/xmlui/handle/123456789/2750 | -
dc.description.abstract | DNA sequences consist of the nucleotides A, C, G, and T and contain vital information about living organisms. The development of computing technologies, especially next-generation sequencing (NGS) technologies, has increased genomic data at a rapid rate. This increase presents significant research challenges in bioinformatics, such as sequence alignment, short-read error correction, and phylogenetic inference. Next-generation high-throughput sequencing technologies have opened new and thought-provoking research opportunities. In particular, next-generation sequencers produce a massive amount of short-read data in a single run. However, these large volumes of short-read data are highly susceptible to errors compared with shotgun sequencing. Therefore, there is a pressing need to design faster and more accurate statistical and computational tools to analyze these data. This research presents a novel and robust algorithm called HaShRECA for genome sequence short-read error correction. The developed algorithm is based on a probabilistic model that analyzes the potential errors in reads and uses the Hadoop-MapReduce framework to speed up computation. Experimental results show that HaShRECA is more accurate, as well as more time- and space-efficient, than previous algorithms. | en_US
dc.description.sponsorship | Higher Education Commission, Pakistan | en_US
dc.language.iso | en | en_US
dc.publisher | Iqra University Islamabad Campus, Pakistan | en_US
dc.subject | Applied Sciences | en_US
dc.title | Robust Algorithm for Genome Sequence Short Read Error Correction using Hadoop-MapReduce | en_US
dc.type | Thesis | en_US
Appears in Collections: Thesis

Files in This Item:
File | Description | Size | Format
1973.htm | - | 128 B | HTML


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.