Please use this identifier to cite or link to this item: http://hdl.handle.net/2080/4786
Title: DNA Sequence Similarity Using Discrete Wavelet Transform
Authors: Banerjee, Nikita
Swain, Pratyush
Behera, Ayush Kumar
Sa, Pankaj Kumar
Keywords: Alignment-free method
Kmers
Chaos game representation
Discrete wavelet transform
Distance measures
Issue Date: Nov-2024
Citation: International Conference on Big Data Analytics in Bioinformatics(DABCon-2024), Narula Institute of Technology, Kolkata, India, 21 - 23 November 2024
Abstract: The development of next-generation sequencing technologies has resulted in exponential growth in sequence data. Global or local, pairwise or multiple sequence alignment was the foundation of early methods for sequence analysis, which was computationally intensive and time-consuming. In addition, gaps, mismatches, and insertions/deletions might provide unsatisfactory results during the alignment process, particularly for highly divergent sequences. We have used alignment-free sequence analysis to overcome this disadvantage and identify their phylogenetic relationship by using discrete wavelet transformation on the chaos game representation. First, DNA sequences were converted into 2D images to create a feature matrix. The DWT method is then applied to the resulting matrix to extract essential features and scale down their dimensionality; the resulting feature vectors are used for similarity analysis using different distance matrices. Further, we have executed our model on two benchmark datasets, Cichlid fish and Yersinia strains, from the AFproject to evaluate performance, and we have achieved top rank with an accuracy of 95% in Cichlid fish and 100% in Yersinia strains.
Description: Copyright belongs to proceeding publisher
URI: http://hdl.handle.net/2080/4786
Appears in Collections:Conference Papers

Files in This Item:
File Description SizeFormat 
2024_DABCon_NBanerjee_DNA.pdf944.66 kBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.