Please use this identifier to cite or link to this item:
http://hdl.handle.net/2080/4786
Title: | DNA Sequence Similarity Using Discrete Wavelet Transform |
Authors: | Banerjee, Nikita Swain, Pratyush Behera, Ayush Kumar Sa, Pankaj Kumar |
Keywords: | Alignment-free method Kmers Chaos game representation Discrete wavelet transform Distance measures |
Issue Date: | Nov-2024 |
Citation: | International Conference on Big Data Analytics in Bioinformatics(DABCon-2024), Narula Institute of Technology, Kolkata, India, 21 - 23 November 2024 |
Abstract: | The development of next-generation sequencing technologies has resulted in exponential growth in sequence data. Global or local, pairwise or multiple sequence alignment was the foundation of early methods for sequence analysis, which was computationally intensive and time-consuming. In addition, gaps, mismatches, and insertions/deletions might provide unsatisfactory results during the alignment process, particularly for highly divergent sequences. We have used alignment-free sequence analysis to overcome this disadvantage and identify their phylogenetic relationship by using discrete wavelet transformation on the chaos game representation. First, DNA sequences were converted into 2D images to create a feature matrix. The DWT method is then applied to the resulting matrix to extract essential features and scale down their dimensionality; the resulting feature vectors are used for similarity analysis using different distance matrices. Further, we have executed our model on two benchmark datasets, Cichlid fish and Yersinia strains, from the AFproject to evaluate performance, and we have achieved top rank with an accuracy of 95% in Cichlid fish and 100% in Yersinia strains. |
Description: | Copyright belongs to proceeding publisher |
URI: | http://hdl.handle.net/2080/4786 |
Appears in Collections: | Conference Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2024_DABCon_NBanerjee_DNA.pdf | 944.66 kB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.