Please use this identifier to cite or link to this item: http://hdl.handle.net/2080/4767
Title: Comparative Analysis of Feature Representation in DNA Sequence Similarity Using Alignment-Free Approaches
Authors: Banerjee, Nikita
Bhat, Udipi Sriram
Sa, Pankaj Kumar
Keywords: Bioinformatics
DNA sequence similarity
k-mer analysis
Feature Representation
Similarity measure
phylogenetic trees
Issue Date: Nov-2024
Citation: 6th International Conference on Communication and Intelligent Systems (ICCIS 2024), MANIT Bhopal, 08-09 November 2024
Abstract: Numerous methodologies have been proposed for DNA sequence similarity analysis, both alignment-based (AB) and alignment-free (AF). Although AB methods give accurate results, AF methods have, over the last few decades, come close to matching their effectiveness while overcoming many of their limitations. Our research reveals which methods perform better on their own and which perform better in combination with other methods. The feature representation must not lose sequence information. Different representation methods are highlighted, and a second step of feature generation is applied when a representation does not provide the features required for sequence comparison. We find that methods involving k-mer analysis achieve higher accuracy than other approaches. k-mer analysis combined with matrix reduction yielded the highest accuracy, maintaining the lowest RF score among the different techniques.
Description: Copyright belongs to the proceeding publisher
URI: http://hdl.handle.net/2080/4767
Appears in Collections: Conference Papers
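
Illustration of the k-mer analysis mentioned in the abstract above: a minimal sketch, not the authors' implementation. It assumes a plain k-mer frequency vector as the feature representation and cosine distance as the similarity measure; the function names, the choice of k = 3, and the toy sequences are hypothetical, and the matrix reduction step described in the abstract is not shown.

```python
from collections import Counter
from itertools import product
import math


def kmer_vector(seq, k=3):
    """Return normalized k-mer frequencies over the alphabet ACGT.

    Assumption: windows containing any other symbol (e.g. N) are ignored.
    """
    seq = seq.upper()
    # Count every overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Fixed ordering of all 4**k possible k-mers, so vectors are comparable.
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


def cosine_distance(u, v):
    """Return 1 - cosine similarity; 1.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0
    return 1.0 - dot / (norm_u * norm_v)


if __name__ == "__main__":
    # Hypothetical toy sequences, for illustration only.
    a = kmer_vector("ACGTACGTACGGTA")
    b = kmer_vector("ACGTACGAACGGTT")
    print(cosine_distance(a, b))
```

A matrix of such pairwise distances is the usual input to a tree-building method, which is how an alignment-free comparison can lead to the phylogenetic trees listed in the keywords.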

Files in This Item:
File: 2024_ICCIS_NBanerjee_Comparative.pdf
Size: 873.49 kB
Format: Adobe PDF

