Please use this identifier to cite or link to this item:
http://hdl.handle.net/2080/4517
Title: | DNA Sequence Similarity for Identifying Mastitis-Causing Bacteria in Cattle |
Authors: | Banerjee, Nikita Bakshi, Sambit Sa, Pankaj Kumar |
Keywords: | CBOW architecture DNA sequence similarity Distance measures Jaccard Distance One-hot encoding. |
Issue Date: | Mar-2024 |
Citation: | International Conference on Computational Intelligence in Pattern Recognition(CIPR), Baripada, India, 15-16 March 2024 |
Abstract: | Mastitis, an inflammation of the mammary gland in cattle, poses a significant threat to dairy industry productivity and animal welfare. Identifying the specific bacterial strains responsible for causing mastitis is crucial for effective prevention and treatment strategies. In this research, we have used a popular deep-learning technique to elucidate the underlying pathogenic strains associated with mastitis in cattle. The first step involved the extraction of k-mers from DNA sequences, followed by a one-hot encoding approach applied to represent each k-mer as a binary array, considering a predefined alphabet. SVD has been used to reduce the dimension of the features on that Continuous Bag of Words (CBOW) model, which was applied to the generated context pairs to learn meaningful embeddings for each k-mer. Node embeddings are obtained for each k-mer present in the model’s vocabulary; at last, Jaccard similarity was applied to the output node embeddings to ascertain the pathogenic potential of different bacterial strains in causing mastitis. The model enabled the identification of bacteria more prone to causing mastitis in cattle, and these findings could offer valuable insights into understanding the etiology of mastitis and contribute to the development of targeted interventions and therapeutics |
Description: | Copyright belongs to proceeding publisher |
URI: | http://hdl.handle.net/2080/4517 |
Appears in Collections: | Conference Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2024_CIPR_ABanerjee_DNA.pdf | 440.77 kB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.