Please use this identifier to cite or link to this item: http://hdl.handle.net/2080/4517
Title: DNA Sequence Similarity for Identifying Mastitis-Causing Bacteria in Cattle
Authors: Banerjee, Nikita
Bakshi, Sambit
Sa, Pankaj Kumar
Keywords: CBOW architecture
DNA sequence similarity
Distance measures
Jaccard Distance
One-hot encoding.
Issue Date: Mar-2024
Citation: International Conference on Computational Intelligence in Pattern Recognition(CIPR), Baripada, India, 15-16 March 2024
Abstract: Mastitis, an inflammation of the mammary gland in cattle, poses a significant threat to dairy industry productivity and animal welfare. Identifying the specific bacterial strains responsible for causing mastitis is crucial for effective prevention and treatment strategies. In this research, we have used a popular deep-learning technique to elucidate the underlying pathogenic strains associated with mastitis in cattle. The first step involved the extraction of k-mers from DNA sequences, followed by a one-hot encoding approach applied to represent each k-mer as a binary array, considering a predefined alphabet. SVD has been used to reduce the dimension of the features on that Continuous Bag of Words (CBOW) model, which was applied to the generated context pairs to learn meaningful embeddings for each k-mer. Node embeddings are obtained for each k-mer present in the model’s vocabulary; at last, Jaccard similarity was applied to the output node embeddings to ascertain the pathogenic potential of different bacterial strains in causing mastitis. The model enabled the identification of bacteria more prone to causing mastitis in cattle, and these findings could offer valuable insights into understanding the etiology of mastitis and contribute to the development of targeted interventions and therapeutics
Description: Copyright belongs to proceeding publisher
URI: http://hdl.handle.net/2080/4517
Appears in Collections:Conference Papers

Files in This Item:
File Description SizeFormat 
2024_CIPR_ABanerjee_DNA.pdf440.77 kBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.