Publication Date
Spring 2026
Degree Type
Master's Project
Degree Name
Master of Science in Bioinformatics (MSBI)
Department
Computer Science
First Advisor
Dr. Philip Heller
Second Advisor
Dr. Laura Miller Conrad
Third Advisor
Dr. Wendy Lee
Keywords
Assembly, Colistin adjuvants, ArnA, CATwalk, SPAdes, BLAST
Abstract
Research into colistin adjuvants has identified interactions between a host of proteins and proteins found within certain strains of bacteria. BLAST, a widely adopted local sequence alignment tool, was used to create a database containing the genomes of relevant bacterial strains, which would then be used to query against the previously identified proteins. Many of these strains do not have publicly available assemblies, which makes database construction difficult. CATwalk, a new naïve fragment extender, is first described in this paper and used to extend fragments of interest. BLAST results first identified short fragments with sufficient similarity, which can then be extended with CATwalk for further alignment. CATwalk was ultimately unsuccessful in accurately extending the sequence of interest. SPAdes, a well-known greedy assembly algorithm, was used afterwards to create rough assemblies of relevant bacterial genomes. Compelling similarities between ArnA and an unknown protein in K. pneumoniae 0479 were then uncovered. This study provides a new avenue for ongoing research into counteracting colistin resistance in pathogenic bacteria.
Recommended Citation
Anderson, Gavin, "Searching for Cause of Unexplained Protein Interaction with Multiple Assembly Methods" (2026). Master's Projects. 1745.
https://scholarworks.sjsu.edu/etd_projects/1745