AJOU Open Repository: SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information

BROWSE

Cited 0 times in Scipus Cited Count

SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information

DC Field	Value	Language
dc.contributor.author	Malik, A	-
dc.contributor.author	Subramaniyam, S	-
dc.contributor.author	Kim, CB	-
dc.contributor.author	Manavalan, B	-
dc.date.accessioned	2023-02-13T06:22:54Z	-
dc.date.available	2023-02-13T06:22:54Z	-
dc.date.issued	2021	-
dc.identifier.uri	http://repository.ajou.ac.kr/handle/201003/24441	-
dc.description.abstract	Sortase enzymes are cysteine transpeptidases that embellish the surface of Gram-positive bacteria with various proteins thereby allowing these microorganisms to interact with their neighboring environment. It is known that several of their substrates can cause pathological implications, so researchers have focused on the development of sortase inhibitors. Currently, six different classes of sortases (A-F) are recognized. However, with the extensive application of bacterial genome sequencing projects, the number of potential sortases in the public databases has exploded, presenting considerable challenges in annotating these sequences. It is very laborious and time-consuming to characterize these sortase classes experimentally. Therefore, this study developed the first machine-learning-based two-layer predictor called SortPred, where the first layer predicts the sortase from the given sequence and the second layer predicts their class from the predicted sortase. To develop SortPred, we constructed an original benchmarking dataset and investigated 31 feature descriptors, primarily on five feature encoding algorithms. Afterward, each of these descriptors were trained using a random forest classifier and their robustness was evaluated with an independent dataset. Finally, we selected the final model independently for both layers depending on the performance consistency between cross-validation and independent evaluation. SortPred is expected to be an effective tool for identifying bacterial sortases, which in turn may aid in designing sortase inhibitors and exploring their functions. The SortPred webserver and a standalone version are freely accessible at: https://procarb.org/sortpred.	-
dc.language.iso	en	-
dc.title	SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information	-
dc.type	Article	-
dc.identifier.pmid	34976319	-
dc.identifier.url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8703055	-
dc.subject.keyword	Bioinformatics	-
dc.subject.keyword	Cysteine transpeptidase	-
dc.subject.keyword	Hybrid features	-
dc.subject.keyword	Machine learning	-
dc.subject.keyword	Random forest	-
dc.subject.keyword	Sortase	-
dc.contributor.affiliatedAuthor	Manavalan, B	-
dc.type.local	Journal Papers	-
dc.identifier.doi	10.1016/j.csbj.2021.12.014	-
dc.citation.title	Computational and structural biotechnology journal	-
dc.citation.volume	20	-
dc.citation.date	2021	-
dc.citation.startPage	165	-
dc.citation.endPage	174	-
dc.identifier.bibliographicCitation	Computational and structural biotechnology journal, 20. : 165-174, 2021	-
dc.identifier.eissn	2001-0370	-
dc.relation.journalid	J020010370	-

Appears in Collections:: Journal Papers > School of Medicine / Graduate School of Medicine > Physiology

Files in This Item:: 34976319.pdf Download

Show simple item record

qrcode

트윗하기

License

Ajou University Medical Information & Media Center 164 Worldcup-ro Yeongtong-gu Suwon 16499 Korea / TEL : 031-219-5312
Copyright (c) Ajou University Medical Information & Media Center All Rights Reserved.
AJOU Open Repository는 국립중앙도서관 OAK 보급사업으로 구축되었습니다.

BROWSE

Browse