DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest

DC Field | Value | Language
dc.contributor.author | Manavalan, B | -
dc.contributor.author | Shin, TH | -
dc.contributor.author | Lee, G | -
dc.date.accessioned | 2019-11-13T00:17:25Z | -
dc.date.available | 2019-11-13T00:17:25Z | -
dc.date.issued | 2018 | -
dc.identifier.uri | http://repository.ajou.ac.kr/handle/201003/16670 | -
dc.description.abstract | DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html. | -
dc.language.iso | en | -
dc.title | DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest | -
dc.type | Article | -
dc.identifier.pmid | 29416743 | -
dc.identifier.url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5788611/ | -
dc.subject.keyword | DNase I hypersensitive site | -
dc.subject.keyword | feature selection | -
dc.subject.keyword | machine learning | -
dc.subject.keyword | random forest | -
dc.subject.keyword | support vector machine | -
dc.contributor.affiliatedAuthor | Balachandran, Manavalan | -
dc.contributor.affiliatedAuthor | 이, 광 | -
dc.type.local | Journal Papers | -
dc.identifier.doi | 10.18632/oncotarget.23099 | -
dc.citation.title | Oncotarget | -
dc.citation.volume | 9 | -
dc.citation.number | 2 | -
dc.citation.date | 2018 | -
dc.citation.startPage | 1944 | -
dc.citation.endPage | 1956 | -
dc.identifier.bibliographicCitation | Oncotarget, 9(2): 1944-1956, 2018 | -
dc.identifier.eissn | 1949-2553 | -
dc.relation.journalid | J019492553 | -
Appears in Collections:
Journal Papers > School of Medicine / Graduate School of Medicine > Physiology
Files in This Item:
29416743.pdf

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
