dc.contributor.advisor | Chen, Xue-wen | |
dc.contributor.author | Liu, Mei | |
dc.date.accessioned | 2009-08-31T02:24:47Z | |
dc.date.available | 2009-08-31T02:24:47Z | |
dc.date.issued | 2009-06-30 | |
dc.date.submitted | 2009 | |
dc.identifier.other | http://dissertations.umi.com/ku:10463 | |
dc.identifier.uri | http://hdl.handle.net/1808/5449 | |
dc.description.abstract | To fully understand the underlying mechanisms of living cells, it is essential to delineate the intricate interactions between the cell proteins at a genome scale. Insights into the protein functions will enrich our understanding in human diseases and contribute to future drug developments. My dissertation focuses on the development and optimization of machine learning algorithms to study protein-protein interactions and protein function annotations through discovery of domain-domain interactions. First of all, I developed a novel domain-based random decision forest framework (RDFF) that explored all possible domain module pairs in mediating protein interactions. RDFF achieved higher sensitivity (79.78%) and specificity (64.38%) in interaction predictions of S. cerevisiae proteins compared to the popular Maximum Likelihood Estimation (MLE) approach. RDFF can also infer interactions for both single-domain pairs and domain module pairs. Secondly, I proposed cross-species interacting domain patterns (CSIDOP) approach that not only increased fidelity of existing functional annotations, but also proposed novel annotations for unknown proteins. CSIDOP accurately determined functions for 95.42% of proteins in H. sapiens using 2,972 GO `molecular function' terms. In contrast, most existing methods can only achieve accuracies of 50% to 75% using much smaller number of categories. Additionally, we were able to assign novel annotations to 181 unknown H. sapiens proteins. Finally, I implemented a web-based system, called PINFUN, which enables users to make online protein-protein interaction and protein function predictions based on a large-scale collection of known and putative domain interactions. | |
dc.format.extent | 143 pages | |
dc.language.iso | EN | |
dc.publisher | University of Kansas | |
dc.rights | This item is protected by copyright and unless otherwise specified the copyright of this thesis/dissertation is held by the author. | |
dc.subject | Computer science | |
dc.subject | Biology | |
dc.subject | Bioinformatics | |
dc.subject | Data mining | |
dc.subject | Domain interaction network | |
dc.subject | Machine learning | |
dc.subject | Protein function | |
dc.subject | Protein interaction network | |
dc.title | Discovering Domain-Domain Interactions toward Genome-Wide Protein Interaction and Function Predictions | |
dc.type | Dissertation | |
dc.contributor.cmtemember | Agah, Arvin | |
dc.contributor.cmtemember | Grzymala-Busse, Jerzy | |
dc.contributor.cmtemember | Huan, Jun | |
dc.contributor.cmtemember | Ward, Robert E | |
dc.thesis.degreeDiscipline | Electrical Engineering & Computer Science | |
dc.thesis.degreeLevel | Ph.D. | |
kusw.oastatus | na | |
dc.identifier.orcid | https://orcid.org/0000-0002-8036-2110 | |
kusw.oapolicy | This item does not meet KU Open Access policy criteria. | |
kusw.bibid | 6857491 | |
dc.rights.accessrights | openAccess | |