Show simple item record

dc.contributor.advisorChen, Xue-wen
dc.contributor.authorLiu, Mei
dc.date.accessioned2009-08-31T02:24:47Z
dc.date.available2009-08-31T02:24:47Z
dc.date.issued2009-06-30
dc.date.submitted2009
dc.identifier.otherhttp://dissertations.umi.com/ku:10463
dc.identifier.urihttp://hdl.handle.net/1808/5449
dc.description.abstractTo fully understand the underlying mechanisms of living cells, it is essential to delineate the intricate interactions between the cell proteins at a genome scale. Insights into the protein functions will enrich our understanding in human diseases and contribute to future drug developments. My dissertation focuses on the development and optimization of machine learning algorithms to study protein-protein interactions and protein function annotations through discovery of domain-domain interactions. First of all, I developed a novel domain-based random decision forest framework (RDFF) that explored all possible domain module pairs in mediating protein interactions. RDFF achieved higher sensitivity (79.78%) and specificity (64.38%) in interaction predictions of S. cerevisiae proteins compared to the popular Maximum Likelihood Estimation (MLE) approach. RDFF can also infer interactions for both single-domain pairs and domain module pairs. Secondly, I proposed cross-species interacting domain patterns (CSIDOP) approach that not only increased fidelity of existing functional annotations, but also proposed novel annotations for unknown proteins. CSIDOP accurately determined functions for 95.42% of proteins in H. sapiens using 2,972 GO `molecular function' terms. In contrast, most existing methods can only achieve accuracies of 50% to 75% using much smaller number of categories. Additionally, we were able to assign novel annotations to 181 unknown H. sapiens proteins. Finally, I implemented a web-based system, called PINFUN, which enables users to make online protein-protein interaction and protein function predictions based on a large-scale collection of known and putative domain interactions.
dc.format.extent143 pages
dc.language.isoEN
dc.publisherUniversity of Kansas
dc.rightsThis item is protected by copyright and unless otherwise specified the copyright of this thesis/dissertation is held by the author.
dc.subjectComputer science
dc.subjectBiology
dc.subjectBioinformatics
dc.subjectData mining
dc.subjectDomain interaction network
dc.subjectMachine learning
dc.subjectProtein function
dc.subjectProtein interaction network
dc.titleDiscovering Domain-Domain Interactions toward Genome-Wide Protein Interaction and Function Predictions
dc.typeDissertation
dc.contributor.cmtememberAgah, Arvin
dc.contributor.cmtememberGrzymala-Busse, Jerzy
dc.contributor.cmtememberHuan, Jun
dc.contributor.cmtememberWard, Robert E
dc.thesis.degreeDisciplineElectrical Engineering & Computer Science
dc.thesis.degreeLevelPh.D.
kusw.oastatusna
dc.identifier.orcidhttps://orcid.org/0000-0002-8036-2110
kusw.oapolicyThis item does not meet KU Open Access policy criteria.
kusw.bibid6857491
dc.rights.accessrightsopenAccess


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record