New Methodology for Measuring Semantic Functional Similarity Based on Bidirectional Integration

Jeong, Jong Cheol

dc.contributor.advisor	Chen, Xue-wen
dc.contributor.advisor	Luo, Bo
dc.contributor.author	Jeong, Jong Cheol
dc.date.accessioned	2013-07-14T15:47:07Z
dc.date.available	2013-07-14T15:47:07Z
dc.date.issued	2013-05-31
dc.date.submitted	2013
dc.identifier.other	http://dissertations.umi.com/ku:12808
dc.identifier.uri	http://hdl.handle.net/1808/11467
dc.description.abstract	1.2 billion users in facebook, 17 million articles in Wikipedia, and 190 million tweets per day have demanded significant increase of information processing through Internet in recent years. Similarly life sciences and bioinformatics also have faced issues of processing Big data due to the explosion of publicly available genomic information resulted from the Human Genome Project (HGP) and the increasing usage of high throughput technology. HGP was completed in 2003 and resulted in identifying 20,000-25,000 genes in human DNA and determining the sequences of three billion human base pairs. The information requires huge amount of data storage and becomes difficult to process using on-hand database management tools or traditional data processing applications. This thesis introduces new method, Biological and Statistical Mean (BSM) score to calculate functional similarity between gene products (GPs) that can help to extract biologically relevant and statistically robust information from large-scale biomedical, genomic and proteomic data sources. BSM score is defined by 16 different scoring matrices derived from principles of multi-view learning in machine learning algorithm and five different databases including Gene Ontology, UniProt, SCOP, CATH, and KUPS. The proposed method also shows how diverse databases and principles in machine learning theory can be integrated into a simple scoring function, and how the simple concept can give significant impact on the studies in biomedical and human life sciences. The comprehensive evaluations and performance comparisons with other conventional methods show that BSM score clearly outperforms other methods in terms of sensitivity of clustering similarity functional groups and coverage of identifying related genes. As a part of potential applications handling large amount of diverse data sources in medical domain, this thesis introduces similarity-based drug target identification and disease networks using BSM scores. Application of BSM score is freely available through http://www.ittc.ku.edu/chenlab/goal/
dc.format.extent	273 pages
dc.language.iso	en
dc.publisher	University of Kansas
dc.rights	This item is protected by copyright and unless otherwise specified the copyright of this thesis/dissertation is held by the author.
dc.subject	Bioinformatics
dc.subject	Computer science
dc.subject	Information science
dc.subject	Disease relation
dc.subject	Drug target identification
dc.subject	Functional similarity
dc.subject	Gene ontology
dc.subject	Machine learning
dc.subject	Semantic distance
dc.title	New Methodology for Measuring Semantic Functional Similarity Based on Bidirectional Integration
dc.type	Dissertation
dc.contributor.cmtemember	Luo, Bo
dc.contributor.cmtemember	Chen, Xue-wen
dc.contributor.cmtemember	Agah, Arvin
dc.contributor.cmtemember	Grzymala-Busse, Jerzy
dc.contributor.cmtemember	Huan, Jun
dc.contributor.cmtemember	Im, Wonpil
dc.thesis.degreeDiscipline	Bioengineering
dc.thesis.degreeLevel	Ph.D.
kusw.oastatus	na
dc.identifier.orcid	https://orcid.org/0000-0002-5024-2927
kusw.oapolicy	This item does not meet KU Open Access policy criteria.
kusw.bibid	8086022
dc.rights.accessrights	openAccess

Files in this item

Name:: JEONG_ku_0099D_12808_DATA_1.pdf
Size:: 19.51Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

The University of Kansas prohibits discrimination on the basis of race, color, ethnicity, religion, sex, national origin, age, ancestry, disability, status as a veteran, sexual orientation, marital status, parental status, gender identity, gender expression and genetic information in the University’s programs and activities. The following person has been designated to handle inquiries regarding the non-discrimination policies: Director of the Office of Institutional Opportunity and Access, IOA@ku.edu, 1246 W. Campus Road, Room 153A, Lawrence, KS, 66045, (785)864-6414, 711 TTY.