HealthTrust: Assessing the Trustworthiness of Healthcare Information on the Internet

Park, Meeyoung

dc.contributor.advisor	Luo, Bo
dc.contributor.advisor	Chen, Xue-wen
dc.contributor.author	Park, Meeyoung
dc.date.accessioned	2014-06-18T03:40:21Z
dc.date.available	2014-06-18T03:40:21Z
dc.date.issued	2013-12-31
dc.date.submitted	2013
dc.identifier.other	http://dissertations.umi.com/ku:13091
dc.identifier.uri	http://hdl.handle.net/1808/14202
dc.description.abstract	As well recognized, healthcare information is growing exponentially and is made more available to public. Frequent users such as medical professionals and patients are highly dependent on the web sources to get the appropriate information promptly. However, the trustworthiness of the information on the web is always questionable due to the fast and augmentative properties of the Internet. Most search engines provide relevant pages to given keywords, but the results might contain some unreliable or biased information. Consequently, a significant challenge associated with the information explosion is to ensure effective use of information. One way to improve the search results is by accurately identifying more trustworthy data. Surprisingly, although trustworthiness of sources is essential for a great number of daily users, not much work has been done for healthcare information sources by far. In this dissertation, I am proposing a new system named HealthTrust, which automatically assesses the trustworthiness of healthcare information over the Internet. In the first phase, an unsupervised clustering using graph topology, on our collection of data is employed. The goal is to identify a relatively larger and reliable set of trusted websites as a seed set without much human efforts. After that, a new ranking algorithm for structure-based assessment is adopted. The basic hypothesis is that trustworthy pages are more likely to link to trustworthy pages. In this way, the original set of positive and negative seeds will propagate over the Web graph. With the credibility-based discriminators, the global scoring is biased towards trusted websites and away from untrusted websites. Next, in the second phase, the content consistency between general healthcare-related webpages and trusted sites is evaluated using information retrieval techniques to evaluate the content-semantics of the webpage with respect to the medical topics. In addition, graph modeling is employed to generate contents-based ranking for each page based on the sentences in the seed pages. Finally, in order to integrate the two components, an iterative approach that integrates the credibility assessments from structure-based and content-based methods to give a final verdict - a HealthTrust score for each webpage is exploited. I demonstrated the first attempt to integrate structure-based and content-based approaches to automatically evaluate the credibility of online healthcare information through HealthTrust and make fundamental contributions to both information retrieval and healthcare informatics communities.
dc.format.extent	120 pages
dc.language.iso	en
dc.publisher	University of Kansas
dc.rights	This item is protected by copyright and unless otherwise specified the copyright of this thesis/dissertation is held by the author.
dc.subject	Computer science
dc.subject	Healthcare informatics
dc.subject	Hidden Markov model
dc.subject	Machine learning
dc.subject	Natural language processing
dc.subject	Topic modeling
dc.subject	Trustworthiness
dc.title	HealthTrust: Assessing the Trustworthiness of Healthcare Information on the Internet
dc.type	Dissertation
dc.contributor.cmtemember	Agah, Arvin
dc.contributor.cmtemember	Huan, Luke
dc.contributor.cmtemember	Kulkarni, Prasad
dc.contributor.cmtemember	Wang, Michael
dc.thesis.degreeDiscipline	Electrical Engineering & Computer Science
dc.thesis.degreeLevel	Ph.D.
kusw.bibid	8086457
dc.rights.accessrights	openAccess

Files in this item

Name:: Park_ku_0099D_13091_DATA_1.pdf
Size:: 1.789Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

The University of Kansas prohibits discrimination on the basis of race, color, ethnicity, religion, sex, national origin, age, ancestry, disability, status as a veteran, sexual orientation, marital status, parental status, gender identity, gender expression and genetic information in the University’s programs and activities. The following person has been designated to handle inquiries regarding the non-discrimination policies: Director of the Office of Institutional Opportunity and Access, IOA@ku.edu, 1246 W. Campus Road, Room 153A, Lawrence, KS, 66045, (785)864-6414, 711 TTY.