Document similarity based on concept tree distance

Lakkaraju, Praveen

dc.contributor.author	Lakkaraju, Praveen
dc.date.accessioned	2021-10-08T19:14:28Z
dc.date.available	2021-10-08T19:14:28Z
dc.date.issued	2007-05-31
dc.identifier.uri	http://hdl.handle.net/1808/32041
dc.description	Thesis (M.S.)--University of Kansas, Electrical Engineering and Computer Science, 2007.	en_US
dc.description.abstract	The Web is fast moving from an era of search engines to an era of discovery engines. Discovery engines help you find things that you never knew existed or did not know how to ask for. One way to do this is to automatically compute and display objects that are similar to the object in which the user is currently expressing interest. In this paper, we present a new approach to computing interdocument similarity that is based on a tree-matching algorithm. We represent each document as a concept tree using the concept associations obtained from a classifier. We then use a tree-matching measure, the tree edit distance, to compute similarities between these concept trees. Experiments on a subset of documents from the CiteSeer collection showed that our algorithm outperformed document similarity based on the traditional vector space model.	en_US
dc.publisher	University of Kansas	en_US
dc.rights	This item is protected by copyright and unless otherwise specified the copyright of this thesis/dissertation is held by the author.	en_US
dc.subject	Applied sciences	en_US
dc.title	Document similarity based on concept tree distance	en_US
dc.type	Thesis	en_US
dc.thesis.degreeDiscipline	Electrical Engineering and Computer Science
dc.thesis.degreeLevel	M.S.
kusw.bibid	5349267
dc.rights.accessrights	openAccess	en_US

Files in this item

Name: Lakkaraju_Praveen_2007_5349267.pdf
Size: 363.3Kb
Format: PDF
