Topic specific spider for taxonomic documents
Issue Date
2007-05-31
Author
Kotcherlakota, Krishna Priya
Publisher
University of Kansas
Type
Thesis
Degree Level
M.S.
Discipline
Electrical Engineering and Computer Science
Rights
This item is protected by copyright and unless otherwise specified the copyright of this thesis/dissertation is held by the author.
Abstract
The overall goal of the project being developed at KU as part of SEEK is to provide an interface tool through which a taxonomist can obtain taxonomic revisions for the species provided to the system. This thesis, in particular, aims to develop a subsystem that collects taxonomic documents available on the World Wide Web using a combination of spidering and document classification techniques. To increase the number of documents collected, two query expansion techniques were considered and evaluated. We found that generically expanded queries performed better than taxonomically expanded queries. We also show that filtering improves the performance of the system by reducing the number of non-taxonomic documents presented to the end user.
Description
Thesis (M.S.)--University of Kansas, Electrical Engineering and Computer Science, 2007.