Show simple item record

dc.contributor.authorSmalter Hall, Aaron
dc.contributor.authorHuan, Luke
dc.contributor.authorJia, Yi
dc.contributor.authorLushington, Gerald H.
dc.date.accessioned2017-04-03T21:08:25Z
dc.date.available2017-04-03T21:08:25Z
dc.date.issued2010-04
dc.identifier.citationSmalter, Aaron, Jun Huan, Yi Jia, and Gerald Lushington. "GPD: A Graph Pattern Diffusion Kernel for Accurate Graph Classification with Applications in Cheminformatics." IEEE/ACM Transactions on Computational Biology and Bioinformatics 7.2 (2010): 197-207.en_US
dc.identifier.urihttp://hdl.handle.net/1808/23569
dc.description.abstractGraph data mining is an active research area. Graphs are general modeling tools to organize information from heterogeneous sources and have been applied in many scientific, engineering, and business fields. With the fast accumulation of graph data, building highly accurate predictive models for graph data emerges as a new challenge that has not been fully explored in the data mining community. In this paper, we demonstrate a novel technique called graph pattern diffusion (GPD) kernel. Our idea is to leverage existing frequent pattern discovery methods and to explore the application of kernel classifier (e.g., support vector machine) in building highly accurate graph classification. In our method, we first identify all frequent patterns from a graph database. We then map subgraphs to graphs in the graph database and use a process we call “pattern diffusion” to label nodes in the graphs. Finally, we designed a graph alignment algorithm to compute the inner product of two graphs. We have tested our algorithm using a number of chemical structure data. The experimental results demonstrate that our method is significantly better than competing methods such as those kernel functions based on paths, cycles, and subgraphs.en_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.relation.isversionofhttp://ieeexplore.ieee.org/document/5313793/en_US
dc.rights(c) 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.en_US
dc.subjectGraph classificationen_US
dc.subjectGraph alignmenten_US
dc.subjectFrequent subgraph miningen_US
dc.titleGPD: A Graph Pattern Diffusion Kernel for Accurate Graph Classification with Applications in Cheminformaticsen_US
dc.typeArticleen_US
kusw.kuauthorHuan, Luke
kusw.kudepartmentElectrical Engineering and Computer Scienceen_US
kusw.oanotesPer SHERPA/RoMEO 4/3/2017: Author's Pre-print: green tick author can archive pre-print (ie pre-refereeing) Author's Post-print: green tick author can archive post-print (ie final draft post-refereeing) Publisher's Version/PDF: cross author cannot archive publisher's version/PDF General Conditions: Author's pre-print on Author's personal website, employers website or publicly accessible server Author's post-print on Author's server or Institutional server Author's pre-print must be removed upon publication of final version and replaced with either full citation to IEEE work with a Digital Object Identifier or link to article abstract in IEEE Xplore or replaced with Authors post-print Author's pre-print must be accompanied with set-phrase, once submitted to IEEE for publication ("This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible") Author's pre-print must be accompanied with set-phrase, when accepted by IEEE for publication ("(c) 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.") IEEE must be informed as to the electronic address of the pre-print If funding rules apply authors may post Author's post-print version in funder's designated repository Author's Post-print - Publisher copyright and source must be acknowledged with citation (see above set statement) Author's Post-print - Must link to publisher version with DOI Publisher's version/PDF cannot be used Publisher copyright and source must be acknowledgeden_US
dc.identifier.doi10.1109/TCBB.2009.80en_US
kusw.oaversionScholarly/refereed, author accepted manuscripten_US
kusw.oapolicyThis item meets KU Open Access policy criteria.en_US
dc.rights.accessrightsopenAccess


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record