Show simple item record

dc.contributor.authorFeehan, Ryan
dc.contributor.authorFranklin, Meghan W.
dc.contributor.authorSlusky, Joanna S. G.
dc.date.accessioned2021-12-10T18:49:34Z
dc.date.available2021-12-10T18:49:34Z
dc.date.issued2021-06-17
dc.identifier.citationFeehan, R., Franklin, M. W., & Slusky, J. (2021). Machine learning differentiates enzymatic and non-enzymatic metals in proteins. Nature communications, 12(1), 3712. https://doi.org/10.1038/s41467-021-24070-3en_US
dc.identifier.urihttp://hdl.handle.net/1808/32277
dc.description.abstractMetalloenzymes are 40% of all enzymes and can perform all seven classes of enzyme reactions. Because of the physicochemical similarities between the active sites of metalloenzymes and inactive metal binding sites, it is challenging to differentiate between them. Yet distinguishing these two classes is critical for the identification of both native and designed enzymes. Because of similarities between catalytic and non-catalytic metal binding sites, finding physicochemical features that distinguish these two types of metal sites can indicate aspects that are critical to enzyme function. In this work, we develop the largest structural dataset of enzymatic and non-enzymatic metalloprotein sites to date. We then use a decision-tree ensemble machine learning model to classify metals bound to proteins as enzymatic or non-enzymatic with 92.2% precision and 90.1% recall. Our model scores electrostatic and pocket lining features as more important than pocket volume, despite the fact that volume is the most quantitatively different feature between enzyme and non-enzymatic sites. Finally, we find our model has overall better performance in a side-to-side comparison against other methods that differentiate enzymatic from non-enzymatic sequences. We anticipate that our model’s ability to correctly identify which metal sites are responsible for enzymatic activity could enable identification of new enzymatic mechanisms and de novo enzyme design.en_US
dc.publisherNature Researchen_US
dc.rights© The Author(s) 2021. This article is licensed under a Creative Commons Attribution 4.0 International License.en_US
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en_US
dc.subjectBiocatalysisen_US
dc.subjectMachine learningen_US
dc.titleMachine learning differentiates enzymatic and non-enzymatic metals in proteinsen_US
dc.typeArticleen_US
kusw.kuauthorFeehan, Ryan
kusw.kuauthorFranklin, Meghan W.
kusw.kuauthorSlusky, Joanna S. G.
kusw.kudepartmentCenter for Computational Biologyen_US
kusw.kudepartmentMolecular Biosciencesen_US
dc.identifier.doi10.1038/s41467-021-24070-3en_US
dc.identifier.orcidhttps://orcid.org/ 0000-0002-2435-671Xen_US
dc.identifier.orcidhttps://orcid.org/ 0000-0003-0842-6340en_US
kusw.oaversionScholarly/refereed, publisher versionen_US
kusw.oapolicyThis item meets KU Open Access policy criteria.en_US
dc.identifier.pmidPMC8211803en_US
dc.rights.accessrightsopenAccessen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

© The Author(s) 2021. This article is licensed under a Creative Commons Attribution 4.0 International License.
Except where otherwise noted, this item's license is described as: © The Author(s) 2021. This article is licensed under a Creative Commons Attribution 4.0 International License.