dc.contributor.author | Price, Bradley S. | |
dc.contributor.author | Sherwood, Ben | |
dc.date.accessioned | 2019-12-12T21:38:49Z | |
dc.date.available | 2019-12-12T21:38:49Z | |
dc.date.issued | 2018-07-01 | |
dc.identifier.citation | Price, B. S. and Sherwood, B. (2018) A Cluster Elastic Net for Multivariate Regression . Journal of Machine Learning Research, 18, 1-39. | en_US |
dc.identifier.uri | http://hdl.handle.net/1808/29855 | |
dc.description.abstract | We propose a method for simultaneously estimating regression coefficients and clustering response variables in a multivariate regression model, to increase prediction accuracy and give insights into the relationship between response variables. The estimates of the regression coefficients and clusters are found by using a penalized likelihood estimator, which includes a cluster fusion penalty, to shrink the difference in fitted values from responses in the same cluster, and an L1 penalty for simultaneous variable selection and estimation. We propose a two-step algorithm, that iterates between k-means clustering and solving the penalized likelihood function assuming the clusters are known, which has desirable parallel computational properties obtained by using the cluster fusion penalty. If the response variable clusters are known a priori then the algorithm reduces to just solving the penalized likelihood problem. Theoretical results are presented for the penalized least squares case, including asymptotic results allowing for p≫n. We extend our method to the setting where the responses are binomial variables. We propose a coordinate descent algorithm for the normal likelihood and a proximal gradient descent algorithm for the binomial likelihood, which can easily be extended to other generalized linear model (GLM) settings. Simulations and data examples from business operations and genomics are presented to show the merits of both the least squares and binomial methods. | en_US |
dc.publisher | Journal of Machine Learning Research | en_US |
dc.relation.isversionof | http://jmlr.org/papers/v18/17-445.html | en_US |
dc.rights | Copyright 2018 Bradley S. Price and Ben Sherwood. License: CC-BY 4.0, seehttps://creativecommons.org/licenses/by/4.0/. Attribution requirements are providedathttp://jmlr.org/papers/v18/17-445.html. | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Multivariate Regression | en_US |
dc.subject | Clustering | en_US |
dc.subject | Fusion Penalty | en_US |
dc.title | A Cluster Elastic Net for Multivariate Regression | en_US |
dc.type | Article | en_US |
kusw.kuauthor | Sherwood, Ben | |
kusw.kudepartment | Business | en_US |
kusw.oanotes | Per SHERPA/RoMEO 12/06/2019
Author's Pre-print: green tick author can archive pre-print (ie pre-refereeing)
Author's Post-print: green tick author can archive post-print (ie final draft post-refereeing)
Publisher's Version/PDF: green tick author can archive publisher's version/PDF
General Conditions: On open access repositories
Authors retain copyright
Publisher's version/PDF may be used
Publisher's version/PDF will be deposited in ACM's computing repository (CORR) automatically
Must link to publisher version
Published source must be acknowledgedMandated OA: (Awaiting information)
Notes: Publisher last contacted on 17/07/2015
All titles are open access journals | en_US |
kusw.oaversion | Scholarly/refereed, publisher version | en_US |
kusw.oapolicy | This item meets KU Open Access policy criteria. | en_US |
kusw.proid | ID158839836672 | en_US |
dc.rights.accessrights | openAccess | en_US |