Skip redundant pieces

KU ScholarWorks >
School of Business >
School of Business Articles >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1808/1232
View usage statistics

Title: Using Bayesian networks for bankruptcy prediction: Some methodological issues
Authors: Sun, Lili
Shenoy, Prakash P.
Keywords: Bankruptcy prediction
Bayesian networks
Naıve Bayes
Variable selection
Discretization of continuous variables
Issue Date: 16-Jul-2007
Publisher: Elsevier
Extent: 380445 bytes
Type: Article
Citation: Lili, S. and P. P. Shenoy, "Using Bayesian networks for bankruptcy prediction: Some methodological issues," European Journal of Operational Research, Vol. 180, No. 2, 2007, pp. 738--753.
Abstract: This study provides operational guidance for building naıve Bayes Bayesian network (BN) models for bankruptcy prediction. First, we suggest a heuristic method that guides the selection of bankruptcy predictors. Based on the correlations and partial correlations among variables, the method aims at eliminating redundant and less relevant variables. A naıve Bayes model is developed using the proposed heuristic method and is found to perform well based on a 10-fold validation analysis. The developed naıve Bayes model consists of eight first-order variables, six of which are continuous. We also provide guidance on building a cascaded model by selecting second-order variables to compensate for missing values of first order variables. Second, we analyze whether the number of states into which the six continuous variables are discretized has an impact on the model’s performance. Our results show that the model’s performance is the best when the number of states for discretization is either two or three. Starting from four states, the performance starts to deteriorate, probably due to over-fitting. Finally, we experiment whether modeling continuous variables with continuous distributions instead of discretizing them can improve the model’s performance. Our finding suggests that this is not true. One possible reason is that continuous distributions tested by the study do not represent well the underlying distributions of empirical data. Finally, the results of this study could also be applicable to business decision-making contexts other than bankruptcy prediction.
URI: http://hdl.handle.net/1808/1232
ISSN: 0377-2217
Appears in Collections:School of Business Articles

Files in This Item:

File Description SizeFormat
EJOR07.pdf371KbAdobe PDFView/Open

Items in KU ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.