Académique Documents
Professionnel Documents
Culture Documents
www.elsevier.com/locate/eswa
a
College of Business Administration, Sogang University, CPO Box 1142, Seoul 121-742, South Korea
College of Commerce and Economics, Dongguk University, Gyeongju, Gyeongbuk, 780-714, South Korea
Abstract
Bankruptcy prediction has drawn a lot of research interests in previous literature, and recent studies have shown that machine learning
techniques achieved better performance than traditional statistical ones. This paper applies support vector machines (SVMs) to the
bankruptcy prediction problem in an attempt to suggest a new model with better explanatory power and stability. To serve this purpose, we
use a grid-search technique using 5-fold cross-validation to find out the optimal parameter values of kernel function of SVM. In addition, to
evaluate the prediction accuracy of SVM, we compare its performance with those of multiple discriminant analysis (MDA), logistic
regression analysis (Logit), and three-layer fully connected back-propagation neural networks (BPNs). The experiment results show that
SVM outperforms the other methods.
q 2005 Published by Elsevier Ltd.
Keywords: Bankruptcy prediction; Support vector machine; Grid-search; Kernel function; Back-propagation neural networks
1. Introduction
Over the past decade, several financial crises observed in
some emerging markets enjoying financial liberalization
showed that debt financing could result in large and sudden
capital outflows, thereby causing a domestic credit crunch.
This experience made banking authorities such as Bank for
International Settlements (BIS) learn a number of lessons,
among which they all encourage commercial banks to
develop internal models to better quantify financial risks
(Basel Committee on Banking Supervision, 1999). Decision
making problems in the area of credit evaluation have been
considered very important but difficult ones for financial
institutions due to a high level of risk from wrong decisions.
In order to effectively manage the credit risk exposures of
financial institutions, there is a strong need for sophisticated
decision support systems backed by analytical tools to
measure, monitor, manage, and control financial and
* Corresponding author. Tel.: C82 54 770 2317; fax: C82 54 770 2532.
E-mail addresses: jaemin@ccs.sogang.ac.kr (J.H. Min), chanlee@
dongguk.ac.kr (Y.-C. Lee).
0957-4174/$ - see front matter q 2005 Published by Elsevier Ltd.
doi:10.1016/j.eswa.2004.12.008
604
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
2. Theoretical background
2.1. Statistical methods
In the late 1960s, discriminant analysis (DA) was
introduced to create a composite empirical indicator of
financial ratios. Using financial ratios, Beaver (1966)
developed an indicator that best differentiated between
failed and non-failed firms using univariate analysis
techniques. The univariate approach was later improved
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
i Z 1; .; N
(2)
(3)
(4)
605
subject to
(
yi wT fxi C bR 1 K xi ; i Z 1; .; N
xi R 0;
i Z 1; .; N
(5)
where xis are slack variables needed to allow misclassifications in the set of inequalities, and C 2RC is a
tuning hyperparameter, weighting the importance of
classification errors vis-a`-vis the margin width. The
solution of the primal problem is obtained after
constructing the Lagrangian. From the conditions of
optimality, one obtains a quadratic programming (QP)
problem with Lagrange multipliers ais. A multiplier ai
exists for each training data instance. Data instances
corresponding to non-zero ais are called support vectors.
On the other hand, the above primal problem can be
converted into the following dual problem with objective
function (6) and constraints (7). Since the decision variables
are support vector of Lagrange multipliers, it is easier to
interpret the results of this dual problem than those of the
primal one.
1
Max aT Qa K eT a
a 2
subject to
(
0% ai % C;
i Z 1; .; N
yT a Z 0
(6)
(7)
(8)
606
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
3. Research design
3.1. Data collection and preprocessing
The database used in this study was obtained from the
Koreas largest credit guarantee organization that has the
public policy of promoting growth of small and medium-sized
enterprises in the nation. All of the non-bankruptcy cases are
from medium-sized heavy industry firms in 2002 and the
corresponding bankruptcy cases are also from the same
industry. In general, however, the number of bankruptcy cases
is smaller than that of non-bankruptcy cases; hence, we
collected additional bankruptcy cases from the years of 2000,
2001, and 2002 in order to make the experimental data set
consist of even number of bankruptcy and non-bankruptcy
cases. The final sample of 1888 firms includes 944 bankruptcy
and 944 non-bankruptcy cases placed in random order.
The original data are scaled into the range of (K1, 1).
The goal of linear scaling is to independently normalize
each feature component to the specified range. It ensures the
larger value input attributes do not overwhelm smaller value
inputs; hence helps to reduce prediction errors (Hsu, Chang,
& Lin, 2004).
In choosing financial ratios, we apply stepwise logistic
regression analysis. Most previous studies using statistical
methods such as discriminant analysis and logistic
regression have selected independent variables through the
stepwise regression analysis. In this study, several financial
ratios are initially selected by the principal component
analysis and t-test for the graphical analysis. Using the
stepwise logistic regression, we also reduce the number of
financial variables to a manageable set of 11.
In cases of SVM, MDA, and Logit, each data set is split
into two subsets: a training set of 80% (1510) and a holdout
set of 20% (378) of the total data (1888), respectively. The
holdout data is used to test the results, which is not utilized
to develop the model. In case of BPN, each data set is split
into three subsets: a training set of 60% (1132), a validation
set of 20% (378), and a holdout set of 20% (378) of the total
data (1888) respectively, where the validation data is used to
check the results.
3.2. Analysis steps
This study is conducted according to the analysis steps in
Fig. 1.
In Step 1, we reduce the number of multi-dimensional
financial ratios to two factors through the principal
component analysis (PCA)1 and calculate factor scores
of all companies using factor loadings equal to or greater
than 0.5. In Step 2, we compare the training performance
among SVM models graphically using the factor scores.
1
For the principal component analysis, we use statistically significant
financial ratios by t-test.
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
607
Probability of default Z
1
1
Z
1 CeKZ 1 CeKw0Cw1 x1C.Cwn xn
(11)
(10)
608
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
Table 1
The result of principal component analysis
4. Empirical analysis
Variables
Factor1
Factor2
K0.058
0.541
0.529
0.374
0.444
0.342
0.327
0.544
K0.246
0.400
0.008
0.091
0.698
0.713
0.446
0.521
0.761
0.771
K0.313
0.714
0.177
0.048
K0.761
K0.770
K0.727
0.408
0.430
0.053
K0.103
K0.128
K0.103
K0.174
K0.136
K0.346
K0.214
K0.130
K0.220
K0.199
0.604
K0.210
0.304
0.431
0.448
0.541
0.568
0.519
K0.066
K0.051
K0.749
0.889
0.503
0.523
0.866
0.758
0.634
0.141
0.017
0.485
K0.279
0.066
K0.112
K0.027
0.067
K0.157
K0.178
0.266
0.157
0.451
0.557
K0.071
K0.396
K0.593
0.027
0.485
0.062
0.566
The financial ratios whose factor loading is equal to or greater than 0.5.
Fig. 2. The scatter plot of factor scores. (a) The scatter plot of training data. (b) The scatter plot of holdout data.
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
609
Fig. 4. Graphical display of the prediction performance of RBF kernel SVM. (a) Training data. (b) Holdout data.
610
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
Fig. 5. Graphical display of the prediction performance of polynomial kernel SVM. (a) Training data (dZ1). (b) Holdout data (dZ1). (c) Training data (dZ2).
(d) Holdout data (dZ2). (e) Training data (dZ3). (f) Holdout data (dZ3).
Table 2
The selected financial ratios
Variables
Formula
Variable costsOsales
Interest expensesOtotal borrowings and bonds payable
Interest expensesOsales
Sales at break-even pointOsales
Sales at break-even point: Ofixed
costs-non-operating revenue)O
(1K(variable costsOsales))
Stockholders equityOtotal assets
Cash flow from operatingOshort
term loan
Fixed assetsOstockholders equity
SalesOcapital stock
SalesOoperating assets
SalesOfixed assets
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
611
Table 3
The result of grid-search using 5-fold cross-validation
C
2K5
2K3
2K1
21
23
25
27
29
211
213
215
g
23
21
2K1
2K3
2K5
2K7
2K9
2K11
2K13
2K15
47.2185
47.2185
48.1457
62.5828
62.5828
62.5828
62.5828
62.5828
62.5828
62.5828
62.5828
47.2185
62.9801
74.3709
77.5497
77.6159
77.5497
77.5497
77.5497
77.5497
77.5497
77.5497
74.4371
79.1391
81.2583
81.0596
80.9272
79.8675
79.7351
79.7351
79.7351
79.7351
79.7351
71.9205
77.2848
79.6689
81.9868
82.9139
82.9139
82.4503
79.9338
79.3377
79.3377
79.2053
57.2848
74.4371
77.7483
79.2715
81.1921
82.5828
82.8477
82.7152
82.1192
81.2583
79.9338
47.2185
67.8808
75.3642
78.1457
78.8079
80.7285
81.9205
82.6490
83.1126
82.7152
82.3179
47.2185
47.2185
68.1457
75.6954
78.0795
78.4106
79.7351
80.6623
81.8543
82.5828
82.8477
47.2185
47.2185
47.2185
68.0795
75.5629
77.8808
78.0795
79.6026
80.1325
80.4636
81.3245
47.2185
47.2185
47.2185
47.2185
67.8808
75.5629
77.6159
78.0795
79.3377
79.8013
79.6026
47.2185
47.2185
47.2185
47.2185
47.2185
67.7483
75.5629
77.5497
78.0132
79.4702
79.8013
612
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
Table 4
The performance of SVM models
Model (kernel)
Linear
RBF
Polynomiala
211
211
211
N/A
2K7
2K7
Sigmoida
211
2K7
Table 7
The best prediction accuracy of SVM, BPN, MDA, and Logit (hit ratio: %)
d
N/A
N/A
1
2
3
N/A
Holdout
data
Training data
Holdout data
80.2649
88.0132
80.3311
86.6225
88.4768
72.6490
77.2487
83.0688
77.2487
83.8624
82.0106
71.1640
5. Conclusions
Parameter r is set to 1.
Table 5
The performance of BPN
Learning epoch
Hidden nodes
100
8
12
16
24
32
8
12
16
24
32
8
12
16
24
32
200
300
Holdout data
83.1272
82.9505
81.5371
81.0954
83.5689
84.8057
86.7491
82.6855
85.1590
85.2474
84.8940
86.1307
87.0141
85.2474
88.1625
80.6878
80.4233
80.1587
79.8942
81.2169
80.9524
81.2169
81.2169
81.2169
81.7460
82.0106
81.4815
80.6878
82.5397
81.4815
MDA
Logit
BNN
MDA
Logit
88.0132
83.0688
85.2474
82.5397
78.8079
79.1391
79.8676
78.3069
Table 8
McNemar values (p-values) for the pairwise comparison of performance
BPN
Table 6
The performance of MDA and Logit
Model
SVM
Holdout data
78.8079
79.8676
79.1391
78.3069
SVM
BPN
MDA
1.750 (0.186)
*P!0.1, **P!0.05.
b
P-value.
a
McNemar statistic.
MDA
Logit
4.470 (0.035)**
0.356 (0.551)
2.857 (0.091)*
0.103 (0.749)
0.595 (0.440)
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
References
Altman, E. I. (1968). Financial ratios, discriminant analysis and the
prediction of corporate bankruptcy. The Journal of Finance, 23(4),
589609.
Altman, E. I., Marco, G., & Varetto, F. (1994). Corporate distress diagnosis
comparisons using linear discriminant analysis and neural networks.
Journal of Banking and Finance, 18(3), 505529.
Barniv, R., Agarwal, A., & Leach, R. (1997). Predicting the outcome
following bankruptcy filing: A three-state classification using neural
networks. International Journal of Intelligent Systems in Accounting,
Finance and Management, 6(3), 177194.
Basel Committee on Banking Supervision (1999). Credit Risk Modeling:
Current Practices and Applications. Basel Committee Publications.
Beaver, W. (1966). Financial ratios as predictors of failure. Journal of
Accounting Research, 5, 71111.
Bell, T., Ribar, G., Verchio, J. (1990). Neural nets vs. logistic regression: A
comparison of each models ability to predict commercial bank failures.
In Proceedings of the 1990 Deloitte and Touche/University of Kansas
Symposium on Auditing Problems (pp. 2958).
Bryant, S. M. (1997). A case-based reasoning approach to bankruptcy
prediction modeling. International Journal of Intelligent Systems in
Accounting, Finance and Management, 6(3), 195214.
Butta, P. (1994). Mining for financial knowledge with CBR. AI Expert,
9(2), 3441.
Chang, C.-C., Lin, C.-J. (2004). LIBSVM: A library for support vector
machines. Technical Report, Department of Computer Science and
Information Engineering, National Taiwan University. Available at
http://www.csie.ntu.edu.tw/~cjlin/papers/libsvm.pdf.
Chen, M.-C., & Huang, S.-H. (2003). Credit scoring and rejected instances
reassigning through evolutionary computation techniques. Expert
Systems with Applications, 24, 433441.
Cielen, A., & Vanhoof, K. (1999). Bankruptcy prediction using a data
envelopment analysis Manuscript. Diebenpeek: Limburg University.
Coakley, J. R., & Brown, C. E. (2000). Artificial neural networks in
accounting and finance: Modeling issues. International Journal of
Intelligent Systems in Accounting, Finance and Management, 9(2),
119144.
Coates, P., & Fant, L. (1993). Recognizing financial distress patterns using
a neural network tool. Financial Management, 22(3), 142155.
Cristianini, N., & Shawe-Taylor, J. (2000). An introduction to support
vector machines. Cambridge, England: Cambridge University Press.
Curram, S. P., & Mingers, J. (1994). Neural networks, decision tree
induction and discriminant analysis: An empirical comparison. Journal
of Operational Research Society, 45(4), 440450.
Davis, R. H., Edelman, D. B., & Gammerman, A. J. (1992). Machine
learning algorithms for credit-card applications. IMA Journal of
Mathematics Applied in Business and Industry, 4, 4351.
Desai, V. S., Conway, J. N., & Overstreet, G. A. (1997). Credit scoring
models in the credit union environment using neural networks and
genetic algorithms. IMA Journal of Mathematics Applied in Business
and Industry, 8, 324346.
Desai, V. S., Crook, J. N., & Overstreet, G. A. (1996). A comparison of
neural networks and linear scoring models in the credit union
environment. European Journal of Operational Research, 95, 2437.
Diakoulaki, D., Mavrotas, G., & Papayannakis, L. (1992). A multicriteria
approach for evaluating the performance of industrial firms. Omega,
20(4), 467474.
Dimitras, A. I., Zanakis, S. H., & Zopounidis, C. (1996). A survey
of business failure with an emphasis on prediction methods and
industrial applications. European Journal of Operational Research, 90,
487513.
Drucker, H., Wu, D., & Vapnik, V. N. (1999). Support vector machines for
spam categorization. IEEE Transactions Neural Networks, 10(5),
10481054.
613
614
J.H. Min, Y.-C. Lee / Expert Systems with Applications 28 (2005) 603614
Smith, M. (1993). Neural networks for statistical modeling. New York: Van
Nostrand Reinhold.
Srinivasan, V., & Ruparel, B. (1990). CGX: An expert support system
for credit granting. European Journal of Operational Research, 45,
293308.
Srinivasan, V., & Kim, Y. H. (1988). Designing expert financial systems: A
case study of corporate credit management. Financial Management, 5,
3243.
Tam, K. Y., & Kiang, M. Y. (1992). Managerial applications of neural
networks: The case of bank failure predictions. Management Science,
38(7), 926947.
Tay, F. E. H., & Cao, L. (2001). Application of support vector machines in
financial time series forecasting. Omega, 29, 309317.
Vapnik, V. (1998). Statistical learning theory. New York: Springer.
Viaene, S., Derrig, R. A., Baesens, B., & Dedene, G. (2002). A comparison
of state-of-the-art classification techniques for expert automobile
insurance claim fraud detection. The Journal of Risk and Insurance,
69(3), 373421.
Weigend, A. (1994). On overfitting and the effective number of hidden
units. In Proceedings of the 1993 connectionist models summer school
(pp. 335342).
West, D. (2000). Neural network credit scoring models. Computers and
Operations Research, 27, 11311152.
Wilson, R., & Sharda, R. (1994). Bankruptcy prediction using neural
networks. Decision Support Systems, 11, 545557.
Zhang, G. P. (2000). Neural networks for classification: A survey. IEEE
Transactions Systems, Man, and Cybernetics-Part C: Applications and
Reviews, 30(4), 451462.
Zhang, G. P., Hu, M. Y., Patuwo, B. E., & Indro, D. C. (1999). Artificial
neural networks in bankruptcy prediction: General framework and
cross-validation analysis. European Journal of Operational Research,
116, 1632.