Vous êtes sur la page 1sur 7

Predicting Stock Price Performance: A Neural Network Approach

Youngohc Yoon, Ph.D., George Swales, Ph.D.


Computer Information Systems Department Finance and General Business Department

Southwest Missouri State University


Springfield, Missouri 65804-0095

Abstract The Neural Network (") method has


demonstrated its capability of addressing problems with a
great deal of complexity. The neural network method
The prediction of stock price performance is a may be able to enhance an investor's forecasting ability.
difficult and complex problem. Multivariate analytical However, comparatively few applications 181 1121 1131.
techniques using both quantitative and qualitative using the neural network, have been attempted in finance.
variables have repeatedly been used to help form the basis The purpose of this paper is to apply the Neural
of investor stock price expectations and, hence, influence Network 0approach to a dynamic and complex
investment decision making. However, the performance problem in a business environment and to investigate its
of multivariate analytical techniques is often less than ability to predict stock price performance. It also
conclusive and needs to be improved to more accurately illustrates the methalology of applying this approach and
forecast stock price performance. A neural network compares its predictive power with that of multiple
method has demonstrated its capability of addressing discriminant analysis (MDA) methods.
complex problems. A neural network method may be
able to enhance an investor's forecasting ability. The
purpose of this paper is to examine the capability of a
neural neiwork method and compares its predictive power 2. Literative Review
with that of multiple discriminant analysis methods.
Discriminant analysis techniques are used to
classify a set of independent variables into two or more
1. Introduction mutually exclusive categories. It involves finding a linear
combination of independent variables that reflect large
The prediction of stock price performance involves differences in group means. The technique can be used
the interaction of many variables, making prediction very for description as well as prediction.
difficult and complex. Many analysts and investors use Multivariate analytical techniques have repeatedly
financial statement data to assist in projecting future stock been used in fiance. Applications are found in corporate
price trends. Qualitative information, while not as easily finance, banking, and investments. Crediting scoring of
interpreted, may also have an effect on investment value. loan applications to estimate the probability of consumer
Both quantitative and qualitative variables help form the or corporate default and bond rating analysis are
basis of investor stock price expectations and, hence, examples of discriminant analysis applications in the
influence investment decision making. fiance discipline.
The multivariate analytical techniques using both Perhaps the best known, seminal work in the field
quantitative and qualitative variables have repeatedly of multiple discriminant analysis (MDA) methodology
becn used in finance and investments. However, the applied to finance was conducted by Edward I. Altman
performance of multivariate analytical techniques is often [l]. Altman's use of multiple discriminant analytical
less than conclusive and needs to be improved for more techniques focused on predicting corporate bankruptcy.
accurately forecasting stock price performance. After several early attempts, he settled on a discriminant
model that contained five financial ratios that were used

156
0073-1 129/91/0000/0156$01.00 0 1991 IEEE
as independent variables. The model could reasonably As indicated above, using qualitative information to
predict corporate bankruptcy for up to two years in the supplement an investor's forecasting ability in equity
future. Subsequent development and refinement of the markets is beneficial. This type of information is often
model increased the two-year accuracy level and obtained overlooked by investors, perhaps due to its subjective, not
a 70 percent accuracy rate for up to five years in the readily inteqreted. form. While these techniques are
future [2]. A drawback of this linear approach is that it valuable, other methods using non-linear approaches may
classifies some firms as likely to go bankrupt when in further enhance forecasting ability.
actuality, they do not.
Since Alunan's original work, there have been
numemus attempts by researchers to modify and enhance 3. Research Question
his MDA model. Some of the results have been
successful, while others have not. Gentry, Newbold, and
The studies mentioned above have generally
Whitford [lo] found that adding cash-based funds flow
indicated that multiple discriminant analysis, as used in
components to Altman's model provided superior results
the finance discipline, can be a valuable tool to the
in predicting financial failure. They also concluded that
decision maker. It has also been recognized that
cash outflow components were more closely related than
qualitative information can enhance an investor's stock
cash inflow components to corporate failure.
price forecasting ability.
Meyer and fifer's method of predicting corporate
Given the above factors, can non-linear methods
bankruptcy utilized the same financial ratios as were
significantly enhance an MDA model's stock price
found in Altman's model, but added financial data from
predictive power? Specifically, how does multiple
more than one period prior to failure to determine a time
discriminant analysis compare with the neural network
trend. Collins [7] tested both models, using credit union
approach in forecasting stock price performance?
financial data and concluded that this approach adds little,
Additionally. can the use of neural network methods
if anything, to Alunan's model.
enhance the results of a recently published study which
Altman and Spivack compared the Value Line
used multiple discriminant analysis to assess the
relative financial snength system and the zeta bankruptcy
investor's ability to forecast stock price performance.
classification methods of predicting corporate bankruptcy
These questions are addressed in the paragraphs that
[3]. Their findings reveal that although significant
methodological differences do exist, a high correlation follow.
between the methods is evident and that bond systems'
scores correlate well with published bond ratings.
Pinches and Mingo [15] and others utilized the 4. Description of the Data
MDA concept to classify industrial bonds into multiple
categories, using multiple independent variables. Most of As mentioned earlier, qualitative variables can
these attempts have utilized quantitative financial provide an often neglected source of valuable information
variables to construct the model with reasonably good to the investor. Two independent research groups
predictive results. presented the results of the multiple discriminant analysis
Direct applications of the use of the MDA method which used qualitative information found in the
technique to enhance corporate performance are found in fm's annual report to the stockholder 1141 [17]. The
the literature. An example is given by La Flew study conducted by Swales serves as the basis for the
Corporation, who finding financial ruin at their doorstep, application of the neural network approach, which this
worked Altman's model backwards to turn the company paper addresses.
around [4]. The data used in this study was gathered from two
While most of the development of MDA techniques information sources frequently used by investors: The
involves the use of quantitative financial data, other Fortune 500 and Business Week's "Top 1OOO" [18] [191.
approaches using qualitative assessments have been used These soucces provide total return (dividends and stock
in finance. Forecasting how a f m ' s stock will perform price appreciation) and market valuation data,
in equity markets, using qualitative variables found in the respectively, for widely-followed companies. A stock's
fm's annual report to the stockholders, has recently been total return and market valuation are used by investors
attempted. McConnell, Haslem, and Gibson [14] and and financial analysts as performance measures.
Swales [171 have found that qualitative data can provide For this experiment, two separate sets of data were
additional information to forecast stock price gathered. From the Fortune 500 forms, five industries
performance. that offered investors the highest total returns in each year

157
were selected; the samDle consisted of 58 comDanies.
The 10 industries that were reported by Business Week to 5. Neural Network Model
have the highest market valuations provided the data for
the second set; 40 f m s were included in this sample. It
was felt that, if differences across f m s could be found 5.1 Design of the Network
among the top industries, then more pronounced
differences were likely to exist in industries further down The neural network (NN) model is structured in a
the line. four-layered network, as shown in Figure 1: an input
We classified the Fortune set of companies into two layer, two hidden layers, and an output layer. An input
groups according to their total return. Group 1 provided unit has excitatory (positive) or inhibitory (negative)
investors with the highest total returns in their respective connections to a hidden unit in the hidden layer, and a
industries; Group 2 provided the lowest returns in their hidden unit has connections to an output unit in the
industries. We classified the Business Week set of 40 output layer. Therefore, an input unit in this network
companies in each of 10 industries into two groups. structure has indirect connections to an output unit.
Again, Group 1 consisted of those f m s with the highest Input for the network was a list of nine variables:
market valuations for their industries; Group 2 consisted confidence, economic factors outside the fm's control,
of those firms with the lowest market valuations for their growth, strategic gains, new products, anticipated loss,
industries. anticipated gain, long-term optimism and short-term
For each company in the study, the president's optimism. Output was a classification of two patterns: a
letter to stockholders from the annual report for the f m whose stock price performed well and a firm whose
period immediately prior to the group selection year was stock price performed poorly. In a network, each input
studied. A qualitative content analysis technique was parameter is represented in an input unit. Therefore, the
used to classify and tally recurring themes that were network has the nine input units in the input layer and the
identified by similar word or phrases. An examination of two output units in the output layer. The number of
president's letters to stockholders identified nine hidden units necessary to accurately predict the stock
recurring themes commonly addressed in discussion of price performance was determined empirically.
the future. These themes included reference to
confidence, economic factors outside the fm's control,
growth, strategic plans, new products, anticipated losses, Figure 1:
anticipated gains, long-term optimism and short-term Four-Layered Network
optimism.
Each letter was read for content and references to
the themes noted above. The frequency and percentage
of each letter devoted to these themes was then recorded.
When the letter did not contain a specific, direct reference
to one of the themes, a subjective judgment was made by
the researcher as to which, if any, theme should be
credited with the phrase or statement. In those instances
when reference was not made to a theme(s) in the letter,
the data set reflected this finding. The frequency data set
was then used for both MDA techniques and NN methods
to predict stock price performance.
Content analysis techniques have been widely used
in the social sciences 151. Financial researchers have also
Input Layer
applied these techniques to analyze narrative components
of financial reports 161 191 [MI [171. Refer to 1141 1171 x1 x2 x3 ........ x7 x8 x9
for further details of the content analysis technique used
to analyze the president's letter to stockholders.

158
The network for the prediction of stock price performance weights in order to reduce this error. Thus, BPLA is a
uses the following nine input (X - X 9 ) and two output gmhent descent algorithm in which weights in the
,
(Y - Y 2, parameters. network are iteratively modified to minimize the overall
mean square error between desired and actual output
Input Parameters: Output Parameters:
XI: Confidence Y,: well-performing f m s values for all output units over all input patterns. The
amount the weights are to be adjusted for each input
X,: Economic Y,: poorly-perfoming
pattern is determined by the derivative of the error
factors firms function in equation 2 with respect to the weight as
X,: Growth follows.
X,: Strategic plans
X,: New products aE
X,: Anticipated loss
X,: Anticipated gains
A Wji 0~ - aw,, t31

X,: Long-term optimism


X,: Short-term optimism This derivative yields the error signal,

6j = (Dj - A i ) Aj (1 - Aj) [41


5.2 Learning Process
for an output unit, and
Once a network structure was developed, a set of
initial weights was assigned at random. Then, the Back n
Propagation ! m i n g Algorithm (BPLA) [16] was used
with the Fortune set to estimate the weights of the feed
6j = A j (I-Aj) C 6 k Wkj
k=O
forward network. In this algorithm, the input vector with
nine input values was assigned as the activation vector of for hidden units.
the input layer. The activation vector of an input layer
propagates forward to the upper layer as the product of Finally, the connection weight between the jth unit in
weights on the interconnections and the activation values. the Lth layer and the ith unit in the L-1st layer is
A sigmoid function in equation 1 was used to compute modified according to
the activation value of a unit, Aj, on the upper layer.

where Sj is defined above, 4 is the activation values of


the ith unit, and a is the learning rate, which is used to
control the speed of the training process. For further
If the upper layer is not an output layer, its activation details and discussions of BPLA, see Rumelhart [la]. In
vector propagates forward to the higher layer in a network this experiment, the initial training data consisted of 58
in the same manner. The superscript L and L-1 represents cases in the Fortune set. A small learning rate of a = 0.1
an upper and lower layer, respectively. If the upper layer was used. The experiment was conducted on a VAX
is an output layer, an activation value of each output unit 1ln50 using the C programming language.
is compared to the desired one and the e m r is measured
according to
6. Results
E = ?1 C
" (Dj-Aj)2 The performance of the trained network was tested
PI
j=O with the Business Week set which contains 40 cases. In
the fxst experiment, the effect of varying the number of
hidden units in a four-layered network was tested. The
The learning algorithm iteratively modifies the set of result of this experiment is represented in Figure 2. In a

159
four-layered network, performance improved as the Group 2. During the training phase, the MDA model
number of hidden units increased up to a certain point. provided better predictive capability for f m s in the
This supports previous findings on the importance of the lower performance category than for firms in the higher
hidden layer for the different applications [ 111 [201. The performance category. However, during the testing
best performance was achieved by a network with four phase, all models demonstrated better predictive
hidden units on the fnst hidden layer and one hidden unit capability for f m s in the higher performance category.
on the second hidden layer. Increasing the number of
hidden units beyond this point produced no further
improvement, but impaired the network’s performance. Table 1:
Performance of the MDA and Four-Layered Network
on the training and testing data.
Figure 2 Models Training Data Testing Data
Effect of varying the number of Hidden units
in a Four-Layered Network on performance Group Group Mean Group Group Mean
1 2 1 2

1MDA 72% 76% 74% 70% 60% 65%

Four-
Layer 86% %% 91% 90% 65% 77.5%
I

a This study shows that the mean success rate during


E
the testing phase for the four-layered network was 77.5
percent as compared with 65 percent for the MDA
technique. This result shows that a NN method
0 2 4 6 significantly enhanced the MDA model’s stock price
predictive power. Dutta and Shekhar [8] also reported the
The number of Hidden units better performance in rating bonds by the NN approach
than by the regression method. The higher performance
The performance of the NN model was also of the four-layered NN model indicates that this
compared with that of MDA: Table 1 summarizes the non-linear technique with hidden units in the network was
result of this experiment. The report includes the a more appropriate method to use to forecast stock price
performance of MDA and four-layered network on a performance than the multiple discriminant analysis
training data set, as well as a testing data set, to method.
demonstrate the important aspects of this study. The However, the NN approach demonstrates a
value indicates that MDA resulted in a 74 percent mean limitation. In general, MDA is useful for both
posterior membership probability for the training set: 21 descriptions, as well as predictions, since it can explain
of the 29 companies were correctly classified into Group the characteristics of each group and the significance of
1, while 22 of the 29 companies were correctly classified each input parameter. In the NN model, it is a difficult
into Group 2. However, during the testing phase, the task to analyze the characteristics of each group and the
model yielded only an overall 65 percent success rate for importance of input parameters in a NN model due to the
the testing data set: 14 and 12 of the 20 companies were hidden units employed in the network. The hidden unit is
correctly classified into Group 1 and 2, respectively. useful to extract the high-order mapping function
The four-layered network correctly classified 91 between output and input; however, it makes separating
percent of the mean training data and appropriately the contribution of each input parameter to the output
predicted 77.5 percent of the mean testing data: 18 of 20 value very difficult.
companies were correctly classified into Group 1,
whereas 13 of 20 companies were correctly classified into

160
7. Concluding Remarks
Collins, Robert A., "An Empirical Comparison
The study demonstrated that the neural network of Banhptcy Prediction Models,'' Financial
approach is capable of learning a function that maps input
Monagemenf,Summer, 1980, pp. 52-56.
to output and encoding it in the magnitudes of the
weights in the network's connection. The number of
Dum, S. and S. Shekhar. "Bond Rating: A
hidden units employed in the network contributed to its
Non-Conservative Application of Neural
viability. The increase in the number of hidden units
Networks," Proceedings of the IEEE
resulted in higher performance up to a certain point.
International Conference on Neural Networks,
However, additional hidden units beyond the point
1988, Vol 11, pp. 443450.
impaired the model's performance. Comparison of the
NN technique with the MDA approach, indicated that the Frazier, K. b., Ingram, R. W., and Tennyson,
NN approach can significantly improve the predictability
B. M., "A Methodology for the Analysis of
of stock price performance.
Narrative Accounting Disclosures," Journal of
While some limitations of this approach were
noted, it is evident that its use can improve an investor's Accounting Research, Spring, 1984, pp.
decision making capability. Further research into the 3 18-331.
application of neural network techniques. using both
quantitative and qualitative data, is suggested and Gentry, James A., Paul Newbold and David T.
Whitford, " Predicting Bankruptcy: If Cash
encouraged.
Flow's Not the Bottom Line, What Is?,"
Financial Analysts Journal,
REFERENCES September-October, 1985, pp. 47-56.

Altman, Edward I., "Financial Ratios, Gorman, R. P. and T. J. Sejnowski. "Analysis


Discriminant Analysis and the Prediction of of Hidden Units in a Layered Network Trained
Corporate Bankruptcy," The Journal of to Classify Sonar Targets," Neural Networks,
Finance, September, 1968, pp. 589-609. Vol 1, 1988, pp. 75-89.

Altman, Edward I., Robert G. Haldeman and P. Kamijo, K. and Tanigawa. T. "Stock Price
Namyanan, "Zeta Analysis: A New Model to Pattern Recognition: A Recurrent Network
Identify Bankruptcy Risk of Corporations," Approach." Proceedings of the International
Journal of Banking and Finance, June, 1977, Joint Conference on Neural Networks. 1990,
pp. 29-54. Vol 1, pp. 215-222.

Altman, Edward I., and Joseph Spivack. Kimoto, T., Asakawa, K., Yoda, M. and
"Predicting Bankruptcy: The Value LIne Takeoka, M. "Stock Market Prediction System
Relative Financial Strength System vs. the Zeta with Modular neural Networks." Proceeding
Bankruptcy Classification Approach," of the International Joint Conference on
Financial Analysts Journal, Neural Networks, 1990, Vol 1, pp. 1-6.
November-December, 1983, pp. 60-67.
McConnell, Dennis, John A. Haslem and
Ball, Michael, "Z Factor: Rescue by the Virginia R. Gibson, "The President's Letter to
Numbers," Inc., December, 1980. pp. 4548. Stockholders: A New Look." Financial
Analysts Journal, September-October, 1986,
Berlson, B. "Content Analysis in pp. 66-70.
Communication Research," New York: Hafner
Publishing Company, 1971. Pinches. George E., and K. A. Mingo, "A
Multivariate Analysis of Industrial Bond
Bowman, E. H., "Content Analysis of Annual Ratings." Journal of Finance, March, 1977,
Reports for Corporation Strategy and Risk," pp.1-8.
Interfaces,JanuaryFebruary, 19884. pp. 61-71.

161
[161 Rumelhart, D. E., G. E. Hinton, and R. J. [18] "The Fortune 500," Fortune, April 30, 1984;
Williams, "Learning Internal Representations April 29,1985; April 28,1986.
by Error Propagation," In D. E. Rumelhart and
J. L. McClelland [E&), Parallel Distributed [19] "The Top 1OOO: America's Most Valuable
Processing: Exploration in the Microstructure Companies," Business Week,April 18, 1986.
of Cognition. Cambridge: MIT Press, pp.
3 18-362. [20] Yoon, Y., R. W. Brobst, P. R. Bergstresser,
and L. L. Peterson, "A Desktop Neural
[171 Swales, George S., Jr., "Another 'Look at the Network for Dermatology Diagnosis," Journal
President's Letter to Stockholders," Financial of Neural Network Computing,Vol 1, Summer,
Analysts Journal, March-April, 1988, pp. 1989, pp. 43-52.
71-73.

162

Vous aimerez peut-être aussi