
 
screen in figure 5. As shown in figure 5, we use two-
layer networks with 35 and 15 nodes respectively. 
And the network has 6 inputs and 1 output. Through 
the multiple trials of training and test, we select one 
of the models with the minimum error rate. We 
perform the investment simulation with the selected 
prediction model on the screen in figure 7. 
We generate the data as shown in table 1. The 
price prediction model calculates the prediction 
values for the stocks in the simulation data which are 
the criteria for decision making of buying stocks. 
Table 1: Data for model generation and investment 
simulation. 
  Period  # of data 
Training data  2009.4.1 ~ 2009.12.30  24,548 
Validation data  2008.4.1 ~ 2008.5.30  5,440 
Test data  2008.6.2 ~ 2008.8.29  5,235 
Simulation data  2010.2.1 ~ 2010.3.16  2,312 
 
We perform the simulation with changing the 
elements of the trading policy. Table 2 shows the 
values of the elements of the trading policy used in 
the simulation. We have 108 results from the 
simulation and some of them are presented in table 4. 
Table 2: The values of the trading policy elements for 
simulation. 
Elements  Values 
Holding period (days), H  1, 3, 5 
Buying discount rate (%), B  0 
Expected profit ratio (%), E  2, 3, 4, 5, 6, 7 
Loss cut ratio (%), L  -2, -3, -4, -5, -6, -7 
Table 3: Some results of the investment simulation. 
H, B, E, L 
# of 
profits 
# of 
loss 
Total profit 
(Won) 
1, 0, 2, -2  94  66  485,080 
3, 0, 2, -2  116  50  1,158,701 
3, 0, 5, -5  122  40  2,257,588 
5, 0, 5, -5  142  22  3,765,845 
 
We use 0.5 as the cut-off value of the prediction 
value. We have 169 transactions (one transaction 
includes both buying and selling) and assume that 
one million won is used to buy each recommended 
stock. During the simulation period (2010/2/1 ~ 
2010~3/16), KOSPI rises about 2.6% from 1606.44 
to 1648.01. The first line in table 4 means the 
followings: the holding period is one day, the BDR, 
EPR, LCR are 0%, 2%, -2% respectively. Among 
169 transactions we make profits 94 times and have 
loss 66 times. We got profits as 485,080 Korean 
Won. Table 3 shows that the results can be 
considerably different according to the different 
trading policies. As a result, we can say that the user 
select the trading policy outperforming the average 
market profits through the investment simulation. 
4 CONCLUSIONS 
In this paper, we propose the data mining tool which 
provides the three functions: stock data management, 
the stock price prediction model generation using 
machine learning techniques and the investment 
simulation. The prediction model recommends the 
stocks to buy and the investment simulation suggests 
the trading policy. Thus, the proposed tool can 
support the short-term investors’ decision-making. 
Users have only to get daily stock data from 
KRX and update the existing stock database. Once 
the prediction model is built and the proper trading 
policy is established, users can perform the objective 
decision-making based on the data rather than the 
emotional judgements. Users have only to get the 
recommended stocks through the application of the 
today’s data to the prediction model under the 
established trading policy. 
Other machine learning techniques, such as the 
support vector machines (SVM) and the genetic 
algorithms, have studied for the stock price 
prediction. We will expand the data mining tool for 
including such techniques. More technical indicators 
are required for more sophisticated prediction 
models. We will consider the asset allocation 
problem in the investment simulation, which will 
present more definite results and be more helpful. 
ACKNOWLEDGEMENTS 
This research was financially supported by Hansung 
University. 
REFERENCES 
C. F. Tsai and S. P. Wang. 2009. Stock Price Forecasting 
by Hybrid Machine Learning Techniques. Proceedings 
of the International MultiConference of Engineers and 
Computer Scientists. Vol. 1. 755-760. 
J. R. Quinlan. 1993. C4.5: Programs for Machine Learning, 
Morgan Kaufmann Publishers. 
ICAART2013-InternationalConferenceonAgentsandArtificialIntelligence
476