High 3.420 0.002 Significant Contribution (Null Hypothesis Rejected)
Low 0.979 0.438 No Significant Contribution (Null Hypothesis Retained)
The observed differences suggest that while
different price features may have similar impacts on
model accuracy in medium-term predictions, their
complementarity and interactions in larger models
require deeper examination through grouped removal
or grouped F-tests. For features like closing price,
which demonstrate strong standalone predictive
power, their contribution may not remain the most
significant when combined with all other information.
In contrast, high price may exhibit unique advantages
in capturing market peaks and volatility ranges,
leading to more pronounced statistical gains. Overall,
these findings highlight that in practical applications,
feature selection and evaluation should be tailored to
specific model objectives and market dynamics,
ensuring that different price groups are assessed
flexibly for their predictive importance.
5 CONCLUSIONS
This study centres on the linear regression model,
examining the performance of individual feature
groups versus multiple feature combinations in
medium-term (five-day-ahead) predictions and
evaluating the contribution of each feature group
using the grouped F-test. The results indicate that
while the closing price group performed relatively
well in single-variable models, its accuracy did not
significantly surpass that of the combined model,
suggesting that variable redundancy may reduce the
benefits of incorporating multiple features.
Meanwhile, the high price group exhibited statistical
significance in the grouped F-test, indicating its
irreplaceable value in capturing market peaks and
volatility ranges, whereas the information contained
in the closing price, opening price, and lowest price
groups may have been partially covered by other
features. These findings address the two core research
questions: first, the difference between single-group
and multi-group features in short- to medium-term
predictions is limited, with the closing price group
performing comparably to the combined model;
second, the highest price group demonstrating a
distinct contribution to the overall model, as
confirmed by the grouped F-test. It is important to
note that this study is based solely on AAPL stock and
employs linear regression, which may not fully
account for time-series autocorrelation and nonlinear
limitations. Future research could integrate additional
features (e.g., trading volume, financial indicators,
news sentiment) and extend the analysis to multiple
stocks, while also exploring nonlinear models such as
random forests or LSTM to enhance adaptability to
market fluctuations and deepen the study of stock
market prediction.
REFERENCES
Duncan, D. B. (1955). Multiple range and multiple F tests.
Biometrics, 11(1), 1-42.
Emioma, C. C., & Edeki, S. O. (2021). Stock price
prediction using machine learning on least-squares
linear regression basis. In Journal of Physics:
Conference Series (Vol. 1734, No. 1, p. 012058). IOP
Publishing.
Ghania, M. U., Awaisa, M., & Muzammula, M. (2019).
Stock market prediction using machine learning (ML)
algorithms. ADCAIJ: Advances in Distributed
Computing and Artificial Intelligence, 8(4), 97-116.
Montgomery, D. C., Peck, E. A., & Vining, G. G. (2021).
Introduction to linear regression analysis. John Wiley
& Sons.
Nikou, M., Mansourfar, G., & Bagherzadeh, J. (2019).
Stock price prediction using DEEP learning algorithm
and its comparison with machine learning algorithms.
Intelligent Systems in Accounting, Finance and
Management, 26(4), 164-174.
Pahwa, N., Khalfay, N., Soni, V., & Vora, D. (2017). Stock
prediction using machine learning a review paper.