
(Gasper & Mbwambo, 2023). This study found out
that even under the serious fluctuation caused by the
conflict in Ukraine and the coronavirus outbreak, the
ARIMA model could still capture the potential
movement in crude oil prices. In 2024, Guzman
affirmed ARIMA’s ability to predict future corn
prices in Mexico (Guzma, 2024). This research also
emphasized the importance of forecasting future
prices of agricultural products by pointing out that
future corn prices would influence farmers' interests
and sustainable agricultural development. Bagrecha
et al. used the ARIMA model to forecast silver prices
in India. However, the results only explained 26% of
the observed silver price changes. It suggested that
the ARIMA model was too simple for silver price
prediction and that more factors should be taken into
consideration (Bagrecha et al., 2024). In 2025, Ojha
et al. utilized the SARIMA model to predict global
wheat prices (Ojha & Karki, 2025). They pointed out
its importance in helping investors and countries
make reasonable strategic decisions. However, the
study claimed that the prediction is only suitable for
short-term analysis, and many external factors are not
considered. The results may not fit other commodities
and periods.
This paper aims to improve the ability to predict
the future prices of different commodities with the
SARIMA model, hoping to expand from the price
prediction of the commodity to the prediction of the
future development trend of the industry
corresponding to that commodity, and finally explore
the changes in industries and changes in the world
economy. This study first searches the prices of the
commodities in each of the three categories
mentioned above. Based on the different
characteristics of all the commodities in each
category, this study finally chooses aluminum in
metals, wheat in agricultural products, and Brent oil
in energy as the data to be researched. This paper first
applies pre-processing procedures to the data and then
uses the SARIMA model to fit the data and make
forecasts of future data in the next 10 months. The
differences between the future data and the actual
data are compared to evaluate the ability of the
SARIMA model. Conclusions are reached based on
the results and international events during the
research period.
2 DATASETS
2.1 Data Collection and Description
This paper extracts the datasets from Kaggle, and the
original datasets are extracted from Alpha Vantage
API using Python. The dataset contains monthly
historical prices of 10 different commodities from
January 1990 to March 2023. Prices are reported in
USD per unit of measurement for each commodity.
The prices of aluminum, wheat, and Brent crude oil
were selected in the study.
Crude oil is one of the most important energies in
the world. It can represent the energy industry. This
paper selects the price of the Brent crude oil which is
a blended crude stream produced in the North Sea
region. This is because Brent crude oil is one of the
most important crude oil pricing benchmarks in the
world. It is widely used in international oil price
quotation and contract settlement, and it serves as a
marker for pricing a number of other crude streams.
Compared with OPEC and WTI prices, the price of
Brent crude oil is more comprehensive and
transparent, because it is not affected by the US
domestic factors and the political factors of the
Organization of the Petroleum Exporting Countries
(OPEC).
Aluminum is widely used in aerospace,
automotive, construction, packaging, and many other
manufacturing and industrial sectors. It can represent
the manufacturing industry. The price movement of
aluminum directly reflects the economic performance
and industry cycles. Compared to gold and silver,
aluminum is less used as a precious metal and
investment target. Therefore, its price volatility is
usually less influenced by macroeconomic
uncertainty and safe-haven demand, which makes it
more appealing data in this study.
Wheat is one of the world's leading food crops. It
is widely used in food production and is important in
global agricultural trade. It can represent the
agricultural industry. Compared with other
agricultural products, wheat has a wider application.
For example, it can be processed into bread and beer.
2.2 Data Pre-processing
Since the characteristics of the prices of different
commodities are similar, the data pre-processing is
the same. Therefore, this paper will only focus on the
overall procedure in this part.
First of all, this paper generates the time series
plots of the prices of three commodities, the results
are shown in Figures 1, 3, and 5. From the plots, it
should be decided whether the prices of the three
commodities are stationary. This paper then applies
the log transformation to the data to make them
stationary. This paper also uses STL decomposition
to detect the characteristics of the data. The results are
shown in Figures 2, 4, and 6. From the four plots
ICDSE 2025 - The International Conference on Data Science and Engineering
468