Predicting unemployment levels is a critical task
for economic policymakers and researchers.
However, the data used by a number of studies is
already outdated, and they didn’t involve the outliers
caused by the pandemic. Various methods have been
employed by many researchers to predict
unemployment rates, including traditional statistical
models and advanced machine-learning techniques.
Previous researchers have used various methods
to predict unemployment trends in the United States,
including traditional ARIMA models, primary
economic signals, and automatic time series modeling
techniques like Autometrics. Guerard, Thomakos,
and Kyriazi built upon earlier work by applying
Autometrics to improve models for real GDP and
unemployment, accounting for structural breaks and
outliers (Guerard, Thomakos & Kyriazi, 2020). Their
study emphasized the effectiveness of adaptive
learning forecasting and the significance of
incorporating leading indicators. However, the
effects of the COVID-19 period were not included in
their research process, which generated a huge impact
on the global unemployment rate.
Shan Zhong analyzed the U.S. real GDP and
unemployment rate data from 1948 to 2023 using
linear and nonlinear regression and ARIMA models
(Zhong, 2023). The study found that nonlinear
regression more accurately represents the relationship
between these two factors. ARIMA forecasts showed
optimistic future trends with GDP growth and low
unemployment but with wide confidence intervals.
Yurtsever proposed a hybrid model combining
LSTM and GRU deep learning techniques to forecast
unemployment rates in the U.S., U.K., France, and
Italy. Generally, the hybrid model outperformed
standalone LSTM and GRU models, except in Italy,
where GRU performed better. This study highlights
the effectiveness of combining different models to
enhance forecasting performance (Yurtsever, 2023).
Other researchers have also explored hybrid
approaches, such as combining ARIMA with
artificial intelligence methods, which have shown
promising results in reducing prediction errors
(Chakraborty et al., 2021; Ahmad et al., 2021).
Additionally, Xiao et al. revisited earlier forecasting
methodologies to explore relationships between
unemployment rates and leading economic indicators
such as data on weekly jobless claims and the U.S.
Leading Economic Indicator (LEI), demonstrating
that incorporating these variables can enhance
predictive accuracy (Xiao et al., 2022). Montgomery
et al. further emphasized that forecasting accuracy
could be improved by combining multiple time series
methods and carefully accounting for structural
breaks in historical data (Montgomery et al., 1998).
Similarly, Dritsakis and Klazoglou applied the Box-
Jenkins methodology extensively to forecast U.S.
unemployment rates, highlighting its effectiveness
but also acknowledging its limitations when
confronted with structural changes or unprecedented
economic shocks (Dritsakis & Klazoglou, 2018).
These findings collectively reinforce the necessity of
exploring diverse forecasting methodologies to better
capture complex labor market dynamics.
This study seeks to fill this research gap by
comparing three well-known time series forecasting
methods to predict unemployment trends in the
United States: ARIMA, LSTM neural networks, and
intervention approaches. By evaluating these diverse
models using the recent data from 1978 to 2023 and
considering their performance across different
economic conditions, this study seeks to point out
which model is best fitted to predict unemployment
trends and offer new perspectives on how effectively
different forecasting approaches perform in the
current financial landscape.
2 DATA AND METHOD
2.1 Data Collection and Description
The dataset used in this analysis contains the
unemployment rate for the 16-24 age group from
December 1978 to July 2023. The data was cleaned
and preprocessed by removing missing values,
converting the date column to a date format, and
arranging the data in chronological order. The dataset
provides a comprehensive view of the trends and
patterns in youth unemployment over more than 40
years.
2.2 Methods and Principles
This study employs several methodologies to analyze
and predict unemployment rates in the United States.
The primary methods are as follows.
2.2.1 Time Series Visualization
Time series visualization is a crucial step in
understanding the behavior of the data over time. This
involves graphically representing the unemployment
rate over time to identify trends, seasonal patterns,
and significant events. Visual inspection helps in
understanding the overall behavior of the data.
Firstly, this study presents a time series graph
depicting unemployment rates among younger age