set (15%). The performance evaluation indexes
including: 𝑅
, Explained Variance Score (EVS) by
Model, Mean-square Error (MSE), Root Mean-
square Error (RMSE), Mean Absolute Error (MAE).
These indexes help us to evaluate the performance
and the stability of each model. These indexes will
plot in 5 different forms, for a better observation of
the model.
3 RESULTS AND DISCUSSION
For each model, after the training, the researcher plots
each model’s evaluation index sorted by performance
form best to worse. The results are shown in Figure 1
and Figure 2.
From the Figure 1 and Figure 2, Generally, the
Random Forest regressor performs the best, following
models are K-NN and DT has the worst performance.
In all the model “RF B 85a” has the best performance
in all indexes, reaching 0.72 𝑅
, and 10.16 RMSE,
Which representee stability and relatively good
performance of the model. However, most of the
models based on Decision Tree has negative index on
𝑅
and EVS, that indicate decision trees are not
suitable for this task. K-NN based models has a
relatively middle performance, but still not as good as
DT models.
The performance of ‘70d’ models are worse than
85d models shown in Figure 3. Thus, for random
distributed dataset, using lager training dataset can
surlily improve the performance of models. For all
models, the ‘85d’ performance is better than ‘85a’
training plan. Is not a good way to train twice when a
researcher is working on a random distributed data.
The second training may cause the overfitting to
valuation dataset.
Form these plots shown in Figure 4, it can be
observed that as this study used more data and diverse
data to train the model, the 𝑅
and MSE performance
will get better. However, the training method on
specialized datasets are various. For most of the
models, using ‘85d’ or ‘85a’ does not make a
significant performance difference of the model.
Using ‘85a’ may even loss the accuracy due to the
overfitting to the valuation set. But if the data for
predicting has a significant pattern corresponded to
the valuation set, the model’s performance would get
boost because of the valuation set.
Due to the limited data and time, the researcher
cannot develop auto collection programs to collect the
real-time data form Spotify through Api. The data
cleaning is not rough that some of the songs’
popularity is default value, which is 0. This noise can
make a significant effect to the accuracy of the
models. The connection between the popularity and
different factors still need to discuss.
4 CONCLUSIONS
This study tested the performance of three widely
used based models with different training methods.
And visualized the performance of these models. And
discussed the notice when applying these models into
research study.
For this regression task, the Random Forest is the
most effective and reliable over all models. When the
model is facing random or highly diverse data, they
should not valuate the models again. If the data in
prediction task has a relatively similar pattern, the
valuation training would be effective for the
specialized models.
For the further research, the researcher can test
more advanced models and collecting more mount
and updated song tracks data with scripts and api
tools. This task is just one class that make predictions
on various variables without the direct connections
with the answer. However, the conclusion of this
research can apply to many other tasks. The
improvement of accuracy can help the artists to
predict their popularity and make better songs that fit
people’s need. These models can utilize for
merchandizing to make more profit.
REFERENCES
Feng, Z. 2023. Song popularity prediction using machine
learning.
Ge, Y., Wu, J., & Sun, Y. 2020. Popularity prediction of
music based on factor extraction and model blending.
In 2020 2
nd
International Conf. on Economic
Management and Model Engineering (ICEMME) (pp.
1062-1065). IEEE.
Grandview research, 2023, Music Streaming Market Size
& Share Analysis Report, https://www.grandviewrese
arch.com/industry-analysis/music-streaming-market
Kaggle, 114000 Spotify Songshttps://www.kaggle.com/
datasets/priyamchoksi/spotify-dataset-114k-songs
Kim, J. 2021. Music popularity prediction through data
analysis of music’s characteristics. International
Journal of Science, Technology and Society, 9(5), 239-
244.
Lee, J., & Lee, J. S. 2018. Music popularity: Metrics,
characteristics, and audio-based prediction. IEEE
Transactions on Multimedia, 20(11), 3173-3182.
Martín-Gutiérrez, D., Peñaloza, G. H., Belmonte-
Hernández, A., & García, F. Á. 2020. A multimodal
end-to-end deep learning architecture for music
popularity prediction. IEEE Access, 8, 39361-39374.