
Table 3: Segment-Level Evaluation Results.
Segment Method HitRate@10 HitRate@100 HitRate@1000 Recall@10 Recall@100 Recall@1000
1 MostPop 0.1795 0.2074 0.2981 0.0768 0.0981 0.1743
1 RecentPop 0.1821 0.2097 0.3116 0.0793 0.1003 0.1858
1 DecayPop 0.1814 0.2124 0.3109 0.0785 0.1022 0.1838
1 TrendPop 0.0012 0.0065 0.0340 0.0003 0.0033 0.0194
2 MostPop 0.0567 0.0945 0.2177 0.0224 0.0464 0.1306
2 RecentPop 0.0616 0.0995 0.2350 0.0263 0.0495 0.1433
2 DecayPop 0.0599 0.1024 0.2312 0.0252 0.0512 0.1410
2 TrendPop 0.0097 0.0334 0.1422 0.0009 0.0064 0.0351
3 MostPop 0.0323 0.0838 0.2404 0.0114 0.0387 0.1292
3 RecentPop 0.0387 0.0911 0.2611 0.0160 0.0426 0.1424
3 DecayPop 0.0376 0.0936 0.2569 0.0158 0.0439 0.1404
3 TrendPop 0.0027 0.0123 0.0600 0.0006 0.0043 0.0256
4 MostPop 0.0295 0.1290 0.3835 0.0071 0.0380 0.1387
4 RecentPop 0.0376 0.1428 0.4145 0.0120 0.0435 0.1538
4 DecayPop 0.0392 0.1479 0.4069 0.0125 0.0438 0.1512
4 TrendPop 0.0018 0.0083 0.0431 0.0004 0.0034 0.0220
For Segments 2 and 3 (moderately active users),
the advantage of RecentPop and DecayPop over
MostPop becomes more evident. These users bene-
fit from models that can better capture evolving prod-
uct trends while still leveraging accumulated popular-
ity information. Nevertheless, performance is over-
all lower than in Segment 1, suggesting that medium-
activity users represent a more challenging group, as
their preferences are neither fully new nor as consis-
tent as those of highly active users.
Finally, for Segment 4 (highly active users), all
methods perform poorly, with very low HR and re-
call values. This highlights a major limitation of sim-
ple popularity-based models: they fail to serve power
users who likely expect more personalized recom-
mendations. In this group, the differences between
MostPop, RecentPop, and DecayPop are less pro-
nounced, indicating that none of the baselines are suf-
ficient to address the complexity of heavy-user behav-
ior. TrendPop, again, performs poorly across all seg-
ments, confirming its instability and lack of practical
utility.
In summary the segment-level analysis under-
scores that popularity-based methods especially those
incorporating temporal dynamics are effective for less
active users but inadequate for heavy users. This find-
ing suggests that hybrid or personalized approaches
would be necessary to achieve strong performance
across the full spectrum of user types.
4 CONCLUSIONS
This paper evaluated the effectiveness of four
popularity-based recommendation methods, Most-
Pop, RecentPop, DecayPop, and TrendPop, on a
large-scale e-commerce dataset. Our findings demon-
strate that models incorporating temporal dynamics
significantly outperform a simple long-term popu-
larity baseline. Specifically, RecentPop and Decay-
Pop consistently achieved higher Hit Rate and Re-
call scores, confirming that recent transaction data is
a more powerful predictor of future product demand.
DecayPop, by applying an exponential decay func-
tion, offered a slightly more robust balance between
long-term popularity and short-term trends, particu-
larly at larger recommendation list sizes.
In contrast, our proposed TrendPop model, de-
signed to capture emerging trends by analyzing
weekly rank changes, performed poorly across all
metrics. This suggests that the high volatility of
weekly sales data introduces significant noise, mak-
ing simple rank-based trend detection an unreliable
strategy for stable recommendations.
Perhaps the most critical insight comes from our
user segmentation analysis. We found that popularity-
based methods are highly effective for low-activity
and new users, where personalized signals are sparse.
However, their performance drastically diminishes
for highly active ”power users,” who likely expect
more tailored and diverse suggestions. This under-
scores a fundamental limitation of non-personalized
approaches: while they serve as excellent and compu-
tationally efficient baselines for the cold-start prob-
lem, they are insufficient for retaining engaged cus-
tomers. In summary, our work highlights that
recency-aware popularity models are valuable com-
ponents of a recommendation system, but they must
be complemented by personalized strategies to cater
to the full spectrum of user behavior.
Temporal Popularity-Based Recommender Systems for e-Commerce: A Comprehensive Evaluation
23