loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Kavach Dheer ; Peter Corcoran and Josephine Griffith

Affiliation: University of Galway, Ireland

Keyword(s): Large Language Models, Recommender Systems, Next-Item Prediction, Model Benchmarking, Data Leakage Analysis.

Abstract: Large language models (LLMs) are rapidly being integrated into recommender systems. New LLMs are released frequently, offering numerous architectures that share identical parameter sizes within their class, giving practitioners many options to choose from. While existing benchmarks evaluate LLM-powered recommender systems on various tasks, none have examined how same-sized LLMs perform under identical experimental conditions as a recommender system. Additionally, these benchmarks do not verify whether the evaluation datasets were part of the LLMs pre-training data. This research evaluates five open-source 7–8B parameter models (Gemma, Deepseek, Qwen, Llama-3.1, and Mistral) using a fixed A-LLMRec architecture for next-item prediction using the Amazon Luxury-Beauty Dataset. We measure top-1 accuracy (Hit@1) and evaluate dataset leakage through reference-model membership-inference attacks to ensure no model gains advantages from pre-training exposure. Although all models show negligi ble dataset leakage rates $(<0.2\%)$, Hit@1 varies dramatically across 20 percentage points, from 44\% for Gemma to 64\% for Mistral, despite identical parameter counts and evaluation conditions. These findings demonstrate that selecting among the most appropriate LLMs is a crucial design decision in LLM-based recommender systems. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.108

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Dheer, K., Corcoran, P. and Griffith, J. (2025). Beyond Parameter Counts: Benchmarking Similar-Sized Large Language Models for Next-Item Recommendation. In Proceedings of the 17th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR; ISBN ; ISSN 2184-3228, SciTePress, pages 364-371. DOI: 10.5220/0013736800004000

@conference{kdir25,
author={Kavach Dheer and Peter Corcoran and Josephine Griffith},
title={Beyond Parameter Counts: Benchmarking Similar-Sized Large Language Models for Next-Item Recommendation},
booktitle={Proceedings of the 17th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR},
year={2025},
pages={364-371},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013736800004000},
isbn={},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 17th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
TI - Beyond Parameter Counts: Benchmarking Similar-Sized Large Language Models for Next-Item Recommendation
SN -
IS - 2184-3228
AU - Dheer, K.
AU - Corcoran, P.
AU - Griffith, J.
PY - 2025
SP - 364
EP - 371
DO - 10.5220/0013736800004000
PB - SciTePress