
pages 8488–8505, Toronto, Canada. Association for
Computational Linguistics.
Iso, H., Morishita, T., Higashinaka, R., and Minami, Y.
(2021). Unsupervised opinion summarization with
tree-structured topic guidance. Transactions of the As-
sociation for Computational Linguistics, 9:945–961.
Iso, H., Morishita, T., Higashinaka, R., and Minami, Y.
(2022). Comparative opinion summarization. In Find-
ings of the Association for Computational Linguistics:
ACL 2022, pages 3307–3318, Dublin, Ireland. Asso-
ciation for Computational Linguistics.
Kry
´
sci
´
nski, W., McCann, B., Xiong, C., and Socher, R.
(2020). Evaluating the factual consistency of abstrac-
tive text summarization. In Proceedings of the 2020
Conference on Empirical Methods in Natural Lan-
guage Processing (EMNLP), pages 9332–9346, On-
line. Association for Computational Linguistics.
Li, K., Zhang, S., Ge, T., Wang, Y., Zhang, Z., Wang, A.,
Wang, J., Wang, G., Feng, Y., and Wang, W. (2023).
CONNER: A COmpreheNsive kNowledge Evaluation
fRamework for assessing large language models. In
Findings of the Association for Computational Lin-
guistics: EMNLP 2023, pages 5827–5842, Singapore.
Association for Computational Linguistics.
Lin, C.-Y. (2004). ROUGE: A package for automatic evalu-
ation of summaries. In Text Summarization Branches
Out, pages 74–81, Barcelona, Spain. Association for
Computational Linguistics.
Pecar, S. (2018). Towards opinion summarization of cus-
tomer reviews. In Proceedings of ACL 2018, Student
Research Workshop, pages 1–8, Melbourne, Australia.
Association for Computational Linguistics.
Stiennon, N., Ouyang, L., Wu, J., Ziegler, D. M., Lowe, R.,
Voss, C., Radford, A., Amodei, D., and Christiano,
P. F. (2020). Learning to summarize with human feed-
back. In Advances in Neural Information Processing
Systems, volume 33, pages 3008–3021.
Team, T. L. . C. N. (2025). Trendyol llm 8b t1.
Titov, I. and McDonald, R. (2008). Modeling online re-
views with multi-grain topic models. In Proceedings
of the 17th International Conference on World Wide
Web, pages 111–120. ACM.
APPENDIX A.1 SUMMARY
GENERATION PROMPT
Your primary task is to generate a concise,
persuasive, and objective summary in
Turkish, synthesized from the provided
customer reviews. You are to follow a
structured methodology, beginning with a
sentiment-balanced analysis of the reviews
to extract key product-centric features.
Prioritize the inclusion of positive
attributes while integrating a necessary
amount of constructive negative feedback
to ensure an objective overview; if no
substantive negative feedback is present,
construct the summary solely from positive
points without referencing the absence of
criticism.
A crucial part of this process is the
application of strict content filtering
criteria: you must focus exclusively on the
product’s intrinsic qualities and performance,
and are directed to disregard and exclude all
logistical and service-related comments, such
as those concerning shipping, packaging, or
seller interactions.
Furthermore, to ensure the summary is broadly
applicable, normalize domain-specific
information by generalizing or omitting
variant-specific details like color or exact
sizing for fashion items, and frame any
discussions of price in terms of overall value
or price-to-performance ratio.
Finally, adhere to several stylistic and
formal constraints: adopt a neutral and
objective tone, maintain a third-person
narrative perspective by attributing all
opinions to "users," and ensure the final
output is grammatically correct and adheres to
a strict word count limit.
APPENDIX A.2 EVALUATION
PROMPT
Your function is to perform a comprehensive
validation of the provided Turkish summary,
ensuring its strict adherence to a series of
critical guidelines. You must first verify
its compliance with formal constraints: the
summary must be written in Turkish, remain
within word limit, and consistently use a
third-person point of view. The core of your
assessment will be content-based, confirming
that the summary focuses exclusively on
product-related features, advantages, and
disadvantages, with a special allowance for
discussions of a book’s content if it is the
product in question.
It is imperative to penalize any deviation
from the strict exclusion criteria, which
prohibit any mention of topics such as cargo,
packaging, sellers, or delivery, as well as
gifts, or health suggestions. Furthermore,
you will ensure that any recommendations
are explicitly attributed to "users," that
personal experiences are generalized to
reflect broader customer sentiment, and
that conflicting feedback is handled with
objectivity.
Finally, you must evaluate the summary’s
overall sentiment, applying a significant
penalty if it is excessively negative or lacks
a constructive, positive perspective, as the
goal is to inform, not deter.
Customer Review Summarization in Production with Large Language Models for e-Commerce Platforms
11