
BLEU scores
0
0.1
0.2
0.3
0.4
0.3
0.11
7 · 10
−2
Scores
Proposed Model ChatGPT 3.5 Gemini AI
Figure 7: Comparison of BLEU Scores.
nuanced needs of learners, leading to inconsistencies
and gaps in educational delivery. The proposed ap-
proach leverages an LLM fine-tuned with embedded
value training, addresses these challenges by ensur-
ing more contextually rele- vant and comprehensive
responses. The superior performance of the model, as
demonstrated by high ROUGE and BLEU scores, in-
dicates its proficiency in generating coherent and pre-
cise language, surpassing both GPT-3.5 and Gemini
AI. These results underscore the model’s potential to
improve educational outcomes by providing accurate
and meaningful interactions. Future work will focus
on further refining the model to enhance its adaptabil-
ity and scalability, ensuring it can cater to a diverse
range of educational contexts and needs. This con-
tinued development aims to solidify the role of ad-
vanced chatbots in shaping the future of personalized
academic learning.
REFERENCES
Almazrouei, E., Alobeidli, H., Alshamsi, A., Cappelli, A.,
Cojocaru, R., Debbah, M., Goffinet,
´
E., Hesslow,
D., Launay, J., Malartic, Q., et al. (2023). The fal-
con series of open language models. arXiv preprint
arXiv:2311.16867.
Chen, J., Lu, X., Du, Y., Rejtig, M., Bagley, R., Horn, M.,
and Wilensky, U. (2024). Learning agent-based mod-
eling with llm companions: Experiences of novices
and experts using chatgpt & netlogo chat. In Proceed-
ings of the CHI Conf. on Human Factors in Computing
Systems, pages 1–18. ACM.
Firth, J. A., Torous, J., and Firth, J. (2020). Exploring the
impact of internet use on memory and attention pro-
cesses. International Journal of Environmental Re-
search and Public Health, 17(24).
Florindi, F., Fedele, P., and Dimitri, G. M. (2024). A novel
solution for the development of a sentimental analysis
chatbot integrating chatgpt. Personal and Ubiquitous
Computing, pages 1–14.
Google (2023). Gemini AI. https://gemini.google.com/.
Text-based AI Model.
Mirowski, P., Mathewson, K. W., Pittman, J., and Evans,
R. (2023). Co-writing screenplays and theatre scripts
with language models: Evaluation by industry profes-
sionals. In Proceedings of the 2023 CHI Conference
on Human Factors in Computing Systems. ACM.
Montagna, S., Ferretti, S., Klopfenstein, L. C., Florio, A.,
and Pengo, M. F. (2023). Data decentralisation of
llm-based chatbot systems in chronic disease self-
management. In Proceedings of the 2023 ACM Con-
ference on Information Technology for Social Good,
pages 205–212.
Omoregbe, N. A. I., Ndaman, I. O., Misra, S., Abayomi-
Alli, O. O., Dama
ˇ
sevi
ˇ
cius, R., and Dogra, A. (2020).
Text messaging-based medical diagnosis using natu-
ral language processing and fuzzy logic. Journal of
Healthcare Engineering, 2020:1–14.
Ooi, K.-B., Tan, G. W.-H., Al-Emran, M., Al-Sharafi,
M. A., Capatina, A., Chakraborty, A., Dwivedi, Y. K.,
Huang, T.-L., Kar, A. K., Lee, V.-H., et al. (2023). The
potential of generative artificial intelligence across
disciplines: Perspectives and future directions. Jour-
nal of Computer Information Systems, pages 1–32.
OpenAI (2023). ChatGPT (v3.5). https://chat.openai.com.
Large Language Model.
Shankar, S., Zamfirescu-Pereira, J., Hartmann, B.,
Parameswaran, A. G., and Arawjo, I. (2024). Who
validates the validators? aligning llm-assisted evalu-
ation of llm outputs with human preferences. arXiv
preprint arXiv:2404.12272.
ˇ
Sar
ˇ
cevi
´
c, A., Tomi
ˇ
ci
´
c, I., Merlin, A., and Horvat, M.
(2024). Enhancing programming education with
open-source generative ai chatbots. In 2024 47th
MIPRO ICT and Electronics Convention (MIPRO),
pages 2051–2056.
Yan, L., Sha, L., Zhao, L., Li, Y., Martinez-Maldonado, R.,
Chen, G., Li, X., Jin, Y., and Ga
ˇ
sevi
´
c, D. (2024). Prac-
tical and ethical challenges of large language models
in education: A systematic scoping review. British
Journal of Educational Technology, 55(1):90–112.
LLM Based Embedded Value Trained ChatBot Framework for Personalized Academic Learning
675