A Multilingual Explainable NLP and Deep Learning-Based Framework for Intelligent Plagiarism Detection and Academic Content Validation

Dondeti Rammohanreddy; P. Pradeep; Oviyasri  G. K.; M.  K. Kirubakaran; Allam Balaram; Angel  Jency V.

doi:10.5220/0013865700004919

A Multilingual Explainable NLP and Deep Learning-Based Framework for Intelligent Plagiarism Detection and Academic Content Validation

Dondeti Rammohanreddy, P. Pradeep, Oviyasri G. K., M. K. Kirubakaran, Allam Balaram, Angel Jency V.

2025

Abstract

For Academic research writing, plagiarism checking has moved from simple text matching to context based matching through sophisticated natural language processing (NLP) and deep learning. This research presents a multilingual, explainable, and scalable approach to intelligent plagiarism detection and content validation effort for academic integrity. By combining BERT and XLM-R model with semantic similarity measurement, the system can effectively detect paraphrased, cross-lingual and AI-generated plagiarism. The model, in contrast to available systems, include citation context awareness, real-time response and domain-based thresholds, which accounts for fairness and transparency in an evaluation. Explainable AI components such as attention visualization and token-level attribution provide interpretability for students, teachers, and reviewers. It also has the ability to detect code and figure plagiarism and it is appropriate for science, technology, engineering and mathematics disciplines. Experimental results on benchmark and real world academic datasets show higher accuracy, fewer false positives, and better cross-language and cross-content type performance. This work is a first step towards the ethical, smart, and inclusive validation of academic content.

Download

Paper Citation

in Harvard Style

Rammohanreddy D., Pradeep P., K. O., Kirubakaran M., Balaram A. and V. A. (2025). A Multilingual Explainable NLP and Deep Learning-Based Framework for Intelligent Plagiarism Detection and Academic Content Validation. In Proceedings of the 1st International Conference on Research and Development in Information, Communication, and Computing Technologies - Volume 1: ICRDICCT`25; ISBN 978-989-758-777-1, SciTePress, pages 357-362. DOI: 10.5220/0013865700004919

in Bibtex Style

@conference{icrdicct`2525,
author={Dondeti Rammohanreddy and P. Pradeep and Oviyasri K. and M. Kirubakaran and Allam Balaram and Angel V.},
title={A Multilingual Explainable NLP and Deep Learning-Based Framework for Intelligent Plagiarism Detection and Academic Content Validation},
booktitle={Proceedings of the 1st International Conference on Research and Development in Information, Communication, and Computing Technologies - Volume 1: ICRDICCT`25},
year={2025},
pages={357-362},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013865700004919},
isbn={978-989-758-777-1},
}

in EndNote Style

TY - CONF

JO - Proceedings of the 1st International Conference on Research and Development in Information, Communication, and Computing Technologies - Volume 1: ICRDICCT`25
TI - A Multilingual Explainable NLP and Deep Learning-Based Framework for Intelligent Plagiarism Detection and Academic Content Validation
SN - 978-989-758-777-1
AU - Rammohanreddy D.
AU - Pradeep P.
AU - K. O.
AU - Kirubakaran M.
AU - Balaram A.
AU - V. A.
PY - 2025
SP - 357
EP - 362
DO - 10.5220/0013865700004919
PB - SciTePress