Many people relying on urkund for
plagiarism detection in Hindi content but
urkund has limited coverage and accuracy in
detecting instances of plagiarism. While
Urkund supports multiple languages,
including Hindi, its database for Hindi
sources may be relatively smaller and less
comprehensive compared to its English
database. This limitation can result in missed
matches or reduced accuracy in identifying
plagiarism in Hindi content. Additionally, the
nuances of the Hindi language, including
cultural and contextual references, may pose
challenges for Urkund's algorithms,
potentially leading to false positives or
missed instances of plagiarism.
6 CONCLUSION AND SCOPE OF
WORK
This work proceeded with various algorithms,
procedures and methodologies handled by different
researchers, and discussed these approaches on
various parameters. Despite of wide variety of
techniques available for plagiarism detection, there
are still some research gaps and challenges that need
to be addressed. For example, some methods may
struggle to detect plagiarism when the plagiarized
text has been paraphrased or reworded, while others
may struggle to handle certain types of document
formats or languages. Additionally, there are ethical
and legal issues to consider when implementing
plagiarism detection, such as ensuring that the
privacy of students or other users is respected.
After reviewing all the papers from 2016 to 2023,
it has been found to construct intra-corpus a
productive system can be planned alongside that a
viable AI model can be proposed for the
counterfeiting recognition, A huge local database can
be created in the system. Most of the commercial
software’s uses the local corpus for speedup the
search and reduces the processing of the system.
Plagiarism checking in Hindi documents has not been
working properly, so ML technique can be deployed
to design a tool specifically for Hindi content along
with that most of the work suggested earlier are not
deployed on web, it is required to build a web
interface for better experience.
REFERENCES
U. Garg and V. Goyal, “Maulik: A Plagiarism Detection
Tool for Hindi Documents”, Indian Journal of Science
and Technology, vo. 9, no. 12, pp. 1- 11, 2016
N. Ehsan and A. Shakery, “Candidate document retrieval
for cross- lingual plagiarism detection using two-level
proximity information”, Information Processing &
Management, November 2016
G. P. V. and J. D. Velásquez, “Docode 5: Building a real-
world plagiarism detection system”, Engineering
Applications of Artificial Intelligence, vol. 32, no. 4,
pp. 1703-1712, September 2017
K. Baba, T. Nakatoh and T. Minami, “Plagiarism detection
using document similarity based on distributed
representation”, Procedia Computer Science, vol. 10,
pp. 89798-89822, 2017
A. Abdi, S. M. Shamsuddin and R. M. Aliguliyev, “A
linguistic treatment for automatic external plagiarism
detection”, Knowledge-Based Systems, vol. 19, no. 3,
pp. 1817-1826, 1 November 2017
J. Dwivedi and A. Tiwary, "Plagiarism detection on bigdata
using modified map-reduced based SCAM algorithm,"
2017 International Conference on Innovative
Mechanisms for Industry Applications (ICIMIA),
2017, pp. 608-610
K. Baba, "Fast plagiarism detection based on simple
document similarity," 2017 Twelfth International
Conference on Digital Information Management
(ICDIM), 2017, pp. 54-58
R. Sutoyo et al., "Detecting documents plagiarism using
winnowing algorithm and k-gram method," 2017 IEEE
International Conference on Cybernetics and
Computational Intelligence (CyberneticsCom), 2017,
pp. 67-72
W. Ali, T. ahmed, Z. Rehman, A. U. Rehman and M.
Slaman, "Detection of Plagiarism in Urdu Text
Documents," 2018 14th International Conference on
Emerging Technologies (ICET), 2018, pp. 1-6
R. K. Bachchan and A. K. Timalsina, "Plagiarism Detection
Framework Using Monte Carlo Based Artificial Neural
Network for Nepali Language," 2018 IEEE 3rd
International Conference on Computing,
Communication and Security (ICCCS), 2018, pp. 122-
127
S. Lazemi, H. Ebrahimpour-Komleh and N. Noroozi,
"Persian Plagirisim Detection Using CNN s," 2018 8th
International Conference on Computer and Knowledge
Engineering (ICCKE), 2018, pp. 171-175
D. Soyusiawaty, A. H. S. Jones and P. Widiandana,
"Similarity Detection of Student Assignments Using
Rocchio Method," 2018 12th International Conference
on Telecommunication Systems, Services, and
Applications (TSSA), 2018, pp. 1-4
N. Meuschke, V. Stange, M. Schubotz, M. Kramer and B.
Gipp, "Improving Academic Plagiarism Detection for
STEM Documents by Analyzing Mathematical Content
and Citations," 2019 ACM/IEEE Joint Conference on
Digital Libraries (JCDL), 2019, pp. 120-129