performance of NarcoTrace and brings out its
potential to revolutionize the digital war against drug
trafficking
.
2 LITERATURE SURVEY
2.1 Overview of Current Method of
Detection
Drug trafficking circles have proved to be very elastic
when it comes to applying social media apps like
Telegram, WhatsApp and Instagram as messaging
instruments. All of these offer anonymity, a very
large user base that lets traffickers hide in them and
behave quite undetected. Trafficking scales as well as
techniques used make old detection mechanisms, for
example manual content moderation and keyword
searching ineffective (Alves and Pedroso, 2019),
(Ma, Qian, et al. , 2024). Recent studies have,
therefore, started to search for using machine learning
and AI in this matter. For instance, Alves et al.
employed the use of hypergraph learning method on
drug trafficking networks in the pursuit of gaining
insight in the community structures on which
traffickers operate and converge on social media
platform 3. Similarly, Shah et al. employed the social
network analysis on the encrypted communication
where evidence was realised that at the network level,
it can be a point for the detection of illicit activities 4.
However, they lack to hold the strength to tackle the
comprehensive use of multimodality in the content of
the social media relating to the usage of text, images
and audio.
2.2 Text Analysis in Drug Trafficking
Detection
It remains however one of the most prominent types
of data used for the passing on of information among
traffickers. It majorly relied on shallow text mining
approaches: TF-IDF and keyword matching, in
earlier approaches. These do not understand the
contexts and are not up to nuanced language analysis
based on context understanding (Alves, 2023),
(Wang, 2021). Ma et al have developed sophisticated
NLP models, BERT, to detect the languages related
to drugs on social media. These models are better at
subtlety in linguistic patterns and context than the
traditional method approach (Shah and Sen, 2020).
Even though the NLP models are highly effective,
there is still a huge challenge in using the constantly
evolving slang used in dealing with drug trafficking.
In fact, baselines such as LDA for topic modeling are
merged into bigger frameworks that almost captured
the semantic richness of user conversations (Desai
and Patel, 2019). The NarcoTrace models used the
concept of transfer learning wherein finetuned pre-
trained models can learn through specific data sets to
enhance its ability in adapting new slang and jargon
(Shah and Sen, 2020).
2.3 Image and Video Analysis
Images and videos are another form of visual
information which are commonly used by traffickers
for advertising their products or confirming
transactions. Object detection models based on
YOLO have been promising in identifying drug
paraphernalia within images. Alves et al. underlined
the employment of YOLO V5 to identify secret
shipments of drugs, emphasizing its real-time
detection capabilities with high accuracy (Shah and
Sen, 2020), (Kumar and Gupta, 2020). However,
traffickers usually use obfuscation attacks, like
altering a view of an object or hiding prohibited
objects in seemingly benign events. That is
particularly challenging even for as advanced models
as YOLO. In response to this, NarcoTrace uses
advanced preprocessing techniques to increase its
robustness against such evasion attacks (Kumar and
Gupta, 2020).
2.4 Audio Analysis for Crime Detection
Audio communications are favored by traffickers
since they are seen as untraceable. Nowadays, with
speech recognition models including Wav2vec,
transcript and analysis of audio messages might now
potentially be made possible (Shah, 2023). Honourio
et al. even demoed how audio analysis may aid
forensic investigations by picking crucial phrases
indicating drug trafficking activities.
Despite these developments, there are still issues
with how it deals with mixed accents, noise, and
audio compression. NarcoTrace utilizes noise
reduction and sophisticated models of speech to make
sure its detection capability is insensitive to the
adverse effects of poor audio quality conditions ,
(Shah, 2023).
2.5 Limitations and Gaps
Although some notable progresses have been made,
outstanding gaps still persist in detecting online drug
trafficking via social media. Current systems lack the
processing of several data modalities simultaneously