
offering the ability to learn complex patterns, adapt
to evolving threats, and outperform traditional rule-
based systems that rely on static signatures. While
numerous studies (Dina and Manivannan, 2021; Ab-
delmoumin et al., 2022; Sahani et al., 2023; Yang
and Shami, 2022; Gite et al., 2023; Yu et al., 2024;
Gomez et al., 2023; Zeng et al., 2024; Theofanous
et al., 2024; Kondaiah et al., 2024; Siyyal et al., 2022;
Zang et al., 2024; Ucci et al., 2021; Berbecaru and
Petraglia, 2023) have explored ML-based Intrusion
Detection Systems (IDS), most evaluate performance
in simulation environments, limiting real-world ap-
plicability. Even those that use real testbeds (Siyyal
et al., 2022; Zang et al., 2024; Ucci et al., 2021;
Berbecaru and Petraglia, 2023) often overlook critical
challenges in high-throughput networks—particularly
computational overload and excessive inference la-
tency—leading to delayed detection and reduced ef-
fectiveness under realistic traffic loads.
To address these limitations, we propose a novel
ML-based NIDS architecture designed for low-
latency, high-throughput operation. The system is
divided into two optimized components: (1) traf-
fic capture and preprocessing, and (2) model infer-
ence. For the first, we employ passive optical tap-
ping to capture traffic without disrupting production
networks, combined with the Argus tool for real-time
feature extraction. This ensures timely, lightweight
data preparation with minimal overhead. For infer-
ence, we introduce a parallel micro-batching archi-
tecture that improves detection throughput and scal-
ability, enabling rapid, accurate threat identification
in high-speed environments. By co-optimizing both
stages for minimal decision time, our design enhances
system responsiveness and practical deployability in
operational networks.
This paper proposes a high-throughput ML-based
NIDS architecture designed for real-time operation
in high-speed networks without disrupting production
traffic. The system integrates passive optical tapping
with Argus for efficient, real-time feature extraction,
ensuring timely and relevant input to the detection
model. To maximize scalability and detection per-
formance, we introduce a parallel micro-batching ar-
chitecture for inference, enabling rapid processing of
streaming traffic under demanding conditions. To-
gether, these contributions enhance the practicality,
responsiveness, and accuracy of intrusion detection in
operational network environments.
2 RELATED WORK
In the realm of simulated environments for Ma-
chine Learning-based Intrusion Detection Systems
(IDS), several studies have provided valuable
insights.Traditional approaches employ hybrid
signature-anomaly detection (Dina and Manivannan,
2021), ensemble methods combining PCA with
deep learning (Abdelmoumin et al., 2022), and
diverse ML techniques including SVMs, decision
trees, and neural networks (Sahani et al., 2023;
Yang and Shami, 2022). Recent advances explore
specialized architectures such as graph-based sys-
tems (Yu et al., 2024), unsupervised clustering with
HDBSCAN (Gomez et al., 2023), and ensemble
models integrating deep learning with self-attention
mechanisms (Kondaiah et al., 2024). However,
these simulation-based studies focus primarily on
maximizing detection accuracy while neglecting
critical real-world constraints: inference latency,
traffic capture efficiency, and end-to-end detection
time optimization.
Real testbed implementations reveal additional
limitations. Studies using traditional packet capture
methods (Wireshark, tcpdump) (Siyyal et al., 2022;
Kondaiah et al., 2024) face scalability challenges at
high network speeds, while computationally intensive
approaches like NLP-inspired deep learning (Zang
et al., 2024) introduce prohibitive processing delays.
Existing monitoring solutions integrate multiple tools
(Suricata, Zeek) (Berbecaru and Petraglia, 2023) but
suffer from deployment complexity and lack opti-
mization for high-throughput scenarios. Notably, no
prior work addresses the joint optimization of passive
traffic capture, ML inference efficiency, and detection
time minimization specifically for 10Gbps networks.
Our approach uniquely combines passive optical tap-
ping for zero-latency capture with a two-tier architec-
ture optimizing both network-level filtering and ML-
based detection, explicitly targeting sub-second re-
sponse times in high-speed operational environments.
3 SCALABLE ML-BASED IDS
FOR HIGHT-SPEED
NETWORKS
In this section, we explain the overall design of our
proposed solution and describe it in detail with a pri-
mary goal to achieve real-time detection with high
throughput.
RITECH 2025 - The International Conference on Research and Innovations in Information and Engineering Technology
158