consideration for real-life implementation. Such
evaluation across various performance criteria
addresses gaps and shortcomings in previous studies,
which often focused on individual models or
restricted comparison metrics.
Based on these results, YOLOv11 was chosen as
the most suitable object detection model, as it
outperformed the other architectures. It was then
integrated with a CNN classification model using a
MobileNetV2 backbone, which was selected for its
lightweight architecture and suitability for
deployment in limited computational resource
environment. These models can be used in smart
recycling bins, automated waste-sorting facilities,
and mobile apps. Such implementations could
immediately help Indonesia's waste management
policy by enhancing waste segregation efficiency and
increasing the proportion of properly managed waste
beyond the current rate of around 60%. This approach
would also contribute to broader sustainability goals,
such as SDG 11: Sustainable Cities and
Communities, while aligning with smart city
development initiatives.
5.1 Limitations and Future Work
Although this research offers valuable insights, it is
important to recognize its limitations. The
performance of the models can be affected by the size
and variety of the training datasets (TACO, ICRA-19,
and Domestic Waste Classification). Future work
may include augmenting these datasets with more
diverse waste items and complex environmental
conditions (e.g., varying lighting, occlusions) to
improve model robustness and generalization.
Additional exploration of hyperparameter
optimization for each model or examining recent
architectures could also lead to performance
improvements. Furthermore, effective execution of
such a system would require hardware integration and
creating a comprehensive workflow for real-world
functionality, including the mechanical aspects of
waste sorting activated by the vision system output.
Exploring ensemble methods or model quantization
techniques for deployment on devices with limited
resources could also be beneficial for future studies.
ACKNOWLEDGEMENTS
During the preparation of this work, the authors used
generative AI tools for language refinement. After
using these tools, the authors reviewed and took full
responsibility for the content of the publication.
REFERENCES
Alif, M. A. R. (2024). YOLOv11 for vehicle detection:
advancements, performance, and applications in
intelligent transportation systems.
Dakari Aboyomi, D. and Daniel, C. (2023). A comparative
analysis of modern object detection algorithms: yolo vs.
ssd vs. faster r-cnn. ITEJ 8:96–106.
Dwiatmoko, F., Utami, D., Sivi, N. A., Nahdlatul, U., and
Lampung, U. (2024). Image Classification of Organic
and Non-Organic Waste Using CNN (Convolutional
Neural Network) Algorithm [In Indonesian].
Fang, B., Yu, J., Chen, Z., Osman, A. I., Farghali, M., Ihara,
I., Hamza, E. H., Rooney, D. W., and Yap, P. S. (2023).
Artificial intelligence for waste management in smart
cities: a review.
Fang, J. (2022). SSD-based Lightweight Recyclable
Garbage Target Detection Algorithm. Innovation in
Science and Technology 1(1). doi:
10.56397/ist.2022.08.05.
Fulton, M. S., Hong, J., and Sattar, J. (2020). Trash-
ICRA19: a bounding box labeled dataset of underwater
trash. Available at: https://doi.org/10.13020/x0qn-y082
(accessed June 15, 2025).
Haqqi, M., Rochmah, L., Dwi Safitri, A., Adhi Pratama, R.,
and Tarwoto (2024). Implementation of machine
learning to identify types of waste using cnn algorithm.
Khanam, R. and Hussain, M. (2024). YOLOv11: an
overview of the key architectural enhancements.
Kulkarni, H. N. and Kannamangalam Sundara Raman, N.
(2019). Waste object detection and classification.
Li, J., Chen, J., Sheng, B., Li, P., Yang, P., Feng, D. D., and
Qi, J. (2022). Automatic Detection and Classification
System of Domestic Waste via Multimodel Cascaded
Convolutional Neural Network. IEEE Transactions on
Industrial Informatics 18(1):163–173. doi:
10.1109/TII.2021.3085669.
Lu, H., Li, C., Chen, W., and Jiang, Z. (2020). A single shot
multibox detector based on welding operation method
for biometrics recognition in smart cities. Pattern
Recognition Letters 140:295–302. doi:
10.1016/j.patrec.2020.10.016.
Ma, W., Wang, X., and Yu, J. (2020). A lightweight feature
fusion single shot multibox detector for garbage
detection. IEEE Access 8:188577–188586. doi:
10.1109/ACCESS.2020.3031990.
Ministry of Environment and Forestry of Indonesia (2024).
National Waste Management Information System
(SIPSN). Available at:
https://sipsn.menlhk.go.id/sipsn/ (accessed April 3,
2025) [In Indonesian].
Musaev, A., Anorboev, A., and Youn, J. M. (2025).
Optimized epoch selection ensemble: integrating
custom cnn and fine-tuned mobilenetv2 for malimg
dataset classification. IEEE Access. doi:
10.1109/ACCESS.2025.3547791.
Nie, Z., Duan, W., and Li, X. (2021). Domestic garbage
recognition and detection based on Faster R-CNN. In:
Journal of Physics: Conference Series.