Nofri Ramadan (1), Muis Muhtadi (2), Muhammad Zulfiqar Rafi (3), Rinakit Estu Waluyo (4)
Background: Young children commonly place objects near or into their mouths, creating safety concerns that require constant supervision. Specific Background: Advances in computer vision enable real-time recognition of hands, faces, and objects, allowing automated identification of behaviors that may lead to hazardous mouth-related interactions. Knowledge Gap: Few systems combine object detection and human pose estimation specifically to assess risks related to toddler hand–mouth–object interactions in real-time environments. Aims: This study develops a risk detection system using YOLOv11n to recognize hands, faces, and objects while classifying conditions into safe or risky based on Euclidean distance between hand and mouth keypoints. Results: The system produces 92% accuracy in scenarios without objects and 74% in scenarios with objects, demonstrating its capability to differentiate between safe and risky conditions. Novelty: This research introduces an integrated spatial analysis approach that evaluates real-time proximity among hands, mouth, and objects, rather than detecting these elements independently. Implication: The system provides practical potential for real-time child safety monitoring, offering earlier awareness of mouth-related object risks in various activity settings.
• The system identifies hand–mouth proximity to classify safe and risky situations.• Integrated pose estimation and object detection enable spatial risk assessment in real time.• The model supports early awareness of mouth-related object risks during toddler activities.
Object Detection, Pose Estimation, Computer Vision, YOLO, Child Safety Monitoring
K. Sivakrishna, J. S. Vaishnavi, A. Sreelekha, S. Sindhu, C. Varshitha, and G. Kalakoti, “Object Detection and Hazard Alert System for Child Safety on Robot using YOLO,” in 2025 5th International Conference on Expert Clouds and Applications (ICOECA), IEEE, 2025, pp. 1094–1098.
H. M. Isa, S. A. Aldoseri, A. S. Abduljabbar, and K. A. Alsulaiti, “Accidental ingestion of foreign bodies/harmful materials in children from Bahrain: A retrospective cohort study,” World J. Clin. Pediatr., vol. 12, no. 4, p. 205, 2023.
S. M. Damanik, E. Sitorus, and I. Mertajaya, “Penerapan Upaya Pencegahan dan Penanganan Aspirasi Benda Asing dan Kejang Demam pada Anak di Rumah,” J. Comunità Serv., vol. 3, no. 2, pp. 653–661, 2021.
T. Sultana, M. Ruiz-Casares, R. Iwo, M. Janus, and J. I. Nazif-Muñoz, “Maternal Education and Children Home Alone in 63 Low-and Middle-Income Countries,” Glob. Pediatr. Health, vol. 11, p. 2333794X241258179, 2024.
M. Karišik, “Foreign body aspiration and ingestion in children,” Acta Clin. Croat., vol. 62, no. Supplement 1, pp. 105–112, 2023.
F. A. Khan and A. Dey, “Towards Enhancing Child Safety: A Deep Learning Approach to Detect Child Safe and Unsafe Objects,” in 2024 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE), IEEE, 2024, pp. 123–128.
S. Zhu et al., “Cribnet: Enhancing infant safety in cribs through vision-based hazard detection,” in 2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG), IEEE, 2024, pp. 01–08.
C. Zheng et al., “Deep learning-based human pose estimation: A survey,” ACM Comput. Surv., vol. 56, no. 1, pp. 1–37, 2023.
A. Tripathi, M. K. Gupta, C. Srivastava, P. Dixit, and S. K. Pandey, “Object detection using yolo: A survey,” in 2022 5th International Conference on Contemporary Computing and Informatics (IC3I), IEEE, 2022, pp. 747–752.
P. Jiang, D. Ergu, F. Liu, Y. Cai, and B. Ma, “A Review of Yolo algorithm developments,” Procedia Comput. Sci., vol. 199, pp. 1066–1073, 2022.
D. Maji, S. Nagori, M. Mathew, and D. Poddar, “Yolo-pose: Enhancing yolo for multi person pose estimation using object keypoint similarity loss,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 2637–2646.
R. Khanam and M. Hussain, “YOLOv11: An Overview of the Key Architectural Enhancements.” 2024. [Online]. Available: https://arxiv.org/abs/2410.17725
T. Arunkumar, S. Maheswaran, P. Dineshkumar, K. Geetha, A. Sureshkumar, and S. Praveenkumar, “Design and Implementation of an Astute Infant Monitoring System Based on YOLO v8 Algorithm,” in 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), IEEE, 2024, pp. 1–6.
S. Diop, F. Jouen, J. Bergounioux, and I. Trabelsi, “Fine-Tuned YOLO Model for Monitoring Children Across Medical Scenes Based on a Large-Scale Real-World Dataset for Children Detection,” IEEE Access, 2025.
Z. Yang, B. Tsui, J. Ning, and Z. Wu, “Falling detection of toddlers based on improved YOLOv8 models,” Sensors, vol. 24, no. 19, p. 6451, 2024.
A. M. Mekala et al., “Fused YOLO and Traditional Features for Emotion Recognition from Facial Images of Tamil and Russian Speaking Children: A Cross-Cultural Study,” IEEE Access, 2025.
M. P. Salim, J. Ong, I. S. Edbert, and D. Surhatono, “Object detection for child Learning media,” in 2022 8th International Conference on Science and Technology (ICST), IEEE, 2022, pp. 1–6.
Q. Wu, X. Wang, N. Li, S. Fong, L. Zhang, and J. Yang, “Real-Time Face and Facial Landmark Joint Detection based on End-to-End Deep Network,” IEEE Trans. Instrum. Meas., 2025.
S. Wang, C. Guo, R. Yang, Q. Zhang, and H. Ren, “A lightweight vision-based measurement for hand gesture information acquisition,” IEEE Sens. J., vol. 23, no. 5, pp. 4964–4973, 2022.
J. H. Tan and C. P. Goh, “Enhancing Child Safety: Computer Vision-Based Accident Detection for Infants and Toddlers,” in 2024 3rd International Conference on Digital Transformation and Applications (ICDXA), 2024, pp. 1–5. doi: 10.1109/ICDXA61007.2024.10470712.