Abstract
The COVID-19 outbreak has extenuated the need for a monitoring system that can monitor face mask adherence and social distancing with the use of AI. With the existing video surveillance systems as base, a deep learning model is proposed for mask detection and social distance measurement. State-of-the-art object detection and recognition models such as Mask RCNN, YOLOv4, YOLOv5, and YOLOR were trained for mask detection and evaluated on the existing datasets and on a newly proposed video mask detection dataset the ViDMASK. The obtained results achieved a comparatively high mean average precision of 92.4% for YOLOR. After mask detection, the distance between people's faces is measured for high risk and low risk distance. Furthermore, the new large-scale mask dataset from videos named ViDMASK diversifies the subjects in terms of pose, environment, quality of image, and versatile subject characteristics, producing a challenging dataset. The tested models succeed in detecting the face masks with high performance on the existing dataset, MOXA. However, with the VIDMASK dataset, the performance of most models are less accurate because of the complexity of the dataset and the number of people in each scene. The link to ViDMask dataset and the base codes are available at https://github.com/ViDMask/VidMask-code.git.
Original language | English |
---|---|
Article number | 102235 |
Journal | Displays |
Volume | 73 |
DOIs | |
Publication status | Published - Jul 2022 |
Externally published | Yes |
Keywords
- Faster Mask RCNN with Resnet backbone and FPN
- Mask detection
- Mask Video dataset
- Social distancing
- YOLOR
- YOLOV4
- YOLOV4-tiny
- YOLOV5
ASJC Scopus subject areas
- Human-Computer Interaction
- Hardware and Architecture
- Electrical and Electronic Engineering