TY - GEN
T1 - SWIN and Vision Transformer-Driven Crack Detection in the Al Qattara Oasis, UAE
T2 - 11th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2024
AU - Ali, Luqman
AU - Ambali Parambil, Medha Mohan
AU - Swavaf, Muhammed
AU - Alnajjar, Fady
AU - Aljassmi, Hamad
AU - De Man, Adriaan De
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Heritage sites are central to the cultural identity and historical narrative of a community. The Al Qattara Oasis, located in the United Arab Emirates (UAE), illustrates this function well. This study addresses the pressing need to preserve and maintain built heritage by employing modern technological solutions for infrastructure upkeep. Specifically, it focuses on crack detection, a critical aspect of ensuring the structural integrity of heritage buildings. Utilizing data collected from various structures across the UAE, obtained through handheld cameras, the effectiveness of Vision Transformers (ViTs) and Swin Transformers in identifying cracks within Al Qattara is assessed. The research thoroughly evaluates models, with particular attention to different input patch sizes. Through systematic experimentation and analysis over 100 epochs, ViT models with a patch size 16 exhibit significant promise in crack detection. Notably, the ViT-16 model achieves good performance metrics, including training and validation accuracies of 85% and (82%) respectively, and correctly classifies 791 out of 957 cracks and 772 out of 903 non-cracks. In contrast, Swin Transformers show lower validation accuracies (70-74%) and higher misclassification rates. The outcomes underscore the potential of ViT models in enhancing infrastructure maintenance efforts within heritage sites like Qattara Oasis. By combining advanced technology with a profound respect for historical preservation, this study aims to contribute to the sustainable conservation and protection of cultural heritage, ensuring that future generations can continue to appreciate the enduring legacy of sites such as Qattara Oasis.
AB - Heritage sites are central to the cultural identity and historical narrative of a community. The Al Qattara Oasis, located in the United Arab Emirates (UAE), illustrates this function well. This study addresses the pressing need to preserve and maintain built heritage by employing modern technological solutions for infrastructure upkeep. Specifically, it focuses on crack detection, a critical aspect of ensuring the structural integrity of heritage buildings. Utilizing data collected from various structures across the UAE, obtained through handheld cameras, the effectiveness of Vision Transformers (ViTs) and Swin Transformers in identifying cracks within Al Qattara is assessed. The research thoroughly evaluates models, with particular attention to different input patch sizes. Through systematic experimentation and analysis over 100 epochs, ViT models with a patch size 16 exhibit significant promise in crack detection. Notably, the ViT-16 model achieves good performance metrics, including training and validation accuracies of 85% and (82%) respectively, and correctly classifies 791 out of 957 cracks and 772 out of 903 non-cracks. In contrast, Swin Transformers show lower validation accuracies (70-74%) and higher misclassification rates. The outcomes underscore the potential of ViT models in enhancing infrastructure maintenance efforts within heritage sites like Qattara Oasis. By combining advanced technology with a profound respect for historical preservation, this study aims to contribute to the sustainable conservation and protection of cultural heritage, ensuring that future generations can continue to appreciate the enduring legacy of sites such as Qattara Oasis.
KW - Crack detection
KW - Swin transformer
KW - Vision transformer
UR - http://www.scopus.com/inward/record.url?scp=105003308765&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105003308765&partnerID=8YFLogxK
U2 - 10.1109/BDCAT63179.2024.00038
DO - 10.1109/BDCAT63179.2024.00038
M3 - Conference contribution
AN - SCOPUS:105003308765
T3 - Proceedings - 2024 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2024
SP - 178
EP - 183
BT - Proceedings - 2024 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 16 December 2024 through 19 December 2024
ER -