TY - JOUR
T1 - A video summarization framework based on activity attention modeling using deep features for smart campus surveillance system
AU - Muhammad, Wasim
AU - Ahmed, Imran
AU - Ahmad, Jamil
AU - Nawaz, Muhammad
AU - Alabdulkreem, Eatedal
AU - Ghadi, Yazeed
N1 - Publisher Copyright:
© 2022 Muhammad et al.
PY - 2022
Y1 - 2022
N2 - Like other business domains, digital monitoring has now become an integral part of almost every academic institution. These surveillance systems cover all the routine activities happening on the campus while producing a massive volume of video data. Selection and searching the desired video segment in such a vast video repository is highly time-consuming. Effective video summarization methods are thus needed for fast navigation and retrieval of video content. This paper introduces a keyframe extraction method to summarize academic activities to produce a short representation of the target video while preserving all the essential activities present in the original video. First, we perform fine-grain activity recognition using a realistic Campus Activities Dataset (CAD) by modeling activity attention scores using a deep CNN model. In the second phase, we use the generated attention scores for each activity category to extract significant video frames. Finally, we evaluate the interframe similarity index used to reduce the number of redundant frames and extract only the representative keyframes.
AB - Like other business domains, digital monitoring has now become an integral part of almost every academic institution. These surveillance systems cover all the routine activities happening on the campus while producing a massive volume of video data. Selection and searching the desired video segment in such a vast video repository is highly time-consuming. Effective video summarization methods are thus needed for fast navigation and retrieval of video content. This paper introduces a keyframe extraction method to summarize academic activities to produce a short representation of the target video while preserving all the essential activities present in the original video. First, we perform fine-grain activity recognition using a realistic Campus Activities Dataset (CAD) by modeling activity attention scores using a deep CNN model. In the second phase, we use the generated attention scores for each activity category to extract significant video frames. Finally, we evaluate the interframe similarity index used to reduce the number of redundant frames and extract only the representative keyframes.
KW - Dats science
KW - Deep learning
KW - Emerging technologies
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85130365217&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130365217&partnerID=8YFLogxK
U2 - 10.7717/peerj-cs.911
DO - 10.7717/peerj-cs.911
M3 - Article
AN - SCOPUS:85130365217
SN - 2376-5992
VL - 8
JO - PeerJ Computer Science
JF - PeerJ Computer Science
M1 - e911
ER -