Towards Optimized IoT-based Context-aware Video Content Analysis Framework

Gad Gad, Eyad Gad, Bassem Mokhtar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Despite the success of convolutional neural networks (CNNs) in the area of spatial analysis, and recurrent neural networks (RNNs) on sequence modeling and interpretation tasks, video analysis has only seen limited interest and progress. This is partially due to focusing on the natural humanlike translation from video space to natural language space to the detriment of informativeness. This paper is proposing an automated context-aware video analysis framework that is directed by the constrains of its application. This framework encorporates an encoder-decoder neural network trained on a closed-domain video-to-text dataset. The network architecture and the standardized language model present in the dataset are optimized for speed, to allow the system to be applied on IoT devices, and for informativeness, to extract information easily from the model output to the following stages of the anlaysis. The proposed framework provides a practical method to integrate the power of CNN and RNN combination in a directed way to extract the most from video content. A classroom monitoring system is discussed as an example of the capabilities and limitations of the proposed framework using NVIDIA's Jetson nano board.

Original languageEnglish
Title of host publication7th IEEE World Forum on Internet of Things, WF-IoT 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages46-50
Number of pages5
ISBN (Electronic)9781665444316
DOIs
Publication statusPublished - Jun 14 2021
Externally publishedYes
Event7th IEEE World Forum on Internet of Things, WF-IoT 2021 - New Orleans, United States
Duration: Jun 14 2021Jul 31 2021

Publication series

Name7th IEEE World Forum on Internet of Things, WF-IoT 2021

Conference

Conference7th IEEE World Forum on Internet of Things, WF-IoT 2021
Country/TerritoryUnited States
CityNew Orleans
Period6/14/217/31/21

Keywords

  • Closed-domain Dataset
  • Context-aware Analysis
  • Deep Learning Framework
  • Subject-Verb-Object Description
  • Video Description

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Hardware and Architecture
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Towards Optimized IoT-based Context-aware Video Content Analysis Framework'. Together they form a unique fingerprint.

Cite this