Comparing Emotion Detection Methods in Online Classrooms: YOLO Models, Multimodal LLM, and Human Baseline

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

The COVID-19 pandemic has transformed learning environments, challenging educators to understand students' behaviors during the online mode of learning, in particular emotions associated with students' attention during virtual classrooms. As learning transitions between physical and virtual spaces, the ability to interpret student attention and engagement has become complex. In response to this challenge, our research investigates the use of GPT-4o, a multimodal large language model, for identifying student emotions by analyzing images in diverse learning settings. The study involved analyzing online classroom images featuring 149 faces, utilizing three distinct approaches: a computer vision model (YOLO), the multimodal LLM (GPT-4o), and a human-annotated baseline. The analysis systematically categorized facial expressions into eight emotional categories: Happy, Sad, Angry, Neutral, Contempt, Disgust, Fear, and Surprise. The findings indicate that multimodal LLMs can effectively detect student emotions, achieving an average accuracy of 93.8%, which aligns with the human baseline accuracy of 97.0%. In contrast, YOLO models maintained an average accuracy of 81.9%, performing well for basic emotions but struggling with subtle expressions. This research contributes to enhancing educational practices by providing valuable insights regarding the application of multimodal LLMs to assist educators in comprehending student emotions within both physical and digital classroom settings.

Original languageEnglish
Title of host publicationEDUCON 2025 - IEEE Global Engineering Education Conference, Proceedings
PublisherIEEE Computer Society
ISBN (Electronic)9798331539498
DOIs
Publication statusPublished - 2025
Event16th IEEE Global Engineering Education Conference, EDUCON 2025 - London, United Kingdom
Duration: Apr 22 2025Apr 25 2025

Publication series

NameIEEE Global Engineering Education Conference, EDUCON
ISSN (Print)2165-9559
ISSN (Electronic)2165-9567

Conference

Conference16th IEEE Global Engineering Education Conference, EDUCON 2025
Country/TerritoryUnited Kingdom
CityLondon
Period4/22/254/25/25

ASJC Scopus subject areas

  • Education
  • General Engineering
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Comparing Emotion Detection Methods in Online Classrooms: YOLO Models, Multimodal LLM, and Human Baseline'. Together they form a unique fingerprint.

Cite this