Multimodal Large Language Models as a Catalyst for Advancing Learning Analytics in Early Childhood Education

Authors

  • Yuanyuan Yang School of Smart Education, Jiangsu Normal University Author
  • Tianchen Sun Research Center for Scientific Data Hub, Zhejiang Lab Author
  • Yuan Shen College of Education, Zhejiang University of Technology Author
  • Yangbin Xie Research Center for Scientific Data Hub, Zhejiang Lab Author

Abstract

Early childhood education (ECE) involves rich, informal, and multimodal learning processes that are difficult to assess with traditional methods. This paper presents a novel framework that integrates Multimodal Large Language Models (MLLMs) into ECE learning analytics to capture children’s spontaneous expressions—such as drawings, speech, gestures, and social interactions—in a scalable, child-centered manner. The system includes multimodal data collection, MLLM-based feature extraction, automated developmental analytics, and educator-in-the-loop feedback. A real-world case from a rural kindergarten illustrates the framework’s ability to generate interpretable indicators and actionable insights. We discuss opportunities for individualized assessment and developmentally appropriate practices, as well as challenges related to interpretability, privacy, and equity. This work demonstrates the potential of MLLMs to support holistic, play-based learning analytics in early childhood settings.

Downloads

Download data is not yet available.

Downloads

Published

2025-09-05

Conference Proceedings Volume

Section

Conference Proceedings Submissions