10–14 Nov 2025
Office of Grants and Research
Africa/Accra timezone

Is Facial Geometric-Encoding Crucial to Understanding Facial Data in Video Sequences?

Not scheduled
45m
Office of Grants and Research

Office of Grants and Research

Poster Presentation Emerging Technologies, Artificial Intelligence, and Engineering Innovations

Speaker

Emmanuel Kwesi Baah (Kwame Nkrumah University of Science and Technology)

Description

The sensitivity to the subtle facial changes demands a dedicated attention model that models a geometric layout of faces in video sequences to enable the self-attention-based architectures to understand the structural and dynamic aspects of the facial data for engagement monitoring. Convolutional networks lack this. We present a convolution-free approach to video-based facial expression recognition exclusively built on the TimeSformer architecture. Our method, named “FMeshformer”, adapts the standard TimeSformer architecture for generic action recognition by applying the mesh positional encoder after the patch embedding. Our proposed model overcomes the limitation of the linear positional embedding, which fails to capture the nuanced spatial relationship between facial features, by focusing on geometric layouts(mesh-aware) of the faces in facial-expression-based videos to understand the dynamic and structural aspects of engagement using facial data. Despite the innovative design, the FMeshformer achieves state-of-the-art results on facial expression detection using the benchmark DAiSEE dataset with a test accuracy of 71%. Finally, compared to other three-dimensional convolutional networks, our model is faster to train on a new dataset, it achieves a significantly higher test efficiency, and is applied to longer video clips (over 60 seconds long).

Primary author

Emmanuel Kwesi Baah (Kwame Nkrumah University of Science and Technology)

Co-authors

Prof. James Ben Hayfron-Acquah (Kwame Nkrumah University of Science and Technology) Dr Dominic Asamoah (Kwame Nkrumah University of Science and Technology) Kwabena Owusu-Agyemang (Kwame Nkrumah University of Science and Technology)

Presentation materials