Foundation, Multimodal Large Language and Generative Models for Face and Gesture Recognition Call for Papers
*****************************
1st International Workshop on
Foundation, Multimodal Large Language and Generative Models for Face
and Gesture Recognition
Held in the scope of IEEE FG 2025
26 or May 30 (TBD), Clearwater, Florida, USA
https://sites.google.com/view/fmllmgm-fg25
Paper submission : 9 April 2025, 11:59pm PST
*****************************
*** Call for Papers ***
The field of face and gesture recognition has recently experienced a
transformative shift with the rise of foundation models, generative
models, and multimodal large language models (LLMs), which offer
unprecedented capabilities to process and integrate multimodal data
(e.g., text, images, video, and audio) in a unified framework. This
workshop aims to explore the implications and potential uses of these
models specifically for face and gesture recognition tasks. Foundation
models (such as CLIP, GPT, etc.) enable robust feature extraction and
transfer learning. In addition, generative models allow synthetic data
generation, privacy-preserving learning, and advanced data
augmentation techniques. As LLMs increasingly support multimodal
functionalities, they provide a promising avenue to advance the field
beyond traditional techniques, facilitating richer, contextually
aware, and potentially more accurate recognition systems, among other
key aspects in the field such as explainability.
This workshop will foster collaboration among researchers interested
in advancing foundation, multimodal LLMs and generative models for
face and gesture recognition, encouraging interdisciplinary insights,
and new research that leverages these models for tasks such as
real-time emotion recognition, social behavior analysis, and advanced
biometrics.
Topics of interest include, but are not limited to:
+ Adapting foundation models for face and gesture recognition
+ Zero-shot and few-shot learning for gestures and facial analyses using LLMs
+ Contextual reasoning and interpretation of gestures and expressions via LLMs
+ Ethical, privacy, and robustness challenges in LLM-driven biometric systems
+ Biometric system components (beyond recognition) based on foundation models and LLMs
+ Generating synthetic datasets for face, body, and gesture analysis
*** Paper format and submission ***
*** Format ***
Submitted papers may not be accepted or under review
elsewhere. Submissions may be up to 8 pages (+ an unlimited number of
references) in the same format as the main IEEE FG 2025
papers. Accepted papers will be submitted for inclusion into IEEE
Xplore and published as part of the same proceedings as the main FG
conference papers.
*** Submission ***
Paper submissions are accepted through
https://cmt3.research.microsoft.com/FG2025
*** Dates ***
Workshops will be held on May 26th and May 30th, 2025. The final date
of the workshop will decided at a later date.
*** Important Dates ***
Paper submission: April 9, 2025 (11:59pm Pacific)
Notifications to authors: April 25, 2025
Camera-Ready Submission: May 2, 2025
For more information, visit: https://sites.google.com/view/fmllmgm-fg25
--
Prof. Vitomir Struc, PhD
Laboratory for Machine Intelligence
Faculty of Electrical Engineering
University of Ljubljana, Slovenija
URL: https://lmi.fe.uni-lj.si/en/vitomir-struc/
VP Technical Activities, IEEE Biometrics Council
Supervisory Board Member: European Association for Biometrics (EAB)
Vice-chair of IAPR Technical Committee on Biometrics (TC4)
General Co-Chair: CVF/IEEE Winter Conference on Applications in
Computer Vision (WACV 2026)
Program Co-Chair: IEEE/IAPR International Joint Conference on
Biometrics (IJCB 2025), https://ijcb2025.ieee-biometrics.org/