Large Vision-Language Model Learning and Applications (LAVA) Call for Papers

Call for Papers and Challenge Participation

We invite researchers, practitioners, and enthusiasts to contribute to
the Workshop and Grand Challenge on 
Large Vision-Language Model Learning and Applications (LAVA), 
to be held in conjunction with ACM
Multimedia 2025.

 

?? LAVA Workshop Overview

The LAVA Workshop explores innovations and challenges in Large
Vision–Language Models (LVLMs). We welcome contributions across a
broad spectrum of topics, including but not limited to:

    Data preprocessing and prompt engineering for LVLMs
    Training and compression techniques for LVLMs
    Self-supervised, unsupervised, few-shot, and zero-shot learning
    Generative AI and multimodal generation
    Trustworthy and explainable LVLMs
    Security, privacy, and ethical concerns in LVLMs
    Evaluation and benchmarking methodologies
    LVLMs for downstream tasks and applications
    LVLMs in virtual, augmented, and mixed reality
    LVLMs for low-resource scenarios
    Multimodal integration beyond vision and language

Submission Types

    Short papers (non-archived): Up to 4 pages, excluding references
    Long papers (archived in ACM Digital Library): Up to 8 pages, excluding references

All submissions should follow the official ACM MM format.

Workshop Important Dates

    ?? Paper submission deadline: June 15, 2025
    ?? ACM MM fast-track submission: July 11, 2025
    ? Notification of acceptance: July 24, 2025
    ??? Camera-ready deadline: August 1, 2025
    ?? Workshop date: October 27–28, 2025

?? More info: https://lava-workshop.github.io/workshop

 

?? LAVA Grand Challenge 2025

This year's LAVA Challenge focuses on enhancing LVLM capabilities in
interpreting complex visual documents, including: Data Flow Diagrams
(DFDs), Class Diagrams, Gantt Charts, Architectural and Building
Design Drawings

The 2025 challenge emphasizes Japanese government and business
documents in PDF format, each accompanied by multiple-choice
(10-option) questions requiring deep visual-linguistic understanding.

Challenge Important Dates

    ? Registration opens: March 15, 2025
    ?? Public data release: April 17, 2025
    ? Registration closes: May 31, 2025
    ?? Private test data release: We decided to use the test data for public and private leaderboard.
    ?? Final results, report & paper submission deadline: June 30, 2025
    ?? Notification of acceptance: July 24, 2025
    ??? Camera-ready deadline: August 26, 2025
    ?? Challenge presentation date: October 27–31, 2025

?? More info: https://lava-workshop.github.io/grandchallenge

 

We look forward to your contributions and participation in pushing the
frontiers of vision-language learning!