The 6th International Workshop on Human-centric Multimedia Analysis
Rio de Janeiro, Brazil — 10-14 November 2026
View on ACM MM 2026

News

2026/03/25: The website is established
2026/04/13: Important updates regarding the on-site presentation policy and Open Access have been added to the Call for Papers section.

Introduction

Human-centric multimedia analysis is one of the fundamental problems in multimedia understanding. It is a very challenging problem that involves multiple topics such as face recognition, human parsing, human pose estimation, human action detection, human-object interaction, person tracking, person re-identification, and so on. Today, ubiquitous multimedia sensors and large-scale computing infrastructures are producing at a rapid velocity a wide variety of big multi-modality data for human-centric analysis, which provides rich knowledge to tackle these challenges. Researchers have striven to push the limits of human-centric multimedia analysis in various applications, such as intelligent surveillance, retailing, fashion design, and services. Therefore, the purpose of this workshop is to: 1) bring together the state-of-the-art research on human-centric multimedia analysis; 2) call for a coordinated effort to understand the opportunities and challenges emerging in human-centric multimedia analysis; 3) identify key tasks and evaluate the state-of-the-art methods; 4) showcase innovative methodologies and ideas; 5) introduce interesting real-world human-centric multimedia analysis systems or applications; and 6) propose new real-world datasets and discuss future directions. We believe this workshop will offer a timely collection of research updates to benefit researchers and practitioners in the broad multimedia communities. To this end, we solicit original research and survey papers in (but not limited to) the following topics:

  • Face detection and recognition, face anti-spoofing, face landmark detection and parsing.
  • Human detection, pose estimation, human parsing, and pose tracking.
  • Human 3D shape estimation and reconstruction.
  • Human gait recognition, person re-identification and person tracking.
  • Human action recognition and detection.
  • Human activity recognition using non-visual sensors.
  • Human-computer interaction / Human-object interaction.
  • Multimedia event detection.
  • Anomaly event detection.
  • Human crowd analysis.
  • Human-centric multimedia content generation.
  • Human-centric multi-agent system.
  • Human-centric large model.


Organizers

Wu Liu

University of Science and Technology of China

Yutong Gao

School of Information Engineering, Minzu University of China

Xinchen Liu

JD AI Research, Beijing, China

Zhaochun Ren

Leiden University

Hongyuan Zhu

Agency for Science, Technology, and Research (A*STAR), Singapore

Jingkuan Song

School of Computer Science and Technology, Tongji University

Jiebo Luo

University of Rochester

Xiaoyan Gu

University of Chinese Academy of Sciences

If you have any questions, feel free to contact huma26.organizer@gmail.com

More information