Header Image

Keynote Speakers

Chang Wen Chen
Chang Wen Chen (陈长汶)
Chair Professor of Visual Computing, Interim Dean of the Faculty of Computer and Mathematical Sciences, The Hong Kong Polytechnic University, IEEE Fellow
🎤
Abstract:
Biography: Prof. Changwen Chen is currently the Chair Professor of Visual Computing at The Hong Kong Polytechnic University and the Interim Dean of the Faculty of Computer and Mathematical Sciences. He was an Empire Innovation Professor of Computer Science and Engineering at the University at Buffalo, State University of New York (2008–2021). He also served as Dean of the School of Science and Engineering at The Chinese University of Hong Kong, Shenzhen (2017–2020). Previously, he held the Allen Henry Endow Chair Professorship at Florida Institute of Technology (2003–2007) and faculty positions at the University of Missouri-Columbia (1996–2003) and the University of Rochester (1992–1996). Prof. Chen received his B.S. from the University of Science and Technology of China in 1983, his M.S.E.E. from the University of Southern California in 1986, and his Ph.D. from the University of Illinois at Urbana-Champaign in 1992. He has served as Editor-in-Chief for IEEE Transactions on Multimedia (2014–2016) and IEEE Transactions on Circuits and Systems for Video Technology (2006–2009). He is currently the Associate Editor-in-Chief of IEEE Transactions on Biometrics, Behavior, and Identity Science. He has also been an editor for several other major IEEE journals, including the Proceedings of the IEEE, IEEE Journal on Selected Areas in Communications, and IEEE Journal on Emerging and Selected Topics in Circuits and Systems. Prof. Chen and his students have received nine Best Paper Awards. His professional recognitions include the Sigma Xi Excellence in Graduate Research Mentoring Award (2003), the Alexander von Humboldt Research Award (2009), the University at Buffalo Exceptional Scholar – Sustained Achievement Award (2012), the SUNY Chancellor’s Award for Excellence in Scholarship and Creative Activities (2016), and the Distinguished ECE Alumni Award from the University of Illinois at Urbana-Champaign (2019). He is an IEEE Fellow (2004), an SPIE Fellow (2007), and a member of Academia Europaea (2021).
Chia-Wen Lin
Chia-Wen Lin (林嘉文)
Distinguished Professor, Department of Electrical Engineering, National Tsing Hua University, Deputy Director of NTHU AI Research Center, IEEE Fellow
🎤 Physics-Informed Generative Image Restoration: From Data to Models
Abstract: Images captured in real-world environments often suffer from diverse degradation patterns governed by underlying physical processes, such as motion blur, haze, rain, and low illumination, which lead to undesired contrast loss and appearance distortions. With the rapid development of deep generative image models, numerous image restoration methods have been proposed to effectively alleviate these degradations. As most deep restoration approaches rely on supervised learning, their performance strongly depends on the diversity and representativeness of the training data. However, for many real-world degradations—such as rain, haze, and motion blur—collecting paired degraded and clean images is expensive and often impractical, severely limiting the coverage of available training datasets. These data acquisition challenges restrict the effectiveness and generalization ability of existing restoration models. In this talk, we will present how physics-based models can be leveraged both to enrich training datasets with realistic degradation distributions and to be integrated into diffusion-based restoration frameworks. By incorporating physics-informed priors, the performance of generative restoration models can be substantially enhanced. Representative results will be demonstrated on image dehazing and deblurring tasks.
Biography: Prof. Chia-Wen Lin is currently a Distinguished Professor with the Department of Electrical Engineering, National Tsing Hua University (NTHU), Taiwan. He also serves as Deputy Director of NTHU AI Research Center. His research interests include image/video processing, computer vision, and video networking. Dr. Lin is an IEEE Fellow, and has served on IEEE Circuits and Systems Society (CASS) Fellow Evaluation Committee (2021–2023), and CASS BoG members-at-Large (2022–2024). He was Steering Committee Chair of IEEE ICME (2020–2021), IEEE CASS Distinguished Lecturer (2018–2019), APSIPA Distinguished Lecturer (2023–2024), and President of the Chinese Image Processing and Pattern Recognition (IPPR) Association, Taiwan (2019–2020). He is currently Associate Editor-in-Chief for IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), and has served as Associate Editor of IEEE Transactions on Image Processing, IEEE Transactions on Multimedia, IEEE TCSVT, and IEEE Multimedia. He served as TPC Chair of IEEE ICME in 2010, IEEE ICIP in 2019, and PCS in 2022, and the Conference Chair of IEEE VCIP in 2018 and PCS in 2024.
Patrick Le Callet
Patrick Le Callet
Full Professor, Polytech Nantes / Nantes Université, Senior Member of the Institut Universitaire de France, IEEE Fellow
🎤 Human sustainability in AI scene: toward QoAI experience
Abstract: This talk examines how AI is progressively challenging human sustainability. We use Quality of Experience (QoE) as an interpretative lens to highlight new paradigm in AI evaluation. Facing aging society, AI can be seen as technological solutions. In that context, I first trace the evolution from imitation learning to foundation models, and then to world models and embodied AI, where human behavior is increasingly abstracted into data, patterns, and simulated environments. Beyond human sustainability in that context, I also illustrate what QoE sciences can bring for Machine Experience. Second, I present the emergence of a token-based cognitive economy, and question its ressources debt and possible QoE metrics to mitigate it. Then I present the notion of AI psychological debt. AI does not only optimize tasks, it reconfigures the structure of human experience and cognitive participation. Building on this, I propose QoAI (Quality of AI Experience) as an extended framework for AI systems evaluation. QoAI aims to assess not only efficiency and accuracy, but also human agency, cognitive effort, dependency formation, attentional integrity, and long-term cognitive resilience. The QoAI Experience Dashboard operationalizes these dimensions to make visible the hidden experiential costs of AI systems. Ultimately, the question is no longer only how well AI performs, but what kind of human cognition it sustains or erodes over time.
Biography: Prof. Patrick Le Callet is Full Professor at Polytech Nantes / Nantes Université and Senior Member of the Institut Universitaire de France. He leads interdisciplinary research on human perception, media processing, and cognitive computing. His work focuses on sustainable visual communication, quality of experience, and cognitive computing. Former scientific director of the "Ouest Industries Créatives" cluster, he has authored over 500 publications and 16 patents. Active in international editorial boards and standardization bodies (VQEG, IEEE-SA), he is co‑recipient of a Technology & Engineering Emmy Award in 2020 for his contributions to perceptual metrics in video encoding.
Zechao Li
Zechao Li (李泽超)
Professor and Dean, School of Computer Science and Engineering / School of Artificial Intelligence / School of Software, Nanjing University of Science and Technology
🎤 Robust Multimodal Visual Cognition: From Precise Perception Engines to Interactive Reflective Agents
Abstract: Building robust multimodal AI hinges on elevating static low-level perception into dynamic reasoning systems equipped with structured cognition and self-reflection. This report summarizes our series of research along this trajectory. First, to build a precise visual perception engine, we propose CTNet to resolve pixel-level ambiguity in complex scenes. Furthermore, through Singular Value Fine-tuning (SVF) and the VRP-SAM framework, we achieve open-world generalization with minimal parameter cost and visual reference prompts, establishing a powerful "visual specialist" foundation. However, Multimodal Large Language Models (MLLMs) frequently suffer from hallucinations due to the lack of precise descriptions regarding fine-grained visual attributes and object relations. To address this, we introduce the EDC framework, which leverages the aforementioned visual specialists to finely extract target attributes and transform them into high-quality image-text descriptions, significantly enhancing the visual cognition of MLLMs. Finally, targeting the pain point that large models often reason de novo and repeatedly make the same mistakes, we develop ViLoMem, a dual-stream memory framework that separately encodes logical reasoning errors and visual perception traps. This research achieves a technical leap from precise perception engines to the elimination of multimodal cognitive hallucinations, ultimately evolving into interactive reflective agents equipped with semantic memory and continual learning capabilities.
Biography: Prof. Zechao Li is a Professor and the Dean of the School of Computer Science and Engineering / School of Artificial Intelligence / School of Software at Nanjing University of Science and Technology. His research interests primarily focus on multimodal intelligent analysis and computer vision. He has led several prestigious national grants, including the National Science Fund for Distinguished Young Scholars, the National Science and Technology Major Project for New Generation AI, and Key Projects of the NSFC Joint Fund. Selected as a Young Top-Notch Talent of the National "Ten Thousand Talents Program," he has published over 60 papers in top-tier journals and conferences, such as IEEE TPAMI, IJCV, and CCF Rank-A venues. His major accolades include the First Prize of the Jiangsu Provincial Science and Technology Progress Award (2024, 1st contributor), the First Prize of the Natural Science Award from the Chinese Institute of Electronics (2022, 2nd contributor), and the First Prizes of the Jiangsu Provincial Science and Technology Award (2020 as 2nd contributor, and 2017 as 3rd contributor). Additionally, he received the Best Paper Awards at ACM MM Asia in both 2020 and 2024. Professor Li currently serves as an Associate Editor for renowned journals including IEEE TPAMI, IEEE TCSVT, IEEE TMM, and Pattern Recognition (PR), and previously served on the editorial boards of IEEE TNNLS and Information Sciences.

Full Technical Program details will be announced soon

© 2026 MLMC Conference — Technical Program Information.