Multi-Modal Human Modeling, Analysis and Synthesis

Jun Yu, Changwei Luo, Chang Wen Chen (Herausgeber)

Buch | Hardcover

320 Seiten

2025
CRC Press (Verlag)
978-1-032-52764-2 (ISBN)

Artikel merken

Multi-modal Human Modeling, Analysis and Synthesis aims to adopt a structured perspective, building a comprehensive technical framework for multi-modal human modeling, analysis, and synthesis—progressing from local details to holistic perspectives, and from face features to body dynamics.

In today’s world, where intelligent technologies are deeply transforming human-computer interaction and virtual reality, multi-modal human modeling, analysis and synthesis have become central topics in computer vision. As application scenarios grow increasingly complex, new technologies continue to emerge to address these challenges. These techniques demand systematic summarization and practical guidance.

To meet this need, Multi-Modal Human Modeling, Analysis and Synthesis aims to adopt a structured perspective, building a comprehensive technical framework for multi-modal human modeling, analysis and synthesis—progressing from local details to holistic perspectives, and from face features to body dynamics.

This book begins by examining the anatomy structures and characteristics of human faces and bodies, then analyzes how traditional methods and deep learning approaches provide robust optimization solutions for modeling. For example, it explores how to address challenges in face recognition caused by lighting changes, occlusions, face expressions and aging, as well as methods for body localization, reconstruction, recognition and anomaly detection in multi-modal scenarios. It also explains how multi-modal data can drive realistic face and body synthesis. A standout feature is its focus on Huawei’s MindSpore framework, bridging the gap between algorithms and engineering through practical case studies. From building face detection and recognition pipelines with the MindSpore toolkit to accelerating model training via automatic parallel computing, and solving large language model (LLM) training challenges, each step is supported by reproducible code and design logic.

Designed for researchers and engineers in computer vision and AI, this book balances theoretical foundations with industry-ready technical details. Whether you aim to enhance the reliability of biometric recognition, explore creative possibilities in virtual-real interactions or optimize the deployment of deep learning frameworks, this guide serves as an essential link between academic advancements and real-world applications.

Jun Yu is currently an associate professor and laboratory director with the Department of Automation and the Institute of Advanced Technology, University of Science and Technology of China. His research interests are Multimedia Computing and Intelligent Robot. He has published 200+ journal articles and conference papers in TPAMI, IJCV, JMLR, TIP, TMM, etc. He has received 6 Best Paper Awards from premier conferences, including CVPR PBVS, ICCV MFR, ICME, FG, and won 60+ championships from Grand Challenges held in NeurIPS, CVPR, ICCV, MM, ECCV, IJCAI, AAAI. Changwei Luo is an assistant research fellow at Department of Electronic Engineering, Tsinghua University. He is also working with Academy of Military Sciences, Beijing, China. His research interests cover computer vision and human-machine interaction. He has published more than 40 papers. Chang Wen Chen is currently Chair Professor of Visual Computing at The Hong Kong Polytechnic University. He was previously an Empire Innovation Professor of Computer Science and Engineering at the University at Buffalo, State University of New York from 2008 to 2021. He also served as Dean of the School of Science and Engineering at The Chinese University of Hong Kong, Shenzhen from 2017 to 2020. He was Allen Henry Endow Chair Professor at the Florida Institute of Technology from 2003 to 2007. He was on the faculty of Electrical and Computer Engineering at the University of Missouri-Columbia from 1996 to 2003 and on the faculty of Electrical and Computer Engineering at the University of Rochester from 1992 to 1996. He is currently the Associate Editor-in-Chief of IEEE Transactions on Biometrics, Behavior, and Identity Science. He has been an Editor-in-Chief or Editor for several other major IEEE Transactions and Journals, including the IEEE Transactions on Multimedia, IEEE Transactions on Circuits and Systems for Video Technology, Proceedings of IEEE, IEEE Journal of Selected Areas in Communications, and IEEE Journal of Emerging and Selected Topics in Circuits and Systems. He has served as Conference Chair for several major IEEE, ACM and SPIE conferences related to multimedia video communications and signal processing. His research has been supported by NSF, DARPA, Air Force, NASA, Whitaker Foundation, Microsoft, Intel, Kodak, Huawei, and Technicolor. Chen received his BS degree from University of Science and Technology of China in 1983, MSEE degree from University of Southern California in 1986, and Ph.D. degree from University of Illinois at Urbana-Champaign in 1992. He and his students have received nine Best Paper Awards or Best Student Paper Awards. He has also received several research and professional achievement awards, including Sigma Xi Excellence in Graduate Research Mentoring Award in 2003, Alexander von Humboldt Research Award in 2009, the University at Buffalo Exceptional Scholar - Sustained Achievement Award in 2012, the State University of New York System Chancellor’s Award for Excellence in Scholarship and Creative Activities in 2016, and the Distinguished ECE Alumni Award from University of Illinois at Urbana-Champaign in 2019. He is an IEEE Fellow since 2004, an SPIE Fellow since 2007 and a member of Academia Europaea since 2021.

Chapter 1 Introduction

Jun Yu, Changwei Luo, Chang Wen Chen, Fengxin Chen and Wei Xu

Chapter 2 Human Face Modeling

Jun Yu, Changwei Luo and Fengxin Chen

Chapter 3 Human Face Analysis

Jun Yu, Changwei Luo and Fengxin Chen

Chapter 4 Human Face Synthesis

Jun Yu and Fengxin Chen

Chapter 5 Human Body Modeling

Fengxin Chen and Jun Yu

Chapter 6 Human Body Analysis

Fengxin Chen and Jun Yu

Chapter 7 Human Body Synthesis

Fengxin Chen and Jun Yu

Chapter 8 MindSpore: An All-Scenario Deep Learning Computing Framework

Peng He, Jun Yu, Xuefeng Jin, Fan Yu, Cong Wang, Wei Zheng and Yuanyuan Tuo

Erscheinungsdatum	13.09.2025
Zusatzinfo	11 Tables, black and white; 24 Line drawings, black and white; 48 Halftones, black and white; 72 Illustrations, black and white
Verlagsort	London
Sprache	englisch
Maße	156 x 234 mm
Gewicht	770 g
Themenwelt	Mathematik / Informatik ► Informatik ► Software Entwicklung
Themenwelt	Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik
ISBN-10	1-032-52764-1 / 1032527641
ISBN-13	978-1-032-52764-2 / 9781032527642
Zustand	Neuware