Home  ·   Publications  ·   More


Zhongang Cai   蔡中昂


Hi there! I'm a Staff Research Scientist at SenseTime Research, working with Dr. Lei Yang. My research focuses on multimodal foundation models with an emphasis on spatial intelligence. As a side project, I lead DLP3D, an open-source framework for building real-time autonomous 3D characters. I received my Ph.D. from MMLab@NTU, advised by Prof. Ziwei Liu and Prof. Chen Change Loy, where I spent wonderful years exploring virtual humans.

Google Scholar X GitHub HuggingFace YouTube LinkedIn

News

[2025-10] Release of the source code of DLP3D. Try it now at dlp3d.ai !

[2025-10] Digital Life Project 2 (DLP3D) has been accepted to SIGGRAPH Asia 2025 (Real-Time Live!).

[2025-10] SMPLest-X has been accepted to TPAMI 2025.

[2025-09] PoseFuse3D-KI has been accepted to NeurIPS 2025.

[2025-08] Release of technical report: Has GPT-5 Achieved Spatial Intelligence? An Empirical Study.

[2025-06] DPoser-X (Oral) has been accepted to ICCV 2025.

[2025-05] ADHMR has been accepted to ICML 2025.

[2025-02] SOLAMI, Disco4D, and EgoLife have been accepted to CVPR 2025.

[2025-01] MeshAnything has been accepted to ICLR 2025.

[2024-12] I have started a new role as a Staff Research Scientist at SenseTime Research.

[2024-09] Release of HuMMan v1.0: Motion Generation Subset and GTA-Human II Dataset.

[2024-08] Release of HuMMan v1.0: 3D Vision Subset.

[2024-08] GTA-Human has been accepted to TPAMI 2024 after two years of review!

[2024-07] WHAC and Large Motion Model have been accepted to ECCV 2024.

[2024-06] I have defended my Ph.D. thesis Scaling Up Parametric Human Recovery! 🎓

[2024-04] Invited talk (slides) at China3DV 2024.

[2024-02] Digital Life Project, AiOS, and GaussianEditor have been accepted to CVPR 2024.

[2024-02] Invited talk on SMPLer-X (recording) at International Digital Economy Academy (IDEA).

My Three Favorite Works [Full List]



Digital Life Project 2: Open-source Autonomous 3D Characters on the Web

Zhongang Cai, Daxuan Ren, Yang Gao, Yukun Wei, Tongxi Zhou, Zhengyu Lin, Huimuk Jang, Haoyang Zeng, Chen Change Loy, Ziwei Liu, Lei Yang.
SIGGRAPH Asia (Real-Time Live!), 2025

Project Page Code Demo

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation

Zhongang Cai*, Wanqi Yin*, Ailing Zeng, Chen Wei, Qingping Sun, Yanjun Wang, Hui En Pang, Haiyi Mei, Mingyuan Zhang, Lei Zhang, Chen Change Loy, Lei Yang, Ziwei Liu.
NeurIPS (Datasets and Benchmarks Track), 2023

Project Page PDF Code Video Demo 新智元 商汤学术 我爱计算机视觉 IDEA论文一刻(B站) CVer(B站)

HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling

Zhongang Cai*, Daxuan Ren*, Ailing Zeng*, Zhengyu Lin*, Tao Yu*, Wenjia Wang*, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu.
European Conference on Computer Vision (ECCV), 2022 (Oral)

Project Page PDF Code Data Video