Zhou, Feng, Li, Chi, Dai, Ju, Zhu, Mengxiao, Zhang, Yongmei, Lai, Yukun ORCID: https://orcid.org/0000-0002-2094-5680 and Rosin, Paul ORCID: https://orcid.org/0000-0002-4965-3884
2025.
Chat-driven 3D human pose and shape editing with Large Language Models.
Presented at: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP),
Hyderabad, India,
6-11 April 2025.
Published in: Rao, B. D., Trancoso, I., Sharma, G. and Mehta, N. B. eds.
Proceedings of the International Conference on Acoustics, Speech and Signal Processing.
IEEE,
pp. 1-5.
10.1109/ICASSP49660.2025.10887921
|
Preview |
PDF
- Accepted Post-Print Version
Download (2MB) | Preview |
Abstract
Generating and creating humanoid 3D models has received increasing attention recently due to its fundamental support for many high-level 3D applications. Although automatic 3D pose and shape reconstruction methods have achieved promising results, there are still some failure cases due to self-occlusions, viewpoint changes, and the complexity of human pose articulations. In this paper, we propose a novel way to leverage Large Language Models (LLMs) to interactively reconstruct human pose and shape based on a Skinned Multi-Person Linear (SMPL) model. We construct a mapping table to fine-tune an LLM, enabling it to understand user inputs better and output the positional information of joint points. Additionally, a simple neural network is adopted to regress the shape cues of the SMPL. We demonstrate a gallery of results of numerous poses and shapes. We validate our method via numerical evaluations, user studies, and comparisons to manually posed characters and previous work.
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Date Type: | Published Online |
| Status: | Published |
| Schools: | Schools > Computer Science & Informatics |
| Publisher: | IEEE |
| ISBN: | 979-8-3503-6875-8 |
| ISSN: | 1520-6149 |
| Related URLs: | |
| Date of First Compliant Deposit: | 8 February 2025 |
| Date of Acceptance: | 18 December 2024 |
| Last Modified: | 26 Mar 2025 15:15 |
| URI: | https://orca.cardiff.ac.uk/id/eprint/176050 |
Actions (repository staff only)
![]() |
Edit Item |





Dimensions
Dimensions