Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Chat-driven 3D human pose and shape editing with Large Language Models

Zhou, Feng, Li, Chi, Dai, Ju, Zhu, Mengxiao, Zhang, Yongmei, Lai, Yukun ORCID: https://orcid.org/0000-0002-2094-5680 and Rosin, Paul ORCID: https://orcid.org/0000-0002-4965-3884 2025. Chat-driven 3D human pose and shape editing with Large Language Models. Presented at: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, 6-11 April 2025. Published in: Rao, B. D., Trancoso, I., Sharma, G. and Mehta, N. B. eds. Proceedings of the International Conference on Acoustics, Speech and Signal Processing. IEEE, pp. 1-5. 10.1109/ICASSP49660.2025.10887921

[thumbnail of LLMEdit_ICASSP2025.pdf]
Preview
PDF - Accepted Post-Print Version
Download (2MB) | Preview

Abstract

Generating and creating humanoid 3D models has received increasing attention recently due to its fundamental support for many high-level 3D applications. Although automatic 3D pose and shape reconstruction methods have achieved promising results, there are still some failure cases due to self-occlusions, viewpoint changes, and the complexity of human pose articulations. In this paper, we propose a novel way to leverage Large Language Models (LLMs) to interactively reconstruct human pose and shape based on a Skinned Multi-Person Linear (SMPL) model. We construct a mapping table to fine-tune an LLM, enabling it to understand user inputs better and output the positional information of joint points. Additionally, a simple neural network is adopted to regress the shape cues of the SMPL. We demonstrate a gallery of results of numerous poses and shapes. We validate our method via numerical evaluations, user studies, and comparisons to manually posed characters and previous work.

Item Type: Conference or Workshop Item (Paper)
Date Type: Published Online
Status: Published
Schools: Schools > Computer Science & Informatics
Publisher: IEEE
ISBN: 979-8-3503-6875-8
ISSN: 1520-6149
Related URLs:
Date of First Compliant Deposit: 8 February 2025
Date of Acceptance: 18 December 2024
Last Modified: 26 Mar 2025 15:15
URI: https://orca.cardiff.ac.uk/id/eprint/176050

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics