Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

Abstract

Generating 3D human models directly from text helps reduce the cost and time of character modeling. However, achieving multi-attribute controllable and realistic 3D human avatar generation is still challenging due to feature coupling and the scarcity of realistic 3D human avatar datasets. To address these issues, we propose Text2Avatar, which can generate realistic-style 3D avatars based on the coupled text prompts. Text2Avatar leverages a discrete codebook as an intermediate feature to establish a connection between text and avatars, enabling the disentanglement of features. Furthermore, to alleviate the scarcity of realistic style 3D human avatar data, we utilize a pre-trained unconditional 3D human avatar generation model to obtain a large amount of 3D avatar pseudo data, which allows Text2Avatar to achieve realistic style generation. Experimental results demonstrate that our method can generate realistic 3D avatars from coupled textual data, which is challenging for other existing methods in this field.

Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

Chaoqun Gong^1†

Yuqin Dai^2†

Ronghui Li¹

Achun Bao¹

Jun Li²

Jian Yang²

Yachao Zhang^1*

Xiu Li^1*

¹Shenzhen International Graduate School, Tsinghua University, China

²Nanjing University of Science and Technology, China

Abstract

Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

Chaoqun Gong1†

Yuqin Dai2†

Ronghui Li1

Achun Bao1

Jun Li2

Jian Yang2

Yachao Zhang1*

Xiu Li1*

1Shenzhen International Graduate School, Tsinghua University, China

2Nanjing University of Science and Technology, China

Abstract

Chaoqun Gong^1†

Yuqin Dai^2†

Ronghui Li¹

Achun Bao¹

Jun Li²

Jian Yang²

Yachao Zhang^1*

Xiu Li^1*

¹Shenzhen International Graduate School, Tsinghua University, China

²Nanjing University of Science and Technology, China