TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
File version
Author(s)
Zhang, Jiawei
Bai, Xiao
Zheng, Jin
Ning, Xin
Zhou, Jun
Gu, Lin
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Leonardis, A
Ricci, E
Roth, S
Russakovsky, O
Sattler, T
Varol, G
Date
Size
File type(s)
Location
Italy, Milan
License
Abstract
Radiance fields have demonstrated impressive performance in synthesizing lifelike 3D talking heads. However, due to the difficulty in fitting steep appearance changes, the prevailing paradigm that presents facial motions by directly modifying point appearance may lead to distortions in dynamic regions. To tackle this challenge, we introduce TalkingGaussian, a deformation-based radiance fields framework for high-fidelity talking head synthesis. Leveraging the point-based Gaussian Splatting, facial motions can be represented in our method by applying smooth and continuous deformations to persistent Gaussian primitives, without requiring to learn the difficult appearance change like previous methods. Due to this simplification, precise facial motions can be synthesized while keeping a highly intact facial feature. Under such a deformation paradigm, we further identify a face-mouth motion inconsistency that would affect the learning of detailed speaking motions. To address this conflict, we decompose the model into two branches separately for the face and inside mouth areas, therefore simplifying the learning tasks to help reconstruct more accurate motion and structure of the mouth region. Extensive experiments demonstrate that our method renders high-quality lip-synchronized talking head videos, with better facial fidelity and higher efficiency compared with previous methods. Code is available at: https://github.com/Fictionarry/TalkingGaussian.
Journal Title
Conference Title
Computer Vision – ECCV 2024: 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part X
Book Title
Edition
Volume
15068
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation
Li, J; Zhang, J; Bai, X; Zheng, J; Ning, X; Zhou, J; Gu, L, TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting, Computer Vision – ECCV 2024: 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part X, 2025, 15068, pp. 127-145