SynGauss: Real-Time 3D Gaussian Splatting for Audio-Driven Talking Head Synthesis

Name: SynGauss: Real-Time 3D Gaussian Splatting for Audio-Driven Talking Head Synthesis
Creator: Zhou, Zhanyi
License: No description available

Source: IEEE. Indexed: 2026-04-17

Download link:

https://ieee-dataport.org/documents/syngauss-real-time-3d-gaussian-splatting-audio-driven-talking-head-synthesis

Resource description:

We used a mixed dataset \cite{ye2023geneface} in our experiments: part of the data comes from the publicly available dataset provided by GaussianTalking \cite{li2025talkinggaussian}, and the rest was collected by ourselves. Specifically, we selected four high-definition talking video clips from the publicly available dataset, including two male portraits, Macron and Obama, and one female portrait, May. These clips are centered on the subject, with an average length of 6500 frames at a frame rate of 25 FPS. The videos for May and Macron were cropped and resized to $512\times512$ resolution, while the video for Obama was resized to $450\times450$ resolution, to ensure consistency and compatibility with the model's input requirements.

In addition, we collected two high-definition video clips, featuring one male portrait, Kanghui, and one female portrait, Lizimeng. These clips were recorded at 25 FPS with a duration of 5 minutes and were cropped and resized to $512\times512$ resolution. Adding the self-collected data increases the diversity of the experimental data, covering different genders and speaking styles. Combining publicly available and self-collected data not only expands the scale of the experimental data but also improves the comprehensiveness and adaptability of the model evaluation.
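As a rough sanity check on the clip figures above, the following Python sketch converts between duration and frame count at 25 FPS and computes a square crop box before resizing. The description does not say how the crop was chosen, so the centered crop here is an assumption, and the helper names and the 1280x720 source size are hypothetical, used only for illustration.

```python
from typing import Tuple


def frames_for(duration_s: float, fps: int) -> int:
    """Number of frames in a clip of the given duration."""
    return round(duration_s * fps)


def duration_for(frames: int, fps: int) -> float:
    """Duration in seconds of a clip with the given frame count."""
    return frames / fps


def center_crop_box(width: int, height: int) -> Tuple[int, int, int, int]:
    """Largest centered square as (left, top, right, bottom).

    Assumption: a centered crop; the actual crop used for the
    dataset is not specified in the description.
    """
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    return left, top, left + side, top + side


if __name__ == "__main__":
    # A 5-minute self-collected clip at 25 FPS.
    print(frames_for(5 * 60, 25))
    # Average public clip: 6500 frames at 25 FPS, in seconds.
    print(duration_for(6500, 25))
    # Hypothetical 1280x720 source frame, cropped before resizing to 512x512.
    print(center_crop_box(1280, 720))
```

Under these assumptions, a 5-minute clip at 25 FPS contains 7500 frames, and an average public clip of 6500 frames runs for 260 seconds, so the two sources are of comparable length.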

Provided by:

Zhou, Zhanyi
