From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Project page: ~evonne_ng/projects/audio2photoreal/
Code and data:
Arxiv: coming soon!
Abstract:
We present a framework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction. Given speech audio, we output multiple possibilities of gestural motion for an individual, including face, body, and hands. The key behind our method is in combining the benefits of sample diversity from vector quantization with the high-frequency details obtained through diffusion to generate more dynamic, expressive motion. We visualize the generated motion using highly photorealistic avatars that can express crucial nuances in gestures (e.g. sneers and smirks). To facilitate this line of research, we introduce a first-of-its-kind multi-view conversational dataset that allows for photorealistic reconstruction. Experiments show our model generates appropriate and diverse gestures, outperforming both diffusion- and VQ-only methods. Furthermore, our perceptual evaluation highlights the importance of photorealism (vs. meshes) in accurately assessing subtle motion details in conversational gestures. Code and dataset will be publicly released.
Key parts:
00:15 project overview
00:40 dataset
00:47 method overview
00:55 face motion model
01:10 guide pose predictor
01:26 pose motion model
01:45 avatar renderer
02:31 results: guide poses, diffusion outputs, avatar
03:16 results: muti-sample results
04:15 results: ours vs. LDA vs. Random
04:53 results: ours vs. SHOW vs KNN
05:43 results: generalization to “Friends“ audio
06:10 results: motion editing
1 view
6
1
2 weeks ago 00:00:00 3
COUP DE TONNERRE DIPLOMATIQUE À LONDRES, MACRON PRÉPARE SA SURVIE : LE BORDEL ! | LA MATINALE GPTV
3 weeks ago 00:25:30 1
EU/UK Russia obsession, driving towards the abyss
3 weeks ago 00:10:24 1
Rust | Early Access | Gameplay, Part 7 - WHY SO SERIOUS?!
3 weeks ago 00:03:29 15
Pink Floyd - Cluster One cover
4 weeks ago 00:14:16 1
SCP 682 | Indie Horror Game | - PUFF, THE MAGIC DRAGON ON STEROIDS!
4 weeks ago 00:15:40 1
Scribble | Indie Game | - SAD HORROR!
4 weeks ago 00:18:36 1
It’s Dark Gameplay | Indie Horror Game | - GROWLING DEMONIC SMOKE COW!
4 weeks ago 00:38:33 1
The Groundskeeper | Indie Horror Game | - SLENDER’S GRANDPA?!
4 weeks ago 00:09:47 1
Pelicump | Indie Game | - AGENT 00CROW !?
4 weeks ago 00:14:18 1
Rust | Early Access | Gameplay, Part 6 - PRISONER!
4 weeks ago 00:14:45 1
Within Deep Sorrows, Playthrough /w facecam, Part 2 (final?) - WAKE UP!
4 weeks ago 00:24:00 1
Within Deep Sorrows, Playthrough /w facecam, Part 1 - JUMPSCARE K.O!
4 weeks ago 00:05:57 1
Santa Claws | Indie Christmas Game | Gameplay - SNOWBALL FIGHT!
4 weeks ago 00:00:51 1
MERRY CHRISTMAS /w Zalzar and Sigge!
4 weeks ago 00:16:03 1
Just Cause 2 Multiplayer Madness Montage | Gameplay | - SWAG N’ YOLO!
4 weeks ago 00:09:50 1
Mr Red’s adventure in The Missing Balls | Christmas Game | Gameplay /w facecam - HUGE RED BALLS!
4 weeks ago 00:16:23 1
The Walking Dead | Season 2, Episode 1 | Gameplay Playthrough /w facecam, Part 4 - EPISODE FINALE!
4 weeks ago 00:30:40 1
The Walking Dead | Season 2, Episode 1 | Gameplay Playthrough, Part 3 - DEEP WOUNDS!
4 weeks ago 00:22:48 1
The Walking Dead | Season 2, Episode 1 | Gameplay Playthrough /w facecam, Part 2 - SAM
4 weeks ago 00:21:19 1
The Walking Dead | Season 2, Episode 1 | Gameplay Playthrough /w facecam, Part 1 - CLEMENTINE
4 weeks ago 00:08:58 1
Rust | Early Access | Gameplay, Part 5 - RAIDERS!
4 weeks ago 00:15:16 1
Rust | Early Access | Gameplay, Part 4 - HUNTING AND EXPLORATION!
4 weeks ago 00:09:44 1
Rust | Early Access | Gameplay, Part 3 - HOME SWEET HOME!