SyncTweedies: A General Generative Framework Based on Synchronized Diffusions

architecture

Abstract

We introduce a general framework for generating diverse visual content, including ambiguous images, panorama images, mesh textures, and Gaussian splat textures, by synchronizing multiple diffusion processes. We present exhaustive investigation into all possible scenarios for synchronizing multiple diffusion processes through a canonical space and analyze their characteristics across applications. In doing so, we reveal a previously unexplored case: averaging the outputs of Tweedie's formula while conducting denoising in multiple instance spaces. This case also provides the best quality with the widest applicability to downstream tasks. We name this case SyncTweedies. In our experiments generating visual content aforementioned, we demonstrate the superior quality of generation by SyncTweedies compared to other synchronization methods, optimization-based and iterative-update-based methods.


3D Mesh Texturing

🎬 3D Mesh


"A dumpster"

"A clutch bag"

"A lemon"

"A hand carved wood turtle"


🎬 Qualitative Results

"A nascar"

"A hamburger"

"An hourglass"

"A jeep"


🎨 Luma AI 3D Mesh Re-Texturing

"A turtle"

âž¡

"A golden statue
of a turtle"

"A car"

âž¡

"A luxurious
red sports car"

"A lantern"

âž¡

"A chinese style lantern"

"A nascar"

âž¡

"A car with graffiti"

"An elephant"

âž¡

"An african elephant"

"An axe"

âž¡

"A wooden axe"


3D Gaussian Splat Texturing

🎬 Qualitative Results


"A majestic red chair"

"A photo of cucumbers"

"A photo of a yellow excavator covered in snow"

"A photo of a white cruise ship at sea"

"A leather chair"

"A photo of corns"

"A white drum kit"

"A photo of a pirate ship at sea"


Ambiguous Images

🎬 Qualitative Results

Clockwise 90° Rotation

Color Inversion

Patch Permutation


Panorama Generation

🎬 Qualitative Results

"A photo of a mountain range at twilight"

"A photo of a beautiful ocean with coral reef"

"A photo of a lake under the northern lights"


Depth-to-360-Panorama Generation

🎬 Qualitative Results

"A house at night"

"An old looking library"

"A room that has been painted gold"


💡 Comparison with Other Methods

🚀 3D Mesh Texturing

Case1

Case2

(SyncTweedies)

Case3

Case4

Case5

Paint3D

Paint-it

TEXTure

Text2Tex

"Baseball glove"

Case1

Case2

(SyncTweedies)

Case3

Case4

Case5

Paint3D

Paint-it

TEXTure

Text2Tex

"Minivan"

Case1

Case2

(SyncTweedies)

Case3

Case4

Case5

Paint3D

Paint-it

TEXTure

Text2Tex

"iPod"


🚀 3D Gaussian Splat Texturing

Case1

Case2

(SyncTweedies)

Case3

Case4

Case5

SDS

MVDream-SDS

IN2N

"[S*] a wooden carving of a microphone"

Case1

Case2

(SyncTweedies)

Case3

Case4

Case5

SDS

MVDream-SDS

IN2N

"[S*] a drum kit made of ruby"

Case1

Case2

(SyncTweedies)

Case3

Case4

Case5

SDS

MVDream-SDS

IN2N

"[S*] a photo of a military ship at sea"