[Figure: method overview. Labels: Source; View 1, View 2 + Static 3DGS = Dynamic 3DGS; Source, Target.]
(1) Structured Multiview Motion Inversion. We first use an inversion method to extract Motion Embeddings from the source video, with multiple Anchor Embeddings ensuring that information is shared across viewing angles.
(2) View-aware Semantic Motion Transfer. We then apply the learned motion embeddings to multiple rendered views of the target object, resulting in a sparse, inconsistent set of 2D videos.
(3) 4D Consolidation. Our final step is a consolidation process that transforms the generated …supervisions into a 4D representation. We use a control-point mechanism together with a novel rotation constraint, resulting in a smooth, temporally coherent dynamic 3D model.
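To make the consolidation step more concrete, the following is a minimal NumPy sketch of how a control-point mechanism could drive the centers of a static set of Gaussians. It is an illustration under stated assumptions, not our implementation: the names `blend_weights`, `deform`, and `rotation_smoothness`, the radial-basis weighting, and the parameters `sigma` and `radius` are all hypothetical. In particular, the rotation penalty shown here (a squared difference between the rotations of nearby control points) is a generic smoothness term standing in for the rotation constraint, whose exact form is not given in this overview.

```python
import numpy as np

def blend_weights(points, ctrl, sigma=0.5):
    """Normalized radial-basis weights of each point w.r.t. each control point.

    points: (N, 3) Gaussian centers, ctrl: (K, 3) control point positions.
    The RBF form and sigma are assumptions made for this sketch.
    """
    d2 = ((points[:, None, :] - ctrl[None, :, :]) ** 2).sum(-1)  # (N, K)
    w = np.exp(-d2 / (2.0 * sigma ** 2))
    return w / (w.sum(axis=1, keepdims=True) + 1e-12)

def deform(points, ctrl, rotations, translations, sigma=0.5):
    """Move points by blending per-control-point rigid transforms.

    rotations: (K, 3, 3), translations: (K, 3), both for a single time step.
    Each control point rotates a point about itself, then translates it.
    """
    w = blend_weights(points, ctrl, sigma)                       # (N, K)
    local = points[:, None, :] - ctrl[None, :, :]                # (N, K, 3)
    rotated = np.einsum("kij,nkj->nki", rotations, local)        # (N, K, 3)
    moved = rotated + ctrl[None, :, :] + translations[None, :, :]
    return (w[..., None] * moved).sum(axis=1)                    # (N, 3)

def rotation_smoothness(rotations, ctrl, radius=1.0):
    """Assumed stand-in penalty: mean squared Frobenius difference between
    the rotations of control points that lie within `radius` of each other."""
    d = np.linalg.norm(ctrl[:, None, :] - ctrl[None, :, :], axis=-1)
    neighbors = (d < radius) & (d > 0)                           # (K, K)
    diff = rotations[:, None] - rotations[None, :]               # (K, K, 3, 3)
    per_pair = (diff ** 2).sum(axis=(-1, -2))
    return float((per_pair * neighbors).sum() / max(neighbors.sum(), 1))
```

In a sketch like this, the per-frame `rotations` and `translations` would be the quantities optimized against the generated 2D videos, with the smoothness term added to the loss to discourage neighboring control points from rotating independently.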