Weihao Yuan | 原玮浩

I received the Ph.D. degree from Robotics Institute, Hong Kong University of Science and Technology, where I worked on Robot Reinforcement Learning and 3D Vision under the supervision of Prof. Michael Yu Wang and Prof. Qifeng Chen. I was a visiting scholar at the Robotics, Perception and Learning Lab, KTH, working with Prof. Danica Kragic. Before that, I obtained the Bachelor degree from Zhejiang University, majoring in Automation and minoring in Finance.

My research interests include Embodied AI (VLA / WAM / RL / Humanoid Motion) and 3D Vision (Reconstruction / Generation / World Model). Glad to be exploring the real Physical AI in this era.

I am actively looking for prospective Ph.D/Master students and RAs/Interns at Nanjing University. Please drop me an email if you are interested.

News

2025-06-26
Two paper accepted to ICCV 2025:
● LHM: Animatable Human Reconstruction from a Single Image in One Second
● Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration

2025-04-20
One paper accepted to SIGGRAPH 2025:
● LAM: Large Avatar Model for One-shot Animatable Gaussian Head

2025-02-27
Two paper accepted to CVPR 2025:
● Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture
● AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction

2025-01-23
One paper accepted to ICLR 2025:
● LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning

2024-10-10
Two papers accepted to NeurIPS 2025:
● GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation Oral
● MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling

2024-07-30
One paper accepted to SIGGRAPH Asia 2025 & TOG:
● MVImgNet2.0: A Larger-scale Dataset of Multi-view Images

2024-07-10
One paper accepted to IEEE Robotics and Automation Letters (RA-L):
● OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation

2024-07-05
Three papers accepted to ECCV 2024:
● Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
● An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-2-Image Models
● High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding

2024-02-27
Three papers accepted to CVPR 2024:
● IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images Highlight
● RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D Highlight
● GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors Oral