I am a Research Scientist at NVIDIA Research, Deep Imagination Group. I received my Ph.D. from Simon Fraser University, supervised by Prof. Yasutaka Furukawa, and my M.S. under the supervision of Prof. Ping Tan.

My research focuses on world models and generative 3D/4D perception for Physical AI.

Selected Publications

  • Cosmos World Foundation Model Platform for Physical AI
    arXiv 2025
    NVIDIA
  • MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single to Sparse-view 3D Object Reconstruction
    ECCV 2024
    Shitao Tang*, Jiacheng Chen*, Dilin Wang*, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Rakesh Ranjan, Yasutaka Furukawa
  • MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion
    NeurIPS 2023 Spotlight
    Shitao Tang*, Fuyang Zhang*, Jiacheng Chen, Peng Wang, Yasutaka Furukawa
  • NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization
    CVPR 2023
    Shitao Tang, Sicong Tang, Andrea Tagliasacchi, Ping Tan, Yasutaka Furukawa
  • QuadTree Attention for Vision Transformers
    ICLR 2022
    Shitao Tang*, Jiahui Zhang*, Siyu Zhu, Ping Tan

Full publication list