Publications

2024

  1. Preprint
    World-consistent Video Diffusion with Explicit 3D Modeling
    Qihang Zhang, Shuangfei Zhai, Miguel Angel Bautista Martin, Kevin Miao, Alexander Toshev, Josh Susskind, and Jiatao Gu
    Preprint (Preprint) , 2024
  2. Preprint
    Dart: Denoising autoregressive transformer for scalable text-to-image generation
    Jiatao Gu, Yuyang Wang, Yizhe Zhang, Qihang Zhang, Dinghuai Zhang, Navdeep Jaitly, Josh Susskind, and Shuangfei Zhai
    Preprint (Preprint) , 2024
  3. Preprint
    3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
    Qihang Zhang, Yinghao Xu, Chaoyang Wang, Hsin-Ying Lee, Gordon Wetzstein, Bolei Zhou, and Ceyuan Yang
    Preprint (Preprint) , 2024
  4. Preprint
    Urban Scene Diffusion through Semantic Occupancy Map
    Junge Zhang, Qihang Zhang, Li Zhang, Ramana Rao Kompella, Gaowen Liu, and Bolei Zhou
    Preprint (Preprint) , 2024
  5. CVPR
    SceneWiz3D: Towards Text-guided 3D Scene Composition
    Qihang Zhang, Chaoyang Wang, Aliaksandr Siarohin, Peiye Zhuang, Yinghao Xu, Ceyuan Yang, Dahua Lin, Bolei Zhou, Sergey Tulyakov, and Hsin-Ying Lee
    Computer Vision and Pattern Recognition (CVPR) , 2024
  6. CVPR
    BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
    Qihang Zhang, Yinghao Xu, Yujun Shen, Bo Dai, Bolei Zhou, and Ceyuan Yang
    Computer Vision and Pattern Recognition (CVPR) , 2024

2023

  1. NeurIPS
    Learning Modulated Transformation in GANs
    Ceyuan Yang, Qihang Zhang, Yinghao Xu, Jiapeng Zhu, Yujun Shen, and Bo Dai
    Neural Information Processing Systems (NeurIPS) , 2023
  2. ICCV
    Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
    Jihao Liu, Tai Wang, Boxiao Liu, Qihang Zhang, Yu Liu, and Hongsheng Li
    International Conference on Computer Vision (ICCV) , 2023
  3. ICLR
    Towards Smooth Video Composition
    Qihang Zhang, Ceyuan Yang, Yujun Shen, Yinghao Xu, and Bolei Zhou
    International Conference on Learning Representations (ICLR) , 2023

2022

  1. CORL
    Generative Category-Level Shape and Pose Estimation with Semantic Primitives
    Guanglin Li, Yifeng Li, Zhichao Ye, Qihang Zhang, Tao Kong, Zhaopeng Cui, and Guofeng Zhang
    Conference on Robotics Learning (CORL) , 2022
  2. ECCV
    Learn to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining
    Qihang Zhang, Zhenghao Peng, and Bolei Zhou
    European Conference on Computer Vision (ECCV) , 2022
  3. TPAMI
    MetaDrive: Composing Diverse Driving Scenarios for Generalizable Learning
    Quanyi Li*, Zhenghao Peng*, Lan Feng, Qihang Zhang, Zhenghai Xue, and Bolei Zhou
    In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) , 2022

2021

  1. IEEE TIP
    F^3A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks
    Xintian Wu, Qihang Zhang, Yiming Wu, Huanyu Wang, Songyuan Li, Lingyun Sun, and Xi Li
    IEEE Transactions on Image Processing (IEEE TIP) , 2021
  2. CVPR workshop
    Improving the Generalization of End-to-End Driving through Procedural Generation
    Quanyi Li*, Zhenghao Peng*, Qihang Zhang, Chunxiao Liu, and Bolei Zhou
    In CVPR embodied AI workshop (CVPR workshop) , 2021