I am currently a second-year PhD student in the 3DVLab at the Hong Kong University of Science and Technology (HKUST), under the supervision of Prof. Ping Tan. Additionally, I am an intern at Horizon Robotics Technology R&D Co., Ltd., where I work closely with Wei Yin. Previously, I earned my M.S. degree from the College of Computer Science at Nankai University in 2024, under the supervision of Prof. Ming-Ming Cheng and Prof. Jun Xu. I received my B.E. degree from the Dalian University of Technology (DLUT) in 2021. My research interests include computer vision and computer graphics. I am also a member of AnySyn3D, a research interest group focusing on various topics related to 3D.

🔥 News

  • 2025.06: 🎉🎉 Two papers accepted to ICCV 2025.
  • 2024.11: 🎉🎉 One paper accepted to 3DV 2025.
  • 2023.02: 🎉🎉 One paper accepted to CVPR 2023 (Highlight).
  • 2023.01: 🎉🎉 One paper accepted to TIP.
  • 2022.07: 🎉🎉 One paper accepted to ECCV 2022 (Oral).

πŸ“ Publications

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

arXiv, 2024

Xiaotao Hu, Wei Yin, Mingkai Jia, Junyuan Deng, Xiaoyang Guo, Qian Zhang, Xiaoxiao Long, Ping Tan

Paper | Code | Project page

MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization

arXiv, 2025

Mingkai Jia, Wei Yin, Xiaotao Hu, Jiaxin Guo, Xiaoyang Guo, Qian Zhang, Xiao-Xiao Long, Ping Tan

Paper

Epona: Autoregressive Diffusion World Model for Autonomous Driving

International Conference on Computer Vision (ICCV), 2025

Kaiwen Zhang, Zhenyu Tang, Xiaotao Hu, Xingang Pan, Xiaoyang Guo, Yuan Liu, Jingwei Huang, Li Yuan, Qian Zhang, Xiao-Xiao Long, Xun Cao, Wei Yin

Paper

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

International Conference on Computer Vision (ICCV), 2025

Junyuan Deng, Wei Yin, Xiaoyang Guo, Qian Zhang, Xiaotao Hu, Weiqiang Ren, Xiaoxiao Long, Ping Tan

Paper

Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints

International Conference on 3D Vision (3DV), 2025

Chuan Fang, Yuan Dong, Kunming Luo, Xiaotao Hu, Rakesh Shrestha, Ping Tan

Paper

Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution

Winter Conference on Applications of Computer Vision (WACV), 2024

Zhewei Huang, Ailin Huang, Xiaotao Hu, Chen Hu, Jun Xu, Shuchang Zhou

Paper | Code | BibTex

A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

Computer Vision and Pattern Recognition (CVPR Highlight), 2023

Xiaotao Hu, Zhewei Huang, Ailin Huang, Jun Xu, Shuchang Zhou

Paper | Code | BibTex

Semi-Cycled Generative Adversarial Networks for Real-World Face Super-Resolution

IEEE Transactions on Image Processing (TIP), 2023

Hao Hou, Jun Xu, Yingkun Hou, Xiaotao Hu, Benzheng Wei, Dinggang Shen

Paper | Code | BibTex

Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks

European Conference on Computer Vision (ECCV Oral), 2022

Xiaotao Hu, Jun Xu, Shuhang Gu, Ming-Ming Cheng, Li Liu

Paper | Code | BibTex

🎖 Honors and Awards

  • 2022.09: The Third Prize Scholarship, Nankai University.
  • 2019.11: Silver Medal, The 44th ACM International Collegiate Programming Contest (ICPC), Nanchang.
  • 2019.09: The First Prize Scholarship, Dalian University of Technology.

📖 Education

  • 2024.09 - now, PhD student in Electronic & Computer Engineering, Hong Kong University of Science and Technology (HKUST), Hong Kong.
  • 2021.09 - 2024.06, M.S. in Computer Science, Nankai University (NKU), Tianjin, China.
  • 2017.09 - 2021.06, B.S. in Software Engineering (ranking 4%), Dalian University of Technology (DLUT), Dalian, China.

💬 Invited Talks

not yet

💻 Internships