I am currently a second-year PhD student in the 3DVLab at the Hong Kong University of Science and Technology (HKUST), under the supervision of Prof. Ping Tan. Additionally, I am an intern at Horizon Robotics Technology R&D Co., Ltd., where I work closely with Wei Yin. Previously, I earned my M.S. degree from the College of Computer Science at Nankai University in 2024, under the supervision of Prof. Ming-Ming Cheng and Prof. Jun Xu. I received my B.E. degree from the Dalian University of Technology (DLUT) in 2021. My research interests include computer vision and computer graphics. I am also a member of AnySyn3D, a research interest group focusing on various topics related to 3D.

🔥 News

📝 Publications

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

arxiv, 2024

Xiaotao Hu, Wei Yin, Mingkai Jia, Junyuan Deng, Xiaoyang Guo, Qian Zhang, Xiaoxiao Long, Ping Tan

MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization

arxiv, 2025

Mingkai Jia, Wei Yin, Xiaotao Hu, Jiaxin Guo, Xiaoyang Guo, Qian Zhang, Xiao-Xiao Long, Ping Tan

Epona: Autoregressive DiffusionWorld Model for Autonomous Driving

International Conference on Computer Vision (ICCV), 2025

Kaiwen Zhang, Zhenyu Tang, Xiaotao Hu, Xingang Pan, Xiaoyang Guo, Yuan Liu, Jingwei Huang, Li Yuan, Qian Zhang, Xiao-Xiao Long, Xun Cao, Wei Yin

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

International Conference on Computer Vision (ICCV), 2025

Junyuan Deng, Wei Yin, Xiaoyang Guo, Qian Zhang, Xiaotao Hu, Weiqiang Ren, Xiaoxiao Long, Ping Tan

Ctrl-Room: controllable text-to-3D room meshes generation with layout constraints

International Conference on 3D Vision (3DV), 2025

Chuan Fang, Yuan Dong, Kunming Luo, Xiaotao Hu, Rakesh Shrestha, Ping Tan

Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution

Winter Conference on Applications of Computer Vision (WACV), 2024

Zhewei Huang, Ailin Huang, Xiaotao Hu, Chen Hu, Jun Xu, Shuchang Zhou

A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

Computer Vision and Pattern Recognition (CVPR Highlight), 2023

Xiaotao Hu, Zhewei Huang, Ailin Huang, Jun Xu, Shuchang Zhou

Semi-cycled generative adversarial networks for real-world face super-resolution

IEEE Transactions on Image Processing (TIP), 2023

Hao Hou, Jun Xu, Yingkun Hou, Xiaotao Hu, Benzheng Wei, Dinggang Shen

Restore globally, refine locally: A mask-guided scheme to accelerate super-resolution networks

European Conference on Computer Vision (ECCV Oral), 2022

Xiaotao Hu, Jun Xu, Shuhang Gu, Ming-Ming Cheng, Li Liu

2022.09: The Third Prize Scholarship, Nankai University.
2019.11: Silver Medal, The 44th ACM International Collegiate Programming Contest (ICPC), Nanchang.
2019.09: The First Prize Scholarship, Dalian University of Technology.

2024.09 - now, PhD student in Electronic & Computer Engineering, Hong Kong University of Science and Technology (HKUST), Hong Kong.
2021.09 - 2024.06, M.S. in Computer Science, Nankai University (NKU), Tianjin, China.
2017.09 - 2021.06, B.S. in Software Engineering (ranking 4%), Dalian University of Technology (DLUT), Dalian, China.

not yet