I am a fifth-year (2020.09-) Ph.D. student at the State Key Lab of CAD&CG, Zhejiang University, advised by Prof. Deng Cai and Prof. Xiaofei He. During my internship at Shanghai AI Laboratory, I closely collaborated with Dr. Tong He, Prof. Yu Qiao, and Prof. Wanli Ouyang.

My research interests include 3D vision, robotics, and AIGC. Currently, I focus on advancing spatial intelligence through the development of a 3D foundation model. To achieve this goal, I explore four key directions: (1) Data, cheaply generating 4D data using video generation models; (2) Backbone, handling diverse input through multi-modal learning; (3) Pre-training, scaling up backbones with representation learning; and (4) Downstream, validating efficacy by robot manipulation and 3D scene understanding.

📝 Publications

Video Generation

Arxiv 2024
sym

Depth Any Video with Scalable Synthetic Data

Honghui Yang*, Di Huang*, Wei Yin, Chunhua Shen, Haifeng Liu, Xiaofei He, Binbin Lin, Wanli Ouyang, Tong He

Arxiv, 2024 (Under Review)

arXiv

Robotics

Arxiv 2024
sym

SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

Haoyi Zhu, Honghui Yang, Yating Wang, Jiange Yang, Limin Wang, Tong He

Arxiv, 2024 (Under Review)

arXiv

NeurIPS 2024
sym

EMVP: Embracing Visual Foundation Model for Visual Place Recognition with Centroid-Free Probing

Qibo Qiu, Shun Zhang, Haiming Gao, Honghui Yang, Haochao Ying, Wenxiao Wang, Xiaofei He

NeurIPS, 2024

arXiv

Representation Learning

CVPR 2024
sym

UniPAD: A Universal Pre-training Paradigm for Autonomous Driving

Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, Qibo Qiu, Binbin Lin, Xiaofei He, Wanli Ouyang

CVPR, 2024

arXiv

Arxiv 2023
sym

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Haoyi Zhu*, Honghui Yang*, Xiaoyang Wu*, Di Huang*, Sha Zhang, Xianglong He, Hengshuang Zhao, Chunhua Shen, Yu Qiao, Tong He, Wanli Ouyang

Arxiv, 2023 (TPAMI Under Review)

arXiv

ICCV 2023
sym

Ponder: Point Cloud Pre-training via Neural Rendering

Di Huang, Sida Peng, Tong He, Honghui Yang, Xiaowei Zhou, Wanli Ouyang

ICCV, 2023

arXiv

CVPR 2023
sym

GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds

Honghui Yang*, Tong He*, Jiaheng Liu, Hua Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wanli Ouyang

CVPR, 2023

arXiv

Multi-modal Learning

CVPR 2023
sym

PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer

Honghui Yang, Wenxiao Wang, Minghao Chen, Binbin Lin, Tong He, Hua Chen, Xiaofei He, Wanli Ouyang

CVPR, 2023

arXiv

ECCV 2022
sym

Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph

Honghui Yang, Zili Liu, Xiaopei Wu, Wenxiao Wang, Wei Qian, Xiaofei He, Deng Cai

ECCV, 2022 (Oral)

arXiv

CVPR 2022
sym

Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion

Xiaopei Wu, Liang Peng, Honghui Yang, Liang Xie, Chenxi Huang, Chengqi Deng, Haifeng Liu, Deng Cai

CVPR, 2022 (Oral)

arXiv

💻 Internships

🎖 Honors and Awards

  • Transfar Scholarship, 2023
  • Lu Zengyong Scholarship, 2022
  • Outstanding Postgraduate Student Award, 2021 - 2023
  • Zhejiang Provincial Collegiate Programming Contest, Gold Medal, 2019
  • The ICPC Asia-East Continent Final, Bronze Medal, 2018
  • National Scholarship, 2017