Fangneng Zhan

Biography

I am an Assistant Professor at The Hong Kong University of Science and Technology, leading the World Mind Lab (心象实验室). Previously, I was a researcher at MIT working with Prof. Paul Liang, and also collaborating with Prof. Yilun Du at Harvard University. Before that, I was a postdoctoral researcher at Max Planck Institute for Informatics and received my Ph.D. in Computer Science & Engineering from Nanyang Technological University, Singapore. I obtained bachelor's degree in Communication Engineering from University of Electronic Science and Technology of China.

I study 3D World Modeling — teaching AI to understand and model the 3D world geometry, visual appearance, future dynamics, etc, with ultimate goal of enabling AI to perceive and interact with the physical world. My research covers 3D vision and generative models, with applications in neural rendering, 3D reconstruction, and embodied robotics.

I will be recruiting PhD students in the Spring/Fall 2027 application cycle. If you’re interested, please send me an email (fnzhan@ust.hk).
Preference will be given to applicants with embodied AI expertise or prior internship experience in our group before pursuing a PhD.
Due to the high volume of emails, only shortlisted candidates will be contacted, but please be assured I'll review all the applications very carefully!

Collaborated Students

∙ Caden Wong (Undergrad at MIT).
∙ Ishita Goluguri (Undergrad & Master at MIT)
∙ Christina Wong (Undergrad at Harvard University)
∙ Leslie Gu (PhD at Harvard University)
∙ Yifan Liu (Undergrad at Tsinghua University)
∙ Yue Ma (PhD at HKUST)
∙ Muyu Xu (PhD at NTU)
∙ Felix Moeller (Master at ETH Zurich)
∙ Tianling Xu (Undergrad at SUSTech)
∙ Zhengyang Lin (Undergrad & Master at NUS)

∙ Weihan Xu (Master at Duke University → PhD at UW Seattle)
∙ Kan Jen Cheng (Undergrad at UC Berkeley → PhD at University of Maryland)
∙ Benhao Huang (Undergrad at SJTU → Master at CMU)
∙ Freeman Cheng (Undergrad at UToronto → PhD at UC Merced)
∙ Yuelei Li (Undergrad at UCSD → Master at Caltech)
∙ Yu Chi (Master at TUM → PhD at TUM)

News

∙ [06, 2026] We are organizing a special issue Generative Models for Computer Vision in Pattern Recognition.
∙ [11, 2025] Invited talk at DeCoDE Lab, MIT: Learning to Represent and Render the 3D World.
∙ [10, 2025] Job talk at AIS, HKUST: Learning to Create and Control the 3D World.
∙ [07, 2025] Job talk at ETH Zurich: Learning to Represent and Render the 3D World.
∙ [03, 2025] Job talk at Imperial College London: Learning to Represent and Render the 3D World.
∙ [02, 2025] Invited talk at Peking University: Learning to Represent and Render the 3D World.
∙ [04, 2024] Invited talk at Visual Computing Group, Harvard University: Learning to Represent and Render the 3D World.
∙ [04, 2024] Invited talk at Jiajun Wu's Group, Stanford University: Towards Autonomous Scene Representation & Rendering.
∙ [04, 2024] Two papers are accepted to SIGGRAPH 2024 and TOG, respectively.
∙ [03, 2024] Invited talk at SIGS, Tsinghua University: Autonomous Rendering Intelligence.
∙ [02, 2024] Invited talk at IDS, The University of Hong Kong: Autonomous Rendering Intelligence.
∙ [12, 2023] We are organizing two workshops at CVPR 2024, including Neural Rendering Intelligence and 2nd Generative Models for Computer Vision.
∙ [09, 2023] One paper is accepted to IJCV, one is accepted to NeurIPS 2023.
∙ [08, 2023] One paper about Generative AI is accepted to TPAMI 2023.
∙ [07, 2023] Two papers are accepted to ICCV 2023.

Research Overview

Selected Publications

GEM-4D: Geometry-Enhanced Video World Models for Robot Manipulation

Kaichen Zhou, Yuzhen Chen, Fangneng Zhan, Hang Hua, Grace Chen, Xinhai Chang, Ao Qu, Yilun Du, Zhuang Liu, Paul Pu Liang, Mengyu Wang
Preprint, 2026
Paper | Project

EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation

Yue Ma, Xu Ye, Qinghe Wang^†, Yucheng Wang, Hongyu Liu, Yinhan Zhang, Xinyu Wang, Yuanpeng Che, Shanhui Mo, Paul Liang, Fangneng Zhan^†, Qifeng Chen
SIGGRAPH, 2026
Paper | Project

Abstract 3D Perception for Spatial Intelligence in Vision-Language Models

Yifan Liu, Fangneng Zhan, Kaichen Zhou, Yilun Du, Paul Pu Liang, Hanspeter Pfister
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026
Paper | Project

Flow Equivariant World Modeling for Partially Observed Dynamic Environments

Hansen Lillemark, Benhao Huang, Fangneng Zhan, Yilun Du, T. Anderson Keller
International Conference on Machine Learning (ICML), 2026
Paper | Project

PAGE-4D: Disentangled Pose and Geometry Estimation for 4D Perception

Kaichen Zhou, Yuhan Wang, Grace Chen, Xinhai Chang, Gaspard Beaudouin, Fangneng Zhan, Paul Pu Liang, Mengyu Wang
International Conference on Learning Representations (ICLR), 2026
Paper | Project

Advances in Feed-Forward 3D Reconstruction and View Synthesis

Jiahui Zhang*, Yuelei Li*, Anpei Chen, Muyu Xu, Kunhao Liu, Jianyuan Wang, Xiaoxiao Long, Hanxue Liang, Zexiang Xu, Hao Su, Christian Theobalt, Christian Rupprecht, Andrea Vedaldi, Kaichen Zhou, Hanspeter Pfister, Paul Pu Liang, Shijian Lu, Fangneng Zhan
Computer Graphics Forum (CGF), 2026
Paper | Project

Visual Acoustic Fields

Yuelei Li, Hyunjin Kim, Fangneng Zhan, Ri-Zhao Qiu, Mazeyu Ji, Xiaojun Shan, Xueyan Zou, Paul Liang, Hanspeter Pfister, Xiaolong Wang
Preprint, 2025
Paper | Project

3DPR: Single Image 3D Portrait Relighting with Generative Priors

Pramod Rao, Xilong Zhou, Abhimitra Meka, Gereon Fox, Mallikarjun B R, Fangneng Zhan, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Thabo Beeler, Mohamed Elgharib, Marc Habermann, Christian Theobalt
SIGGRAPH ASIA, 2025
Paper | Project | Code

Evolutive Rendering Models

Fangneng Zhan*, Hanxue Liang*, Michael Niemeyer, Yifan Wang, Michael Oechsle, Adam Kortylewski, Cengiz Oztireli, Gordon Wetzstein, Christian Theobalt
Preprint, 2024
Paper | Project
A framework for the autonomous evolution of principal elements in rendering models.

Lite2Relight: 3D-aware Single Image Portrait Relighting

Pramod Rao, Gereon Fox, Abhimitra Meka, Mallikarjun B R, Tim Weyrich, Fangneng Zhan, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt
SIGGRAPH, 2024
Paper | Project | Code

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

Kunhao Liu, Fangneng Zhan, Muyu Xu, Christian Theobalt, Ling Shao, Shijian Lu
SIGGRAPH Asia, 2024 (Technical Communications)
Paper | Project | Code

General Neural Gauge Fields

Fangneng Zhan, Lingjie Liu, Adam Kortylewski, Christian Theobalt
International Conference on Learning Representations (ICLR), 2023
Paper | Project | Code
A general paradigm and framework for learning gauge transformations in neural fields.

TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis

Heming Zhu, Fangneng Zhan, Christian Theobalt, Marc Habermann
ACM Transactions on Graphics (TOG), 2024
Paper | Project | Code

Multimodal Image Synthesis and Editing: The Generative AI
Era

Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Shijian Lu, Lingjie Liu, Adam Kortylewski, Christian Theobalt, Eric Xing
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 (Top50 Popular Paper)
Paper | Project | Code

DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields

Yu Chi, Fangneng Zhan, Sibo Wu, Christian Theobalt, Adam Kortylewski
The European Conference on Computer Vision (ECCV), 2024
Paper

3D Open-vocabulary Segmentation with Foundation Models

Kunhao Liu, Fangneng Zhan, Jiahui Zhang, Muyu Xu, Yingchen Yu, Abdulmotaleb El Saddik, Christian Theobalt, Eric Xing, Shijian Lu
Advances in Neural Information Processing Systems (NeurIPS), 2023
Paper | Code

WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields

Muyu Xu, Fangneng Zhan, Jiahui Zhang, Yingchen Yu, Xiaoqin Zhang, Christian Theobalt, Ling Shao, Shijian Lu
IEEE International Conference on Computer Vision (ICCV), 2023
Paper | Project

A Deeper Analysis of Volumetric Relightiable Fields

Pramod Rao, Mallikarjun B R, Gereon Fox, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Fangneng Zhan, Ayush Tewari, Christian Theobalt, Mohamed Elgharib
International Journal of Computer Vision (IJCV), 2023 (Invited Paper)
Paper | Project

StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields

Kunhao Liu, Fangneng Zhan, Yiwen Chen, Jiahui Zhang, Yingchen Yu, Abdulmotaleb El Saddik, Shijian Lu, Eric Xing
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)，2023
Paper | Project | Code

VMRF: View Matching Neural Radiance Fields

Jiahui Zhang, Fangneng Zhan, Rongliang Wu, Yingchen Yu, Wenqing Zhang, Song Bai, Xiaoqin Zhang, Shijian Lu
ACM International Conference on Multimedia (ACM MM), 2022
Paper

Auto-regressive Image Synthesis with Integrated Quantization

Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Changgong Zhang, Shijian Lu
European Conference on Computer Vision (ECCV), 2022 (Oral Presentation)
Paper | Code

Bi-level Feature Alignment for Versatile Image Translation and Manipulation

Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Aoran Xiao, Shijian Lu, Chunyan Miao
European Conference on Computer Vision (ECCV), 2022
Paper | Code

Marginal Correspondence for Conditional Image Generation

Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Shijian Lu, Changgong Zhang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 (Oral Presentation)
Paper | Code

Modulated Contrast for Versatile Image Synthesis

Fangneng Zhan, Jiahui Zhang, Yingchen Yu, Rongliang Wu, Shijian Lu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Paper | Code

GMLight: Lighting Estimation via Geometric Distribution Approximation

Changgong Zhang, Fangneng Zhan, Yingchen Yu, Rongliang Wu, Wenbo Hu, Shijian Lu, Feiying Ma, Xuansong Xie, Ling Shao
IEEE Transactions on Image Processing (TIP), 2022
Paper | Code | Dataset

Sparse Needlets for Lighting Estimation with Spherical Transport Loss

Fangneng Zhan, Changgong Zhang, Wenbo Hu, Shijian Lu, Feiying Ma, Xuansong Xie, Ling Shao
IEEE International Conference on Computer Vision (ICCV), 2021
Paper | Code | Dataset

EMLight: Lighting Estimation via Spherical Distribution Approximation

Fangneng Zhan, Changgong Zhang, Yingchen Yu, Yuan Chang, Shijian Lu, Feiying Ma, Xuansong Xie
AAAI Conference on Artificial Intelligence (AAAI), 2021
Paper | Code | Dataset

WaveFill: A Wavelet-based Generation Network for Image Inpainting

Yingchen Yu, Fangneng Zhan, Shijian Lu, Jianxiong Pan, Feiying Ma, Xuansong Xie, Chunyan Miao
IEEE International Conference on Computer Vision (ICCV), 2021 (Oral Presentation)
Paper | Code

Unbalanced Feature Transport for Exemplar-based Image Translation

Fangneng Zhan, Yingchen Yu, Kaiwen Cui, Gongjie Zhang, Shijian Lu, Jianxiong Pan, Changgong Zhang, Feiying Ma, Xuansong Xie, Chunyan Miao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Paper | Project | Code

Projects & Datasets

Virtual Object Relighting (VOR) Dataset for Lighting Estimation Evaluation

We create an evaluation dataset consisting of 3D scenes to conduct virtual object insertion & rendering in Blender. The lighting estimaiton performance is evaluated by using the predicted illumination map as the environment light in Blender.
Dataset | Paper | Code

Real time Scene Text Detection and Recognition System

We design a real time system for the end-to-end detection and recognition of scene texts in videos. The texts in one frame is first localized with a detection model, then a recognition model is employed to recognize the croped scene text images.
Code | Video

Academic Service

Talks:
∙ [11, 2025] DeCoDE Lab, MIT: Learning to Represent and Render the 3D World.
∙ [10, 2025] AIS, HKUST(Job Talk): Learning to Create and Control the 3D World.
∙ [07, 2025] ETH Zurich(Job Talk): Learning to Represent and Render the 3D World.
∙ [03, 2025] Imperial College London(Job Talk): Learning to Represent and Render the 3D World.
∙ [02, 2025] Peking University: Learning to Represent and Render the 3D World.
∙ [04, 2024] Visual Computing Group, Harvard University: Learning to Represent and Render the 3D World.
∙ [04, 2024] Jiajun Wu's Group, Stanford University: Towards Autonomous Scene Representation & Rendering.
∙ [03, 2024] SIGS, Tsinghua University: Autonomous Rendering Intelligence.
∙ [02, 2024] IDS, The University of Hong Kong: Autonomous Rendering Intelligence.
∙ [03, 2023] The AI Talks: On the Gauge Transformation of Neural Fields.
Workshops & Tutorials:
∙ Organizer, CVPR 2025 Workshop: 3nd Generative Models for Computer Vision.
∙ Organizer, CVPR 2024 Workshop: Neural Rendering Intelligence.
∙ Organizer, CVPR 2024 Workshop: 2nd Generative Models for Computer Vision.
∙ Organizer, CVPR 2023 Workshop: Generative Models for Computer Vision.

Fangneng Zhan 占方能