Biography
Currently, I am a research fellow working and collaborating with Prof. Hanspeter Pfister & Prof. Paul Liang at Harvard University & MIT. Previously, I was a postdoctoral researcher at Max Planck Institute for Informatics with Prof. Christian Theobalt. I received my Ph.D. in Computer Science & Engineering from Nanyang Technological University, Singapore, and bachelor's degree in Communication Engineering from University of Electronic Science and Technology of China.
Coinciding with the dictum of “What I cannot create, I do not understand” by Richard Feynman, my research goals are developing & understanding intelligence emerged from the visual generation & rendering process (i.e., Generative AI). With Neural Rendering and Generative Models as general learning machine, the detailed research directions include:
∙ Rendering-induced intelligence, e.g., Evolutive Rendering, Neural Gauge Fields.
∙ Generative 3D intelligence, e.g., UNITE, IQ-VAE, SF-GAN.
∙ 3D for Robotics and Science, e.g., 3D-OVS
Call for papers for CVPR 2024 Workshops: Neural Rendering Intelligence and 2nd Generative Models for Computer Vision.
News
∙ [04, 2024] Invited talk at Visual Computing Group, Harvard University: Learning to Represent and Render the 3D World.
∙ [04, 2024] Invited talk at Jiajun Wu's Group, Stanford University: Towards Autonomous Scene Representation & Rendering.
∙ [04, 2024] Two papers are accepted to SIGGRAPH 2024 and TOG, respectively.
∙ [03, 2024] Invited talk at SIGS, Tsinghua University: Autonomous Rendering Intelligence.
∙ [02, 2024] Invited talk at IDS, The University of Hong Kong: Autonomous Rendering Intelligence.
∙ [12, 2023] We are organizing two workshops at CVPR 2024, including Neural Rendering Intelligence and 2nd Generative Models for Computer Vision.
∙ [09, 2023] One paper is accepted to IJCV, one is accepted to NeurIPS 2023.
∙ [08, 2023] One paper about Generative AI is accepted to TPAMI 2023.
∙ [07, 2023] Two papers are accepted to ICCV 2023.
Research Overview
Publications
Evolutive Rendering Models
Fangneng Zhan*, Hanxue Liang*, Michael Niemeyer, Yifan Wang, Michael Oechsle, Adam Kortylewski, Cengiz Oztireli, Gordon Wetzstein, Christian Theobalt
Preprint, 2024
Paper | Project
A framework for the autonomous evolution of principal elements in rendering models.
StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting
Kunhao Liu, Fangneng Zhan, Muyu Xu, Christian Theobalt, Ling Shao, Shijian Lu
Preprint, 2024
Paper | Project | Code
General Neural Gauge Fields
Fangneng Zhan, Lingjie Liu, Adam Kortylewski, Christian Theobalt
International Conference on Learning Representations (ICLR), 2023
Paper | Project | Code
A general paradigm and framework for learning gauge transformations in neural fields.
TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis
Heming Zhu, Fangneng Zhan, Christian Theobalt, Marc Habermann
ACM Transactions on Graphics (TOG), 2024
Paper | Project
Multimodal Image Synthesis and Editing: The Generative AI
Era
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Shijian Lu, Lingjie Liu, Adam Kortylewski, Christian Theobalt, Eric Xing
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 (Top50 Popular Paper)
Paper | Project | Code
DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields
Yu Chi, Fangneng Zhan, Sibo Wu, Christian Theobalt, Adam Kortylewski
The European Conference on Computer Vision (ECCV), 2024
Paper
3D Open-vocabulary Segmentation with Foundation Models
Kunhao Liu, Fangneng Zhan, Jiahui Zhang, Muyu Xu, Yingchen Yu, Abdulmotaleb El Saddik, Christian Theobalt, Eric Xing, Shijian Lu
Advances in Neural Information Processing Systems (NeurIPS), 2023
Paper | Code
WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields
Muyu Xu, Fangneng Zhan, Jiahui Zhang, Yingchen Yu, Xiaoqin Zhang, Christian Theobalt, Ling Shao, Shijian Lu
IEEE International Conference on Computer Vision (ICCV), 2023
Paper | Project
A Deeper Analysis of Volumetric Relightiable Fields
Pramod Rao, Mallikarjun B R, Gereon Fox, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Fangneng Zhan, Ayush Tewari, Christian Theobalt, Mohamed Elgharib
International Journal of Computer Vision (IJCV), 2023 (Invited Paper)
Paper | Project
StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields
Kunhao Liu, Fangneng Zhan, Yiwen Chen, Jiahui Zhang, Yingchen Yu, Abdulmotaleb El Saddik, Shijian Lu, Eric Xing
IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2023
Paper | Project | Code
VMRF: View Matching Neural Radiance Fields
Jiahui Zhang, Fangneng Zhan, Rongliang Wu, Yingchen Yu, Wenqing Zhang, Song Bai, Xiaoqin Zhang, Shijian Lu
ACM International Conference on Multimedia (ACM MM), 2022
Paper
Auto-regressive Image Synthesis with Integrated Quantization
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Changgong Zhang, Shijian Lu
European Conference on Computer Vision (ECCV), 2022 (Oral Presentation)
Paper | Code
Bi-level Feature Alignment for Versatile Image Translation and Manipulation
Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Aoran Xiao, Shijian Lu, Chunyan Miao
European Conference on Computer Vision (ECCV), 2022
Paper | Code
Marginal Correspondence for Conditional Image Generation
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Shijian Lu, Changgong Zhang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 (Oral Presentation)
Paper | Code | Supple
Modulated Contrast for Versatile Image Synthesis
Fangneng Zhan, Jiahui Zhang, Yingchen Yu, Rongliang Wu, Shijian Lu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Paper | Code | Supple
GMLight: Lighting Estimation via Geometric Distribution Approximation
Changgong Zhang, Fangneng Zhan, Yingchen Yu, Rongliang Wu, Wenbo Hu, Shijian Lu, Feiying Ma, Xuansong Xie, Ling Shao
IEEE Transactions on Image Processing (TIP), 2022
Paper | Code | Dataset | Supple
Sparse Needlets for Lighting Estimation with Spherical Transport Loss
Fangneng Zhan, Changgong Zhang, Wenbo Hu, Shijian Lu, Feiying Ma, Xuansong Xie, Ling Shao
IEEE International Conference on Computer Vision (ICCV), 2021
Paper | Code | Dataset | Supple
EMLight: Lighting Estimation via Spherical Distribution Approximation
Fangneng Zhan, Changgong Zhang, Yingchen Yu, Yuan Chang, Shijian Lu, Feiying Ma, Xuansong Xie
AAAI Conference on Artificial Intelligence (AAAI), 2021
Paper | Code | Dataset | Supple
WaveFill: A Wavelet-based Generation Network for Image Inpainting
Yingchen Yu, Fangneng Zhan, Shijian Lu, Jianxiong Pan, Feiying Ma, Xuansong Xie, Chunyan Miao
IEEE International Conference on Computer Vision (ICCV), 2021 (Oral Presentation)
Paper | Code
Unbalanced Feature Transport for Exemplar-based Image Translation
Fangneng Zhan, Yingchen Yu, Kaiwen Cui, Gongjie Zhang, Shijian Lu, Jianxiong Pan, Changgong Zhang, Feiying Ma, Xuansong Xie, Chunyan Miao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Paper | Project | Code | Supple
Diverse Image Inpainting with Bidirectional and Autoregressive Transformers
Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jianxiong Pan, Kaiwen Cui, Shijian Lu, Feiying Ma, Xuansong Xie, Chunyan Miao
ACM International Conference on Multimedia (ACM MM), 2021 (Oral Presentation)
Paper | Code
ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification
Fangneng Zhan, Shijian Lu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Paper | Code | Supple
Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes
Fangneng Zhan, Shijian Lu, Chuhui Xue
European conference on computer vision (ECCV), 2018
Paper | Code | Dataset | Supple
Projects & Datasets
Virtual Object Relighting (VOR) Dataset for Lighting Estimation Evaluation
We create an evaluation dataset consisting of 3D scenes to conduct virtual object insertion & rendering in Blender. The lighting estimaiton performance is evaluated by using the predicted illumination map as the environment light in Blender.
Dataset | Paper | Code
Real time Scene Text Detection and Recognition System
We design a real time system for the end-to-end detection and recognition of scene texts in videos. The texts in one frame is first localized with a detection model, then a recognition model is employed to recognize the croped scene text images.
Code | Video
Academic Service
Talks:
∙ [04, 2024] Visual Computing Group, Harvard University: Learning to Represent and Render the 3D World.
∙ [04, 2024] Jiajun Wu's Group, Stanford University: Towards Autonomous Scene Representation & Rendering.
∙ [03, 2024] SIGS, Tsinghua University: Autonomous Rendering Intelligence.
∙ [02, 2024] IDS, The University of Hong Kong: Autonomous Rendering Intelligence.
∙ [03, 2023] The AI Talks: On the Gauge Transformation of Neural Fields.
Workshops & Tutorials:
∙ Organizer, CVPR 2024 Workshop: Neural Rendering Intelligence.
∙ Organizer, CVPR 2024 Workshop: 2nd Generative Models for Computer Vision.
∙ Organizer, CVPR 2023 Workshop: Generative Models for Computer Vision.
Program Committee Member:
∙ AAAI 2022, 2021, 2020
Conference & Journal Reviewer:
∙ ICML2022, ICLR 2022, NeurIPS 2021, 2020, CVPR 2022, 2021, 2020, 2019, ICCV 2021, 2019, ECCV 2020, BMVC 2021, 2020, ACCV 2020, WACV 2022, 2021
∙ TPAMI, TIP, TMM, Pattern Recognition, Neurocomputing