I am a postdoctoral researcher at the Max Planck Institute for Informatics, working with Prof. Christian Theobalt. I received my Ph.D. in Computer Science & Engineering from Nanyang Technological University, Singapore, supervised by Prof. Shijian Lu. I also worked as an RA at NTU S-Lab & MICL and as a research intern at Alibaba DAMO Academy. I obtained my bachelor's degree in Communication Engineering from the University of Electronic Science and Technology of China. My research interests are Generative Models and Neural Rendering (e.g., NeRF, 3D GAN, Scene Representation, Generative Pre-training).
I am looking for master's thesis students and am also open to collaborations on neural rendering and generative models. Please contact firstname.lastname@example.org if you are interested.
∙ [07, 2022] Our survey paper Multimodal Image Synthesis and Editing is updated.
∙ [07, 2022] Two papers are accepted to ECCV 2022 (1 Oral).
∙ [06, 2022] Two papers are accepted to ACM MM 2022.
∙ [03, 2022] Three papers are accepted to CVPR 2022 (1 Oral).
∙ [01, 2022] One paper is accepted to TIP.
∙ [12, 2021] We release VOR Dataset for lighting estimation evaluation.
∙ [12, 2021] Two papers are accepted to AAAI 2022.
∙ [07, 2021] Two papers are accepted to ICCV 2021 (1 Oral).
∙ [07, 2021] Two papers are accepted to ACM MM 2021 (1 Oral).
∙ [04, 2021] I defended my Ph.D. thesis: Image Synthesis in Visual Machine Learning.
∙ [03, 2021] One paper is accepted to CVPR 2021.
∙ [12, 2020] One paper is accepted to AAAI 2021.
Multimodal Image Synthesis and Editing: A Survey
Auto-regressive Image Synthesis with Integrated Quantization
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Changgong Zhang, Shijian Lu
European Conference on Computer Vision (ECCV), 2022 (Oral Presentation)
Bi-level Feature Alignment for Versatile Image Translation and Manipulation
VMRF: View Matching Neural Radiance Fields
Jiahui Zhang, Fangneng Zhan, Rongliang Wu, Yingchen Yu, Wenqing Zhang, Song Bai, Xiaoqin Zhang, Shijian Lu
ACM International Conference on Multimedia (ACM MM), 2022
Towards Counterfactual Image Manipulation via CLIP
Marginal Contrastive Correspondence for Guided Image Generation
Modulated Contrast for Versatile Image Synthesis
Unbalanced Feature Transport for Exemplar-based Image Translation
Fangneng Zhan, Yingchen Yu, Kaiwen Cui, Gongjie Zhang, Shijian Lu, Jianxiong Pan, Changgong Zhang, Feiying Ma, Xuansong Xie, Chunyan Miao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Paper | Code | Supple
WaveFill: A Wavelet-based Generation Network for Image Inpainting
Sparse Needlets for Lighting Estimation with Spherical Transport Loss
EMLight: Lighting Estimation via Spherical Distribution Approximation
GMLight: Lighting Estimation via Geometric Distribution Approximation
Diverse Image Inpainting with Bidirectional and Autoregressive Transformers
Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jianxiong Pan, Kaiwen Cui, Shijian Lu, Feiying Ma, Xuansong Xie, Chunyan Miao
ACM International Conference on Multimedia (ACM MM), 2021 (Oral Presentation)
Paper | Code
ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification
Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes
Projects & Datasets
Virtual Object Relighting (VOR) Dataset for Lighting Estimation Evaluation
We create an evaluation dataset consisting of 3D scenes to conduct virtual object insertion & rendering in Blender. The lighting estimation performance is evaluated by using the predicted illumination map as the environment light in Blender.
Dataset | Paper | Code
Real-time Scene Text Detection and Recognition System
We design a real-time system for the end-to-end detection and recognition of scene text in videos. The text in each frame is first localized with a detection model, then a recognition model is employed to recognize the cropped scene text images.
Code | Video
Program Committee Member:
∙ AAAI 2022, 2021, 2020
Conference & Journal Reviewer:
∙ ICML 2022, ICLR 2022, NeurIPS 2021, 2020, CVPR 2022, 2021, 2020, 2019, ICCV 2021, 2019, ECCV 2020, BMVC 2021, 2020, ACCV 2020, WACV 2022, 2021
∙ TPAMI, TIP, TMM, Pattern Recognition, Neurocomputing