IllumiNeRF
3D Relighting Without Inverse Rendering

NeurIPS 2024

Xiaoming Zhao¹,³, Pratul P. Srinivasan², Dor Verbin², Keunhong Park¹, Ricardo Martin-Brualla¹, Philipp Henzler¹

¹Google Research, ²Google DeepMind, ³University of Illinois Urbana-Champaign

Abstract

Existing methods for relightable view synthesis, which use a set of images of an object under unknown lighting to recover a 3D representation that can be rendered from novel viewpoints under a target illumination, are based on inverse rendering and attempt to disentangle the object geometry, materials, and lighting that explain the input images. This typically involves optimization through differentiable Monte Carlo rendering, which is brittle and computationally expensive. In this work, we propose a simpler approach: we first relight each input image using an image diffusion model conditioned on lighting and then reconstruct a Neural Radiance Field (NeRF) with these relit images, from which we render novel views under the target lighting. We demonstrate that this strategy is surprisingly competitive and achieves state-of-the-art results on multiple relighting benchmarks.

How It Works

Given a set of images and camera poses (a), we run NeRF to extract the 3D geometry (b).
Based on this geometry and a target light (c), we create radiance cues for each input view (d).
Next, we independently relight each input image using a Relighting Diffusion Model (e) and sample S possible solutions for each image (f).
Finally, we distill the relit set of images into a 3D representation through a Latent NeRF optimization (g, h).
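
The data flow of these four steps can be sketched in a few lines of Python. This is only an illustrative outline, not the authors' implementation: every name in it (`relight_scene`, `RelitSample`, and the four callables `fit_nerf`, `render_radiance_cues`, `sample_relit_image`, `fit_latent_nerf`) is hypothetical, and the actual models are passed in as opaque functions. The point it makes is structural: N input views with S diffusion samples each yield N × S relit images, which are then fitted jointly.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Sequence


@dataclass
class RelitSample:
    """One diffusion output. All field names are illustrative assumptions."""
    view_index: int    # which input view was relit
    sample_index: int  # which of the S samples drawn for that view
    image: Any         # the relit image (format left unspecified)


def relight_scene(
    images: Sequence[Any],
    poses: Sequence[Any],
    target_light: Any,
    fit_nerf: Callable[[Sequence[Any], Sequence[Any]], Any],
    render_radiance_cues: Callable[[Any, Any, Any], Any],
    sample_relit_image: Callable[[Any, Any], Any],
    fit_latent_nerf: Callable[[List[RelitSample], Sequence[Any]], Any],
    num_samples: int,
) -> Any:
    # (a)-(b): recover the 3D geometry from the posed input images.
    geometry = fit_nerf(images, poses)

    relit: List[RelitSample] = []
    for i, (image, pose) in enumerate(zip(images, poses)):
        # (c)-(d): radiance cues for this view under the target light.
        cues = render_radiance_cues(geometry, pose, target_light)
        # (e)-(f): draw S independent relit samples for this view.
        for s in range(num_samples):
            relit.append(RelitSample(i, s, sample_relit_image(image, cues)))

    # (g)-(h): distill all N * S relit images into one 3D representation.
    return fit_latent_nerf(relit, poses)
```

Passing the stages in as callables keeps the sketch independent of any particular NeRF or diffusion library; each view is relit without reference to the others, so consistency across views is left entirely to the final fitting step.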

3D Consistent Relighting

Top: renderings from our final latent NeRF.
Bottom: for each rendered frame above, a diffusion sample from the nearest training view.
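
For readers who want to reproduce this kind of comparison, the lookup of the "nearest training view" might look like the sketch below. The page does not say how nearness is measured; Euclidean distance between camera centers is an assumption made here for illustration, and the function name `nearest_training_view` is hypothetical.

```python
import math
from typing import Sequence


def nearest_training_view(
    render_position: Sequence[float],
    training_positions: Sequence[Sequence[float]],
) -> int:
    """Return the index of the training camera closest to the rendered one.

    Assumption: "nearest" means smallest Euclidean distance between camera
    centers. Viewing direction is ignored in this simplified version.
    """
    if not training_positions:
        raise ValueError("training_positions must not be empty")
    return min(
        range(len(training_positions)),
        key=lambda i: math.dist(render_position, training_positions[i]),
    )
```

A metric that also accounts for viewing direction may be more appropriate when cameras are not arranged on a roughly uniform orbit around the object.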

Related Works

Check out the following concurrent works which also introduce a (single-image) relighting diffusion model.

DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Neural Gaffer: Relighting Any Object via Diffusion (also capable of 3D relighting)
A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis (also capable of 3D relighting)

BibTeX

@inproceedings{zhao2024illuminerf,
    author    = {Xiaoming Zhao and Pratul P. Srinivasan and Dor Verbin and Keunhong Park and Ricardo Martin-Brualla and Philipp Henzler},
    title     = {{IllumiNeRF: 3D Relighting Without Inverse Rendering}},
    booktitle = {NeurIPS},
    year      = {2024},
}
