MELON: Reconstructing 3D objects from images with unknown poses
Back to Home
news

MELON: Reconstructing 3D objects from images with unknown poses

February 27, 20263 views2 min read
Researchers have introduced MELON, a novel technique for 3D reconstruction that can determine object-centric camera poses without requiring approximate pose initializations, complex GAN training schemes, or pre-training on labeled data. This advancement promises to simplify and enhance the process of 3D scene reconstruction, especially in scenarios where camera poses are unknown. MELON, developed by a team including Axel Levy, Matan Sela, Gordon Wetzstein, Florian Schroff, and Hartwig Adam, leverages a relatively simple approach to achieve robust results. The technique has been tested on synthetic images from the NeRF-Synthetic dataset, where it demonstrated quick convergence and competitive performance, achieving a PSNR of 27.5 dB after 50,000 training steps. One of the standout features of MELON is its ability to work with extremely noisy, unposed images. Even when images contain significant white Gaussian noise (σ=1.0), MELON successfully determines camera poses and generates novel views of the objects. This robustness was unexpected, given that similar capabilities have been demonstrated in other methods like RawNeRF, but with known camera poses. The researchers note that while they have so far demonstrated MELON on synthetic images, they are actively adapting the technique for real-world applications. The full details of the method are available in the paper titled "MELON: Monocular 3D Object Reconstruction from Unposed Images" and on the dedicated MELON website. This innovation marks a significant step forward in the field of 3D reconstruction, offering a streamlined and flexible solution that could benefit a wide range of applications, from computer graphics to robotics and beyond.

Related Articles