Lightweight Predictive 3D Gaussian Splats

LPGS models the scenes with the parent(yellow) and child(green) splats where important child splats can be promoted to parent via Adaptive Tree Manipulation (ATM).

Abstract

Recent approaches representing 3D objects and scenes using Gaussian splats show increased rendering speed across a variety of platforms and devices. While ren- dering such representations is indeed extremely efficient, storing and transmitting them is often prohibitively expensive. To represent large-scale scenes, one often needs to store millions of 3D Gaussian, which can occupy up to gigabytes of stor- age. This creates a significant practical barrier, preventing widespread adoption on resource-constrained devices. In this work, we propose a new representation that dramatically reduces the hard drive footprint while featuring similar or improved quality when compared to the standard 3D Gaussian splats. This representation leverages the inherent feature sharing among splats in the close proximity using a hierarchical tree structure, with which only the parent splats need to be stored. We present a method for constructing tree structures from naturally unstructured point clouds. Additionally, we propose the adaptive tree manipulation to prune the redundant trees in the space, while spawn new ones from the significant children splats during the optimization process. On the benchmark datasets, we achieve 20x storage reduction in hard-drive footprint with improved fidelity compared to the vanilla 3DGS and 2x-5x reduction compared to the exiting compact solu- tions. More importantly, we demonstrate the practical application of our method in real-world rendering on mobile devices and AR glasses

TL;DR: We propose a novel Gaussian Splat representation requiring much less storage, featuring superior rendering quality, and being able to run on mobile devices in real-time.

360 Orbit Camera Videos

Demo on iPhone 14 and Snap Spectacles AR Glass

If you browser is not displaying the videos, try to scroll down and place the videos on the top of your screen.

The flickering shown in Spectacles videos are caused by the recording issues.

Visual Comparisons: Ourdoor Scenes from Mip-NeRF 360 Dataset

Ours: PSNR 27.63 / 39.4 MB
3DGS: PSNR 27.41 / 1331 MB
Ours: PSNR 27.63 / 39.4 MB
CompactGS: PSNR 26.81 / 62.78 MB
Ours: PSNR 27.63 / 39.4 MB
LightGS: PSNR 26.73 / -

Visual Comparisons: Indoor Scenes from Mip-NeRF 360 Dataset

Ours: PSNR 31.84 / 28.95 MB
3DGS: PSNR 30.63 / 350 MB
Ours: PSNR 31.84 / 28.95 MB
CompactGS: PSNR 30.88 / 34.21 MB
Ours: PSNR 31.84 / 28.95 MB
LightGS: PSNR 31.27 / -
Ours: PSNR 29.10 / 30.02 MB
3DGS: PSNR 28.70 / 276.52 MB
Ours: PSNR 29.10 / 30.02 MB
CompactGS: PSNR 28.71 / 34.34 MB
Ours: PSNR 29.10 / 30.02 MB
LightGS: PSNR 28.11 / -

Visual Comparisons: Deep Blending Dataset

Ours: PSNR 29.34 / 35.00 MB
3DGS: PSNR 28.77 / 774 MB
Ours: PSNR 29.34 / 35.00 MB
CompactGS: PSNR 29.26 / 48 MB
Ours: PSNR 29.34 / 35.00 MB
LightGS
Ours: PSNR 30.44 / 35.80 MB
3DGS: PSNR 30.04 / 553 MB
Ours: PSNR 30.44 / 35.80 MB
CompactGS: PSNR 30.32 / 39 MB
Ours: PSNR 30.44 / 35.80 MB
LightGS

Visual Comparisons: Tank&Temples Dataset

Ours: PSNR 25.45 / 35.78 MB
3DGS: PSNR 25.19 / 608.7 MB
Ours: PSNR 25.45 / 35.78 MB
CompactGS: PSNR 25.07 / 41.57 MB
Ours: PSNR 25.45 / 35.78 MB
LightGS: PSNR 24.56 / -
Ours: PSNR 21.97 / 37.02 MB
3DGS: PSNR 21.10 / PSNR 255.82 MB
Ours: PSNR 21.97 / 37.02 MB
CompactGS: PSNR 21.56 / 37.29 MB
Ours: PSNR 21.97 / 37.02 MB
LightGS: PSNR 21.09 / -