
Comment by markisus

1 day ago

Yes, the normal case uses 2D input, but it can take hours to create the scene. Using the depth channel lets me create the scene in 33 milliseconds, from scratch, every frame. You could conceptualize this as a compromise between raw point cloud rendering and fully precomputed Gaussian splat rendering. With point clouds, you get a lot of visual artifacts due to sparsity (low texture information, seeing "through" objects). With Gaussian splatting, you can transfer much more of the 2D texture information into 3D space and render occlusion and view-dependent effects better.
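
For intuition, here is a minimal sketch (not the commenter's actual pipeline) of the per-frame step that makes a 33 ms budget plausible: back-projecting a depth image through pinhole intrinsics to get one 3D point per pixel, each of which can seed a Gaussian. The intrinsics fx, fy, cx, cy and the image size are assumed values for illustration.

```python
import numpy as np

def depth_to_gaussian_centers(depth, fx, fy, cx, cy):
    """Back-project an HxW depth image into an (H*W, 3) array of 3D
    points, one candidate Gaussian center per pixel."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel coordinates
    z = depth
    x = (u - cx) * z / fx  # pinhole unprojection
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)

# Example: a synthetic 480x640 depth map at 2 m, with made-up intrinsics.
depth = np.full((480, 640), 2.0, dtype=np.float32)
centers = depth_to_gaussian_centers(depth, fx=525.0, fy=525.0, cx=320.0, cy=240.0)
print(centers.shape)  # (307200, 3)
```

This is purely the geometry step; a real system would also attach color, scale, and opacity to each Gaussian before rasterizing.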

How do the view-dependent effects get "discovered" from only a single source camera angle?

  • Actually, there are multiple source cameras. The neural net learns to interpolate the source cameras' colors based on where the virtual camera is. Under the hood it's hard to say exactly what the neural net is doing, but I think it's something like "if I'm closer to camera A, take most of the color from camera A" (a toy version of that blending is sketched after this thread).

    • Gaussian splatting does not use neural nets. It runs an optimizer directly on the Gaussian splatting parameters (see the second sketch below). I think in your comment you are talking about Neural Radiance Fields (NeRFs).
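
To make the "take most of the color from the nearer camera" idea concrete, here is a hand-written inverse-distance blend. This is only a guess at the kind of function such a net might learn, not its actual learned behavior, and all names and values here are made up for the example.

```python
import numpy as np

def blend_colors(virtual_pos, source_positions, source_colors, eps=1e-6):
    """Inverse-distance weighting: source cameras nearer the virtual
    viewpoint contribute more of their observed color."""
    d = np.linalg.norm(source_positions - virtual_pos, axis=1)
    w = 1.0 / (d + eps)
    w /= w.sum()
    return w @ source_colors

source_positions = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
source_colors = np.array([[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])  # red cam A, blue cam B
# A virtual camera close to camera A yields a mostly-red blend.
print(blend_colors(np.array([0.1, 0.0, 0.0]), source_positions, source_colors))
```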
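And a toy contrast for the correction above: 3D Gaussian splatting runs a stochastic optimizer on explicit per-Gaussian parameters rather than training network weights. The loss below is a placeholder (a real pipeline rasterizes the Gaussians and compares the render against training photos), so treat this purely as a sketch of the distinction.

```python
import torch

n = 1000
means = torch.randn(n, 3, requires_grad=True)       # Gaussian centers
log_scales = torch.zeros(n, 3, requires_grad=True)  # anisotropic extents
colors = torch.rand(n, 3, requires_grad=True)       # per-Gaussian color

# The optimizer acts on the scene parameters themselves -- no neural net.
opt = torch.optim.Adam([means, log_scales, colors], lr=1e-2)
target = torch.rand(n, 3)  # stand-in for "ground-truth image" supervision

for step in range(100):
    opt.zero_grad()
    # Placeholder photometric loss plus tiny regularizers so every
    # parameter group receives a gradient in this toy example.
    loss = ((colors - target) ** 2).mean() \
        + 1e-4 * (log_scales ** 2).mean() \
        + 1e-4 * (means ** 2).mean()
    loss.backward()
    opt.step()
```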