Comment by eloisius

2 months ago

From a single picture it infers a hidden 3D representation, from which you can produce photorealistic images from slightly different vantage points (novel views).

4 comments

eloisius

avaer 2 months ago

There's nothing "hidden" about the 3d represenation. It's a point cloud (in meters) with colors, and a guess at the the "camera" that produced it.

(I am oversimplifying).

uh_uh 2 months ago

"Hidden" or "latent" in a context like this just means variables that the algo is trying to infer because it doesn't have direct access to them.
eloisius 2 months ago
Hidden in the sense of neural net layers. I mean intermediary representation.
- avaer 2 months ago
  
  Right.
  I just want to emphasize that this is not a NERF where the model magically produces an image from an angle and then you ask "ok but how did you get this?" and it throws up its hands and says "I dunno, I ran some math and I got this image" :D.