This blows my mind. It's probably a naive thought, but this technique looks like it could be combined with robotics to help a robot navigate its environment.
I'd also like to see what it does when you give it multiple views of a scene in a video game, some from direct captures and some from pictures of the monitor.
They've only shown it working with static content; they'll need to do it with video (multiple synchronised cameras) and in real time for any robotics application.
It'd be interesting to see what would happen if they encoded an additional time parameter with each 'view' (input image pixel). Surely someone is already trying to extend the technique that way.
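A minimal sketch of what that extension might look like, assuming a NeRF-style MLP with positional encoding (the class name and layer sizes here are made up for illustration, not from the paper): the network simply takes (x, y, z, t) instead of (x, y, z), so the field it learns can vary with the timestamp of each source view.

```python
# Hypothetical sketch: a NeRF-style field conditioned on a time coordinate.
import torch
import torch.nn as nn

def positional_encoding(x, num_freqs=10):
    # Map each coordinate to [sin(2^k * x), cos(2^k * x)], as in NeRF.
    freqs = 2.0 ** torch.arange(num_freqs, dtype=torch.float32)
    angles = x[..., None] * freqs                       # (..., dims, num_freqs)
    enc = torch.cat([torch.sin(angles), torch.cos(angles)], dim=-1)
    return enc.flatten(start_dim=-2)                    # (..., dims * 2 * num_freqs)

class TimeConditionedField(nn.Module):
    def __init__(self, num_freqs=10, hidden=256):
        super().__init__()
        in_dim = 4 * 2 * num_freqs                      # (x, y, z, t) after encoding
        self.num_freqs = num_freqs
        self.mlp = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 4),                       # RGB + density
        )

    def forward(self, xyz, t):
        # xyz: (N, 3) sample positions; t: (N, 1) timestamp of the source view.
        inputs = torch.cat([xyz, t], dim=-1)
        return self.mlp(positional_encoding(inputs, self.num_freqs))

# field = TimeConditionedField()
# rgb_sigma = field(torch.rand(1024, 3), torch.rand(1024, 1))
```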
Currently, view coordinates relative to the volume are required, so you first have to solve the SLAM problem before you can optimize a network representation of a given volume.
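To make the dependency concrete, here's a rough sketch (assuming a simple pinhole camera model, not the paper's actual code): every training pixel only becomes a usable ray once some SLAM or structure-from-motion system has produced a camera-to-world pose, which is why pose estimation has to happen before the field can be optimized.

```python
# Sketch: turning pixels into world-space rays requires a camera pose up front.
import numpy as np

def pixels_to_rays(cam_to_world, focal, width, height):
    """Return per-pixel ray origins and directions in world coordinates."""
    i, j = np.meshgrid(np.arange(width), np.arange(height), indexing="xy")
    # Ray directions in the camera frame (pinhole camera looking down -z).
    dirs = np.stack([(i - width / 2) / focal,
                     -(j - height / 2) / focal,
                     -np.ones_like(i, dtype=np.float64)], axis=-1)
    # Rotate into the world frame using the pose from SLAM / structure-from-motion.
    rays_d = dirs @ cam_to_world[:3, :3].T
    rays_o = np.broadcast_to(cam_to_world[:3, 3], rays_d.shape)
    return rays_o, rays_d

# pose = np.eye(4)   # in practice this comes from COLMAP or a SLAM system
# origins, directions = pixels_to_rays(pose, focal=500.0, width=640, height=480)
```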
No: optimizing the high-dimensional field takes 12 hours, so the time to render the field to an image isn't going to matter for robotics, where computer vision needs to be done in real time.