What is Depth Map?

For every pixel, depth map gives the distance from the camera to the object


Predicting Depth Maps

Method

We input an RGB image and want to predict the depth map. We achieve this by using a fully convolutional network to process RGB input image

Problem: Scale / Depth Ambiguity

If we put a cat away from me, it is the same as putting a cat twice as large as the original cat at

Scale Invariant Loss

The feature of scale invariant loss is that we only care if the ratio between pixels are the same.

Intuitively, if pixel A is twice as far as pixel B in the ground truth, the loss is zero if this 2:1 ratio is maintained in the prediction

Its mathematical expression is as follows:

where