What is a Depth Map?
For every pixel, a depth map gives the distance from the camera to the object visible at that pixel.
Predicting Depth Maps
Method
We take an RGB image as input and want to predict its depth map. We do this by processing the RGB image with a fully convolutional network, which outputs one depth value per pixel.
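To make "fully convolutional" concrete, here is a minimal sketch of a single 3x3 convolution that maps an H x W x 3 image to an H x W map. It is only an illustration: the function name `conv3x3_same`, the averaging kernel, and the toy image are assumptions made for this example, and a real depth network would stack many learned layers with nonlinearities.

```python
def conv3x3_same(image, kernel, bias=0.0):
    """Apply one 3x3 convolution with zero padding.

    image:  H x W x 3 nested lists (an RGB image).
    kernel: 3 x 3 x 3 nested lists of weights (hypothetical, not learned).
    Returns an H x W nested list, so every input pixel gets one output value.
    """
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = bias
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy, xx = y + dy, x + dx
                    # Pixels outside the image count as zero (zero padding).
                    if 0 <= yy < h and 0 <= xx < w:
                        for c in range(3):
                            acc += kernel[dy + 1][dx + 1][c] * image[yy][xx][c]
            out[y][x] = acc
    return out


# Toy 3 x 4 RGB image filled with ones, and an averaging kernel (assumptions).
toy_image = [[[1.0, 1.0, 1.0] for _ in range(4)] for _ in range(3)]
avg_kernel = [[[1.0 / 27.0] * 3 for _ in range(3)] for _ in range(3)]

toy_output = conv3x3_same(toy_image, avg_kernel)
print(len(toy_output), len(toy_output[0]))  # same spatial size as the input
```

Because the layer has no fully connected part, the output keeps the spatial layout of the input, which is what lets the network produce a per-pixel prediction.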
Problem: Scale / Depth Ambiguity
If we put a cat at some distance from me, it looks the same in a single image as a cat twice as large as the original placed at twice the distance.
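The ambiguity can be checked with a little arithmetic. The sketch below assumes an ideal pinhole camera, where the projected size is focal length times object size divided by distance. The focal length, cat sizes, and distances are made-up numbers for illustration.

```python
def projected_height(focal_length_px, object_height_m, distance_m):
    """Image-plane height (in pixels) under an ideal pinhole camera model."""
    return focal_length_px * object_height_m / distance_m


focal_length_px = 500.0  # hypothetical focal length in pixels

# A 0.5 m cat at 2 m, and a cat twice as large at twice the distance.
small_near = projected_height(focal_length_px, 0.5, 2.0)
large_far = projected_height(focal_length_px, 1.0, 4.0)

# Both cats cover the same number of pixels, so one image cannot
# tell the two scenes apart.
print(small_near, large_far)
```

Scaling the object size and the distance by the same factor leaves the projection unchanged, which is why absolute depth cannot be recovered from a single image without extra cues.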
Scale Invariant Loss
The key property of the scale-invariant loss is that it only cares whether the ratios between pixel depths are preserved.
Intuitively, if pixel A is twice as far away as pixel B in the ground truth, this pair contributes nothing to the loss as long as the 2:1 ratio is maintained in the prediction.
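This intuition can be tested numerically. The sketch below assumes one standard form of the loss, the pairwise scale-invariant log error from Eigen et al. (2014), which compares log-depth differences between every pair of pixels. The function name and the sample depth values are illustrative assumptions.

```python
import math


def scale_invariant_loss(pred, target):
    """Pairwise scale-invariant log error (assumed form, see lead-in).

    pred, target: equal-length sequences of positive depth values.
    Computes 1 / (2 n^2) * sum over i, j of
    ((log pred_i - log pred_j) - (log target_i - log target_j))^2.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    n = len(pred)
    # d_i = log pred_i - log target_i; the pairwise term equals d_i - d_j.
    d = [math.log(p) - math.log(t) for p, t in zip(pred, target)]
    total = 0.0
    for d_i in d:
        for d_j in d:
            total += (d_i - d_j) ** 2
    return total / (2 * n * n)


target = [1.0, 2.0, 4.0]                  # made-up ground-truth depths
pred_scaled = [3.0 * t for t in target]   # every ratio preserved
pred_distorted = [1.0, 2.0, 2.0]          # last pixel breaks the 2:1 ratio

loss_scaled = scale_invariant_loss(pred_scaled, target)        # ~0 up to rounding
loss_distorted = scale_invariant_loss(pred_distorted, target)  # strictly positive
print(loss_scaled, loss_distorted)
```

Multiplying every predicted depth by the same constant shifts all log differences equally, so the pairwise terms cancel and the loss stays at zero (up to floating-point rounding). Only changes in the ratios between pixels are penalized.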
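Its mathematical expression, in one standard form (the pairwise scale-invariant error of Eigen et al., 2014), is as follows:
$$D(y, y^*) = \frac{1}{2n^2} \sum_{i,j} \Big( (\log y_i - \log y_j) - (\log y_i^* - \log y_j^*) \Big)^2$$
where $y$ is the predicted depth map, $y^*$ is the ground-truth depth map, $n$ is the number of pixels, and $i$, $j$ index pixels.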