carbocation t1_ixyz3g4 wrote on November 27, 2022 at 2:16 PM

Seems like binocular depth estimation should be possible with a binocular device.

naccib t1_iy1009o wrote on November 27, 2022 at 10:44 PM

Monocular depth estimation is very valuable for creating AR experiences in general-use devices such as smartphones. This is, in my opinion, the greatest value for such depth estimation algorithms.

carbocation t1_iy12bhd wrote on November 27, 2022 at 11:01 PM

I agree with you about the value and use-cases for monocular depth estimation. I was just making the point that, in principle, a binocular device could attempt binocular depth estimation. Or perhaps they tried it internally and it was not sufficiently better to be worth the expense.

naccib t1_iy3jayh wrote on November 28, 2022 at 2:06 PM

Oh, binocular depth estimation is definitely a less technically challenging approach. I think the reasons they are pursuing monocular are due to what the other commenter said about cost and stuff.

pm_me_your_pay_slips t1_ixz544s wrote on November 27, 2022 at 3:06 PM

One camera is cheaper than two, though. Cheaper in every sense (compute, memory, network bandwidth, energy consumption, parts cost, etc).

mg31415 t1_iy2pg4i wrote on November 28, 2022 at 7:55 AM

How is one camera is cheaper computationally? If it was stereo they wouldn't need a NN

pm_me_your_pay_slips t1_iy2twej wrote on November 28, 2022 at 9:00 AM

You need to do feature computation and find correspondences. If you’re using a learned feature extractor, that will be twice as expensive as the monocular model. But let’s say you’re using a classical feature extractor. You still need to do feature matching. For dense depth maps, both of these stages can be as expensive, if not more, than a single forward pass through a highly optimized mobile NN architecture.

soulslicer0 t1_iy2fd1x wrote on November 28, 2022 at 5:47 AM

Could be doing depth estimation by fusing two monocular nets like mvsnet