mayiSLYTHERINyourbed

mayiSLYTHERINyourbed t1_iu7im0x wrote on October 29, 2022 at 3:41 AM

Reply to comment by GPUaccelerated in Do companies actually care about their model's training/inference speed? by GPUaccelerated

Our use case was in biometrics, where the test set would usually run to millions of images that needed to be matched simultaneously. There, even accumulating an extra 2-3 ms per batch would add up to a huge delay.

mayiSLYTHERINyourbed t1_iu3dafc wrote on October 28, 2022 at 6:54 AM

Reply to Do companies actually care about their model's training/inference speed? by GPUaccelerated

On a regular basis. We care down to the millisecond how fast inference or training is. In my last organisation we had to process around 200k images at inference time. At that scale, even a delay of 2 ms per image would cost about 6.7 minutes just for getting the feature vectors, which really matters.
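
For anyone who wants to check the 6.7 minute figure, here is a rough sketch of the arithmetic in Python. It assumes the 2 ms delay applies to each image and that images are processed one after another; the function name `cumulative_delay_minutes` is made up for illustration and is not from any real pipeline.

```python
def cumulative_delay_minutes(num_items: int, delay_ms_per_item: float) -> float:
    """Total added latency in minutes when every item incurs the same extra delay.

    Assumes items are processed sequentially, so the per-item delays simply add up.
    """
    total_ms = num_items * delay_ms_per_item  # 200,000 * 2 ms = 400,000 ms
    return total_ms / 1000 / 60               # ms -> seconds -> minutes


# 200k images with 2 ms of extra delay each
print(round(cumulative_delay_minutes(200_000, 2), 1))  # 6.7
```

With batching or parallel workers the total would be smaller, so treat this as an upper bound for the sequential case.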