I was thinking of first coming up with algorithm to find outliers and the training LSTM only on the outliers, for that I should assemble some meta-algorithm I guess and train both LSTM and trimming network at the same time.
I was wondering if something like this exists in literature already?
Great stuff, I see the 3D ViT examples are only for several(dozen) frames, not hundreds of thousands.
In my experiments so far I tried with LSTM network to classify these but the amount of input features is too massive for realistic training and I was only experiment with already videos order of magnitude smaller than what I want.
Vae94 OP t1_iy8fy1f wrote
Reply to comment by eeng_ in [D] Are problems with massive amount of input features feasible? by Vae94
Yes. Thanks for sanity check!
I was thinking of first coming up with algorithm to find outliers and the training LSTM only on the outliers, for that I should assemble some meta-algorithm I guess and train both LSTM and trimming network at the same time.
I was wondering if something like this exists in literature already?