dluther93 t1_it982ce wrote

I've done this before for multi-modal classification tasks.
Train the CNN end-to-end, then take the activations of the layer before the output as a dense embedding vector.
Use that dense vector as a feature set alongside my tabular data in an XGBoost or CatBoost model. Boom

Easy to do on a local machine; cumbersome to deploy this model reliably, though.

3

Bonsanto t1_ita8okg wrote

Do you have any example/implementation at hand?

1

dluther93 t1_itbglnd wrote

Nothing I’m able to share publicly, unfortunately. Just build a CNN, then concat its outputs onto your original dataset :)
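A minimal sketch of the idea, since no public implementation was shared. All names and data here are synthetic; the random ReLU projection is only a placeholder for a real trained CNN's penultimate layer, and sklearn's `GradientBoostingClassifier` stands in for XGBoost/CatBoost:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic stand-ins: 500 "images" flattened to 64 dims, plus 5 tabular features.
n = 500
images = rng.normal(size=(n, 64))
tabular = rng.normal(size=(n, 5))

# Placeholder for the CNN: in practice you would train a CNN end-to-end,
# then run a forward pass and keep the layer before the output head.
W = rng.normal(size=(64, 16))
def cnn_embed(x):
    return np.maximum(x @ W, 0.0)  # ReLU projection standing in for real embeddings

# Labels depend on both modalities, so the fused model has signal to gain.
y = ((cnn_embed(images)[:, 0] + tabular[:, 0]) > 0).astype(int)

# The core trick: concat the dense embedding vector onto the tabular features.
fused = np.hstack([tabular, cnn_embed(images)])

X_tr, X_te, y_tr, y_te = train_test_split(fused, y, random_state=0)
model = GradientBoostingClassifier(random_state=0).fit(X_tr, y_tr)
print(f"fused-feature accuracy: {model.score(X_te, y_te):.2f}")
```

The gradient-boosted model then sees one flat feature matrix and never needs to know which columns came from the image branch.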

1

abstract000 t1_itc0l85 wrote

Did you really see a significant improvement? I tried this and it performed poorly, but maybe it was just the dataset. BTW, did you test that in a Kaggle competition?

1

dluther93 t1_itc1ldm wrote

It was significant to us. Our baseline is the XGBoost model with tabular data only. We were looking for ways to augment our tabular performance, not to improve imaging performance. It was a feature-engineering method for the problem.

1

abstract000 t1_itc3cw1 wrote

OK, I will try this next time I work on tabular data.

1