dluther93

dluther93 t1_it982ce wrote

I've done this before for multi-modal classification tasks.
Train CNN end-to-end, take the layer before last for a dense vector of embeddings.
use that dense vector as a feature set alongside my tabular data in an XGBoost or Catboost model. Boom

Easy to do on a local machine, cumbersome to try and reliably deploy this model though.

3