Viewing a single comment thread. View all comments

pagein t1_ix2wkue wrote

If you want to cluster sentences, take a look in LABSE. This model was specially designed for embedding extraction. https://ai.googleblog.com/2020/08/language-agnostic-bert-sentence.html?m=1

2

Devinco001 OP t1_ix710w3 wrote

This looks really interesting, thanks. Is it open source?

1

pagein t1_ix71gqd wrote

There are several pretrained implementations:

  • Pytorch implemenatation using HuggingFace Transformers Library under Apache 2.0 license
  • Original Tensorflow model on Tensorflow Hub under the same Apache 2.0 license.
2