Viewing a single comment thread. View all comments

fasttosmile t1_j6jk30j wrote

Everyone has been moving on from kaldi so it's a little weird to bring that up now.

If you're interested in a modern formats for speech data look into lhotse.

2

uhules t1_j6juq7x wrote

Lhotse is basically part of the "Kaldi 2.0 ecosystem" (K2/Lhotse/Icefall/Sherpa), you'll probably see people referring to the whole lot as Kaldi as well.

2

fasttosmile t1_j6jzvyw wrote

That does not make sense. You don't need kaldi to use the new libraries. And lhotse can be used totally independently of k2 or icefall.

−1

Maleficent_Cod_1055 t1_j6jkz4b wrote

Tbh if you're still doing anything like word alignment or phone alignment the first thing people bring up is still Kaldi. Will check out Lhotse!

1