Submitted by jiamengial t3_10p66zc in MachineLearning
fasttosmile t1_j6jk30j wrote
Reply to comment by jiamengial in [D] What's stopping you from working on speech and voice? by jiamengial
Everyone has been moving on from kaldi so it's a little weird to bring that up now.
If you're interested in a modern formats for speech data look into lhotse.
uhules t1_j6juq7x wrote
Lhotse is basically part of the "Kaldi 2.0 ecosystem" (K2/Lhotse/Icefall/Sherpa), you'll probably see people referring to the whole lot as Kaldi as well.
fasttosmile t1_j6jzvyw wrote
That does not make sense. You don't need kaldi to use the new libraries. And lhotse can be used totally independently of k2 or icefall.
Maleficent_Cod_1055 t1_j6jkz4b wrote
Tbh if you're still doing anything like word alignment or phone alignment the first thing people bring up is still Kaldi. Will check out Lhotse!
Viewing a single comment thread. View all comments