Submitted by madmax_br5 t3_10mbct5 in MachineLearning
visarga t1_j67pv49 wrote
Reply to comment by HateRedditCantQuitit in [D] Moving away from Unicode for more equal token representation across global languages? by madmax_br5
It's also that content in English dwarfs content in other languages. Languages more similar to English benefit from that too, but languages with different scripts and fewer cognates do not.