Submitted by orgtre t3_xujlqk in dataisbeautiful
i875p t1_iqxk5mt wrote
Reply to comment by orgtre in The returns to learning the most common words, by language [OC] by orgtre
Just an observation: the lists suggest that the Chinese corpus is based largely on recent government documents, reports, and legal codes published in book form. I would guess that even someone who understands every word on the 1-grams list would still find a relatively accessible classical Chinese novel (like Romance of the Three Kingdoms) a bit difficult to read.