Submitted by OfficialWireGrind t3_10rbedi in dataisbeautiful
OfficialWireGrind OP t1_j6uoxb7 wrote
Data Sources: Syllables were counted using the CMU Pronouncing Dictionary, along with a few supplementary data points. The list of most common words was obtained by analyzing Wikipedia database dumps.
Tools: Python, Matplotlib
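For anyone curious how syllable counting from a pronunciation dictionary can work, here is a minimal sketch, not necessarily what was done for this chart. It assumes the CMU dictionary's plain-text format, where each line is a word followed by its phonemes and every vowel phoneme ends in a stress digit (0, 1, or 2), so counting those digits gives the syllable count. The small `SAMPLE_CMUDICT` excerpt and the helper names (`parse_cmudict`, `count_syllables`, `syllables`) are made up for illustration.

```python
import re

# Illustrative excerpt in the dictionary's plain-text format: WORD  PHONEME PHONEME ...
# (a real run would read the full dictionary file instead).
SAMPLE_CMUDICT = """\
PLAY  P L EY1
PLAYED  P L EY1 D
DATA  D EY1 T AH0
BEAUTIFUL  B Y UW1 T AH0 F AH0 L
"""


def parse_cmudict(text):
    """Map each lowercase word to the phoneme list of its first pronunciation."""
    entries = {}
    for line in text.splitlines():
        # Skip blank lines and ';;;' comment lines.
        if not line.strip() or line.startswith(";;;"):
            continue
        word, *phones = line.split()
        # Alternate pronunciations are written WORD(1), WORD(2), ...; strip the suffix.
        word = re.sub(r"\(\d+\)$", "", word).lower()
        # setdefault keeps the first pronunciation seen for each word.
        entries.setdefault(word, phones)
    return entries


def count_syllables(phones):
    """Count vowel phonemes, i.e. those ending in a stress digit."""
    return sum(1 for p in phones if p[-1].isdigit())


PRONUNCIATIONS = parse_cmudict(SAMPLE_CMUDICT)


def syllables(word):
    """Return the syllable count, or None if the word is not in the dictionary."""
    phones = PRONUNCIATIONS.get(word.lower())
    return None if phones is None else count_syllables(phones)
```

Words missing from the dictionary return `None` here, which is presumably where supplementary data would have to fill in.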
jaltsukoltsu t1_j6vsgck wrote
Are the words lemmatized or are "play" and "played" counted as two separate words?
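To make the difference concrete, here is a small sketch of the two counting schemes. The `TOY_LEMMAS` table and the `count_words` helper are hypothetical stand-ins; a real pipeline would use a proper lemmatizer rather than a hand-written lookup.

```python
from collections import Counter

# Hypothetical toy lemma table, only for illustration.
TOY_LEMMAS = {"played": "play", "plays": "play", "playing": "play"}


def count_words(tokens, lemmatize=False):
    """Count tokens, optionally mapping each inflected form to its lemma first."""
    if lemmatize:
        tokens = [TOY_LEMMAS.get(t, t) for t in tokens]
    return Counter(tokens)


tokens = "play played plays word play".split()
surface_counts = count_words(tokens)                # "play" and "played" stay separate
lemma_counts = count_words(tokens, lemmatize=True)  # inflected forms merge into "play"
```

Without lemmatization "play" is counted twice and "played" once; with it, all four forms are counted under "play", which would change both the frequency ranking and which syllable counts end up in the chart.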