OfficialWireGrind OP t1_j6uoxb7 wrote

Data Sources: Syllable counts come from the CMU Pronouncing Dictionary, along with a few supplementary data points. The most-common-word data was obtained by analyzing Wikipedia database dumps.

Tools: Python, Matplotlib
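
For anyone curious how syllable counting from a pronouncing dictionary can work, here is a minimal sketch, not necessarily the exact code used for the chart. It assumes the CMU dictionary's plain-text layout (a word followed by ARPAbet phonemes, with every vowel phoneme ending in a stress digit 0, 1, or 2), so the syllable count is just the number of phonemes that end in a digit. The three entries in `SAMPLE_ENTRIES` are hand-copied for illustration, and the names `parse_entries`, `count_syllables`, and `syllables` are made up for this example.

```python
# A few lines in the style of the CMU dictionary file, for illustration only.
SAMPLE_ENTRIES = """\
PLAY  P L EY1
PLAYED  P L EY1 D
DICTIONARY  D IH1 K SH AH0 N EH2 R IY0
"""


def parse_entries(text):
    """Map each lowercased word to its first listed pronunciation."""
    pronunciations = {}
    for line in text.splitlines():
        # Skip blank lines and ';;;' comment lines.
        if not line.strip() or line.startswith(";;;"):
            continue
        word, phones = line.split(None, 1)
        # Keep only the first pronunciation seen for a word.
        pronunciations.setdefault(word.lower(), phones.split())
    return pronunciations


def count_syllables(phones):
    """Count vowel phonemes, i.e. those ending in a stress digit."""
    return sum(1 for phone in phones if phone[-1].isdigit())


PRONUNCIATIONS = parse_entries(SAMPLE_ENTRIES)


def syllables(word):
    """Return the syllable count, or None if the word is not listed."""
    phones = PRONUNCIATIONS.get(word.lower())
    return None if phones is None else count_syllables(phones)


for w in ("play", "played", "dictionary"):
    print(w, syllables(w))
```

Words missing from the dictionary return `None` here, which is presumably where supplementary data points would have to fill in.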

jaltsukoltsu t1_j6vsgck wrote

Are the words lemmatized or are "play" and "played" counted as two separate words?