Frequency list
From Wikipedia, the free encyclopedia
In computational linguistics, a frequency list is a sorted list of words (word types) together with their frequency, where frequency here usually means the number of occurrences in a given corpus. A short example could be:
the | 3789654 |
he | 2098762 |
[...] | |
king | 57897 |
boy | 56975 |
[...] | |
outragious | 76 |
[...] | |
stringyfy | 5 |
[...] | |
transducionalify | 1 |
It seems that Zipf's law holds for frequency lists drawn from longer texts of any natural language. Frequency lists are a necessary prerequisite for building of an electronic dictionary, which is by itself a prerequisite for a wide range of applications in computational linguistics.