English Letter Frequency Counts:
Mayzner Revisited
or
ETAOIN SRHLDCU
> ...My distillation of the Google books data gives us 97,565 distinct words, which were mentioned 743,842,922,321 times (37 million times more than in Mayzner's 20,000-mention collection). Each distinct word is called a "type" and each mention is called a "token." To no surprise, the most common word is "the"....
-- Peter Norvig
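
The type/token distinction in the quote can be made concrete with a minimal sketch. It is an illustration only, not the method used to produce the Google books counts: the helper name `type_token_counts`, the toy sentence, and the whitespace-based tokenization are all assumptions made for the example. The last lines check the "37 million times" figure from the two numbers given in the quote.

```python
from collections import Counter

def type_token_counts(text):
    """Return (number of types, number of tokens, per-word counts).

    Hypothetical helper: lowercases the text and splits on whitespace,
    which is a simplification compared with real corpus tokenization.
    """
    words = text.lower().split()
    counts = Counter(words)              # one entry per distinct word (type)
    n_types = len(counts)                # distinct words
    n_tokens = sum(counts.values())      # total mentions
    return n_types, n_tokens, counts

# Toy sentence, invented for illustration.
sample = "the cat saw the dog and the dog saw the cat"
n_types, n_tokens, counts = type_token_counts(sample)
print(n_types, n_tokens, counts.most_common(1))

# Ratio of the two mention counts quoted above.
google_tokens = 743_842_922_321
mayzner_tokens = 20_000
ratio_millions = google_tokens / mayzner_tokens / 1e6
print(round(ratio_millions))
```

On the toy sentence this gives 5 types and 11 tokens, with "the" as the most common word, and the ratio of the quoted counts rounds to 37 million.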