Skip to main content

Table 6 Ten most common codewords in the corpus (after transposition), in terms of number of tokens (absolute frequency)

From: Heaps’ law and vocabulary richness in the history of classical music harmony

Codeword Chord   Frequency
100010010000 CEG I 257,252
100001000100 CFA IV 145,967
100010000100 CEA vi 119,734
001000010001 DGB V 105,361
000000000000    99,761
100010000000 CE I 86,179
100000000000 C I 78,009
001001000100 DFA ii 75,462
001001010001 DFGB V 70,802
000000010000 G V 58,966