
Table 6. The ten most common codewords in the corpus (after transposition), ranked by number of tokens (absolute frequency)

From: Heaps’ law and vocabulary richness in the history of classical music harmony

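| Codeword | Chord | Roman numeral | Frequency |
|---|---|---|---|
| 100010010000 | CEG | I | 257,252 |
| 100001000100 | CFA | IV | 145,967 |
| 100010000100 | CEA | vi | 119,734 |
| 001000010001 | DGB | V | 105,361 |
| 000000000000 | | | 99,761 |
| 100010000000 | CE | I | 86,179 |
| 100000000000 | C | I | 78,009 |
| 001001000100 | DFA | ii | 75,462 |
| 001001010001 | DFGB | V | 70,802 |
| 000000010000 | G | V | 58,966 |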
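
The codewords in the table appear to be 12-bit pitch-class vectors: reading left to right, each bit marks whether one of the twelve pitch classes (starting from C and ascending by semitone) is present. The following is a minimal sketch of that decoding, not the authors' code. The function name `decode_codeword` and the sharp-based spelling of the black-key pitch classes are assumptions made for illustration; the bit-to-pitch mapping itself is inferred from the rows of the table.

```python
# Pitch-class names indexed 0..11, starting from C.
# Assumption: sharps are used for the five black-key pitch classes;
# the table itself only shows natural notes.
PITCH_CLASSES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def decode_codeword(codeword: str) -> str:
    """Return the pitch-class names whose bits are set in a 12-bit codeword.

    Hypothetical helper: bit i (left to right) is taken to mean
    "pitch class i is sounding", with C as pitch class 0.
    """
    if len(codeword) != 12 or set(codeword) - {"0", "1"}:
        raise ValueError("codeword must be a string of exactly 12 binary digits")
    return "".join(name for bit, name in zip(codeword, PITCH_CLASSES) if bit == "1")


if __name__ == "__main__":
    # Codewords copied from the table above.
    for cw in ["100010010000", "100001000100", "001001010001", "000000000000"]:
        print(cw, "->", decode_codeword(cw) or "(no pitch classes)")
```

Under this reading, the all-zero codeword decodes to an empty set of pitch classes, which matches its blank chord entry in the table.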