Table 4 Overview of the Federalist Papers Data Set

From: Enriching feature engineering for short text samples by language time series analysis

 

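| Author | Number of sentences | Total number of tokens | Sample length, average (tokens) | Sample length, standard deviation (tokens) |
|---|---|---|---|---|
| Hamilton | 3567 | 126,059 | 35.3 | 22.6 |
| Madison | 1195 | 43,449 | 36.4 | 23.9 |
| Jay | 225 | 9378 | 41.7 | 21.4 |
| Overall | 4987 | 178,886 | 35.9 | 22.9 |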