Table 4 Overview of the Federalist Papers Data Set

From: Enriching feature engineering for short text samples by language time series analysis

Author    Number of sentences   Total number of tokens   Sample length (tokens)
                                                         Average   Standard deviation
Hamilton  3567                  126,059                  35.3      22.6
Madison   1195                  43,449                   36.4      23.9
Jay       225                   9378                     41.7      21.4
Overall   4987                  178,886                  35.9      22.9