EPJ Data Science

Table 4 Overview of the Federalist Papers Data Set

From: Enriching feature engineering for short text samples by language time series analysis

Author	Number of sentences	Total number of tokens	Average sample length (tokens)	Standard deviation of sample length (tokens)
Hamilton	3567	126,059	35.3	22.6
Madison	1195	43,449	36.4	23.9
Jay	225	9378	41.7	21.4
Overall	4987	178,886	35.9	22.9
