Figure 2From: Enriching feature engineering for short text samples by language time series analysisDistribution of the sample text lengths in the Federalist Papers Data Set for papers with known authors. Average = 35.9 tokens, median = 32 tokens, 0.75 quantile = 47 tokens, 0.95 quantile = 78 tokensBack to article page