Figure 1
From: Enriching feature engineering for short text samples by language time series analysis
Distribution of the sample text lengths in the Spooky Books Data Set. Average = 30.4 tokens, median = 26 tokens, 0.75 quantile = 38 tokens, 0.95 quantile = 65 tokens.
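
The summary statistics quoted in the caption (average, median, 0.75 and 0.95 quantiles of the text length in tokens) could be computed along the following lines. This is only a minimal sketch: the tokenizer behind the figure is not stated here, so whitespace splitting is an assumption, and the function name `length_summary` and the example sentences are hypothetical placeholders, not data from the Spooky Books Data Set.

```python
import statistics


def length_summary(texts):
    """Summarise text lengths, measured in whitespace-separated tokens.

    Assumption: tokens are obtained with str.split(); the tokenizer
    used for the figure may differ. Needs at least two texts.
    """
    lengths = [len(text.split()) for text in texts]
    # n=20 yields 19 cut points at 0.05, 0.10, ..., 0.95.
    cuts = statistics.quantiles(lengths, n=20, method="inclusive")
    return {
        "mean": statistics.mean(lengths),
        "median": statistics.median(lengths),
        "q75": cuts[14],  # 15/20 = 0.75 quantile
        "q95": cuts[18],  # 19/20 = 0.95 quantile
    }


if __name__ == "__main__":
    # Hypothetical placeholder sentences, for illustration only.
    samples = [
        "a short sample",
        "a slightly longer sample sentence",
        "the longest of the three placeholder sample sentences here",
    ]
    print(length_summary(samples))
```

A mean that lies above the median, as in the caption (30.4 versus 26 tokens), is the usual sign of a right-skewed length distribution with a tail of long samples.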