From: Enriching feature engineering for short text samples by language time series analysis
Author | Number of sentences | Total number of tokens | Sample length, average (tokens) | Sample length, standard deviation (tokens) |
---|---|---|---|---|
Hamilton | 3567 | 126,059 | 35.3 | 22.6 |
Madison | 1195 | 43,449 | 36.4 | 23.9 |
Jay | 225 | 9378 | 41.7 | 21.4 |
Overall | 4987 | 178,886 | 35.9 | 22.9 |
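
The averages in the table can be cross-checked against the counts. The following is a minimal sketch, assuming each sample is one sentence, so that the average sample length equals total tokens divided by number of sentences. The names `counts` and `mean_sample_length` are illustrative and not taken from the paper. The standard deviations cannot be recomputed this way, since they require the individual sample lengths.

```python
# Counts copied from the table: (number of sentences, total number of tokens).
counts = {
    "Hamilton": (3567, 126_059),
    "Madison": (1195, 43_449),
    "Jay": (225, 9_378),
}


def mean_sample_length(sentences, tokens):
    """Average tokens per sentence, rounded to one decimal as in the table.

    Assumption: one sample corresponds to one sentence.
    """
    return round(tokens / sentences, 1)


# The "Overall" row is the sum over the three authors.
total_sentences = sum(s for s, _ in counts.values())
total_tokens = sum(t for _, t in counts.values())

averages = {name: mean_sample_length(s, t) for name, (s, t) in counts.items()}
averages["Overall"] = mean_sample_length(total_sentences, total_tokens)

for name, avg in averages.items():
    print(f"{name}: {avg}")
```

Under that assumption, the recomputed values agree with the reported averages, and the per-author counts add up to the "Overall" row.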