From: Enriching feature engineering for short text samples by language time series analysis
Requires data set dependent key-value mappings | Functional language sequence has fixed length | Words need to be stemmed | |
---|---|---|---|
Token Length Sequence | – | – | – |
Token Frequency Sequence | ✓ | – | ✓ |
Token Rank Sequence | ✓ | – | ✓ |
Token Length Distribution | – | ✓ | – |
Token Rank Distribution | ✓ | ✓ | ✓ |