FigureĀ 2
From: Word usage mirrors community structure in the online social network Twitter

Proportion of users whose community is correctly predicted. The proportion of users whose topological community association is correctly predicted by analysing a random sample of words, as a function of the number of words sampled. Results are presented for both the modularity maximisation partition (users from only English-speaking communities are shown as red pluses, users from any community as blue circles), and the Map Equation partition (English-speaking communities are shown as black crosses, all communities are as blue squares). For each data point, 5,000 users were tested. Standard error of each point isĀ .