Figure 2From: Word usage mirrors community structure in the online social network TwitterProportion of users whose community is correctly predicted. The proportion of users whose topological community association is correctly predicted by analysing a random sample of words, as a function of the number of words sampled. Results are presented for both the modularity maximisation partition (users from only English-speaking communities are shown as red pluses, users from any community as blue circles), and the Map Equation partition (English-speaking communities are shown as black crosses, all communities are as blue squares). For each data point, 5,000 users were tested. Standard error of each point is <1%.Back to article page