Method for predicting the community of a user. An illustration of the method for predicting which community a user is embedded in. Words are assigned scores (shown as bars above the words in the figure) based on how significantly different their usage is when compared with the global usage (see main text for more details). These scores are generated for the amalgamated text of the users of each community (top left panel), and for the text of the user being tested (top right panel). The scores are compared between a user and all communities and the best match is chosen as the predicted community (bottom panel).