Gender homophily in online dyadic and triadic relationships
© Laniado et al. 2016
Received: 18 November 2015
Accepted: 28 April 2016
Published: 12 May 2016
Gender homophily, or the preference for interaction with individuals of the same gender, has been observed in many contexts, especially during childhood and adolescence. In this study we investigate such phenomenon by analyzing the interactions of the ∼10 million users of Tuenti, a Spanish social networking service popular among teenagers. In dyadic relationships we find evidence of higher gender homophily for women. We also observe a preference of users with more friends to connect to the opposite gender. A particularly marked gender difference emerges in signing up for the social networking service and adding the first friends, and in the interactions by means of wall messages. In these contexts we find evidence of a strong homophily for women, and little or no homophily for men. By examining the gender composition of triangle motifs, we observe a marked tendency of users to group into gender homogeneous clusters, with a particularly high number of male-only triangles. We show that age plays an important role in this context, with a tendency to higher homophily for young teenagers in both dyadic and triadic relationships. Our findings have implications for addressing gender gap issues, understanding adolescent online behavior and technology adoption, and modeling social networks.
Keywordsgender homophily social networks triangle motifs local clustering coefficient age patterns
Homophily is the tendency of individuals to interact preferentially with similar others. Homophilous behavior in general can be found for many different characteristics and attributes, such as race, age, religion, education, occupation, and gender . The extent of its influence on human behavior is following roughly the aforementioned order according to McPherson et al. . As “a basic organizing principle”, it has been widely studied both for human and animal groups . Gender is one of the most important human attributes which plays an important role during the entire life span , and gender homophily has been largely documented in literature, especially for childhood and adolescence .
The emergence of social networking sites (SNSs) allows to observe the extent of homophily from a computational social science perspective, bypassing possible biases induced by surveys or small and limited samples while getting evidence from digital traces of millions of users. On the one hand, this can lead to complement findings from traditional survey-based studies with evidence from a large portion of the population. On the other hand, it can help to unveil behavioral patterns which are induced by online platforms. As we spend more and more time on social networking services, and they are becoming fundamental pathways for information flow in our society , understanding the impact of homophily in this context is particularly important for comprehending phenomena such as the dynamics of technology adoption, social contagion and segregation [7–9].
As we will see in the following review, most of the existing literature on this issue relies on surveys or observations of limited samples of individuals, while large scale studies based on data from social networking services still leave many aspects unexplored. With this study we aim to fill this gap, extending the work presented in  to offer a detailed picture of gender homophily in Tuenti, a Spanish social networking site especially popular among teenagers.
1.1 Gender differences in social network sites
It is a common believe that men are more frequently early adopters of new technologies. However, in the case of many social media websites and services women are in the vanguard. Thus, women outnumbered men by a considerable amount for most social networking sites [11, 12], with Pinterest having the largest gender inequality  and LinkedIn being one of the few exceptions.1 With technology entering the mass market, women lean in and overtake males not only in spending time on social networking platforms, but also in owning gadgets or playing casual social games . Madden et al.  showed that girls between 14 and 17 are more active on SNSs: they are more likely to use SNSs than boys. This gender difference continues and stays valid for the overall SNS users . Asking teenagers about how they actually use SNSs, Espinoza and Juvonen  found that girls not only spend more time on SNSs than boys, but also that this usage is “more central” to their social lives and that social network sites interfere more in their lives. Hargittai and Hsieh  showed gender differences in the level of engagement with social practices on SNSs. While, according to their survey, women engage in more strong-tie activities than men, e.g. interacting with existing friends, women pursue fewer weak-tie activities than men, e.g. developing new relationships. As shown by boyd , “Older boys are twice as likely to use the sites to flirt and slightly more likely to use the sites to meet new people than girls of their age. Older girls are far more likely to use these sites to communicate with friends they see in person than younger people or boys of their age.”
Among the first works focused on gender differences in online friendship preferences based on data collected from social networking sites were Lewis et al.  for Facebook and Thelwall  for MySpace. Another study  analyzed online social interactions in the setting of a massive multi-player online game. The authors found that males reciprocate friendship requests from females faster and that females have more communication partners. The linguistic style of messages has also been shown to be influenced by gender in Twitter , Facebook  and Wikipedia .
As most of the studies rely on analysis of US-based users , often mixing the gender-dimension with racial aspects, some of these findings can be less relevant in non-US contexts. Gender influence on access to information and communication technologies often varies according to local and cultural practices [26–28].
In this work we use a complete dump of a large Spanish social networking service to present an extensive analysis of gender preferences emerging online. Spain is among the most “social media addicted” countries in the European Union  with almost 75% of the Spaniards using Internet as an instrument for communication and interaction with others. Focusing on gender differences in Spanish adolescent lifestyles, Hernando et al.  found that Spanish females are more and longer connected (via cell phone and the Internet) than males and that females hang out more than males with friends online. Despite the above mentioned findings, there is still a lack of understanding of gender roles in online social communications, for the US and even more for non-US contexts. Furthermore, to the best of our knowledge no extensive study based on evidence from large scale data has inspected the impact of gender preferences on joining a social networking site.
1.2 Gender homophily in the offline world and on social network sites
In early studies of face-to-face interactions Shrum et al.  analyzed gender and racial homophily in a sample of friends from an American school (junior, middle and high school). Their findings indicate that racial homophily increases and gender homophily decreases with school grade. In the Netherlands, Baerveldt et al.  looked at gender and ethnic homophily for a sample of adolescents between 16 and 18 years from 20 urban high schools. They found a high tendency to gender homophily (more pronounced for girls) and ethnic homophily in all studied ethnic groups. To explain this slightly higher female homophily, Aukett et al.  showed that the degree of emotionality and intimacy in same-gender friendship is higher for women than men. Women do also tend to place a higher value on these friendships than men do. Maccoby  argued that the decrease of gender homophily with age lies in the interest in the opposite gender. According to Rose and Rudolph’s review , girls seem to have a greater preference for extended dyadic interactions and pro-social behavior, while boys interact more in peer groups with a high network density and clear dominance hierarchy.
We refer to [35, 36] for reviews of gender homophily studies. Hereby, Stehlé et al.  present a detailed picture, as they show an increasing gender homophily with age for strong ties, defined as pairs of children who interact more than a defined threshold, while for weak ties, they find for gender homophily a negative correlation for girls and age and a positive correlation for boys and age.
As already seen in research on gender homophily presented in the previous paragraphs, some studies find few gender differences [37–43], while others [36, 44–49] show important differences in the quantity and quality of male and female friendship patterns both in the offline and online world. One reason for these contradicting results might be the different components of friendship, as well as differences in age and culture . Most of the studies are based on surveys or self-reported data with rather small samples, which might reflect patterns of specific milieus. Studies with large data sets allow to discover general patterns, extending the case study perspective towards a global overview of gender homophily. In this direction, a remarkable effect of gender homophily was found for interactions in online games  and in Wikipedia , a community with a strong gender gap. Large scale analyses of social networking sites instead reported no homophily for messages posted on one another’s profile in MySpace , and only a neglectable effect of homophily on the Facebook friendship graph . However, these studies only consider gender, age and degree separately, so they leave the question open of whether different results for gender homophily would be found for users of different age groups or with different number of connections.
1.3 Gender homophily in triadic relationships
The study of dyads and triads is crucial to understand social structure, already reflected in guiding thoughts of classic sociology such as Simmel’s question “What is society?” . A dyad represents the smallest possible social group, a pair of individuals, being the core of any “intersubjective relationship”. A triad is a group of three people, forming the building block of social order and society . Hence, the detailed analysis of the dyadic and triadic structure of a SNS allows to draw a picture of (gendered) group structure and cohesion.
While the concept of homophily in previous literature is mostly focused on preferences in dyadic relationships, few studies have inspected the effects of gender homophily on triadic relationships and on larger groups. Among these, Goodreau et al. , in a study based on self-reported friendship relationships in several U.S. schools, found a higher probability of triadic closure in children friendship when at least one girl was involved, and a similar pattern was reported for teenagers . Kossinets and Watts  found no influence of gender on triadic closure in a university’s email exchanges, while Huang et al.  analyzing the factors which influence triadic closure in microblogging, observed little influence of gender, with a slightly higher probability of closing a triangle when the third user is a woman. Similarly, Szell and Thurner  found a higher clustering coefficient for female users in trade networks in online games. Kovanen et al.  investigated temporal triangle motifs in mobile phone calls and their composition according to age and gender reporting a prevalence of all-females motifs. David-Barrett et al. , by analyzing profile pictures in which more than one person appear, found that women favor dyadic relations, while men favor larger, all-male cliques. If we exclude studies on microblogging, representing a special scenario given its usage for news consuming and its asymmetric connections which are less likely to represent real friendship ties , no extensive study has focused on gender homophily in triads in social networking services. Furthermore, age has been mostly neglected by the literature in this context so far. The only exception are a few studies that have analyzed the interplay between gender homophily and user age in triads in the field of mobile communications [58, 60].
1.4 Research questions
The preceding discussion has reviewed the phenomenon of gender homophily in the offline and online world, evidencing how this varies with age, and is especially relevant during childhood and adolescence. It has also shown that most of the existing literature on this matter is based on surveys or observations of reduced samples of individuals, while large scale studies on social network sites data still leave many relevant aspects, such as age patterns, mostly unexplored. In this study we aim to deepen our understanding of gender homophily and its impact on crucial aspects including the way in which users join a social networking service, the establishment of preferential relationships and grouping patterns.
RQ1: How does gender homophily affect joining a new social environment?
RQ2: How does gender homophily affect the establishment of connections and the interactions?
RQ3: How does gender homophily affect the strongest interactions of a user?
RQ4: How does gender homophily affect the creation of groups?
In the rest of the paper, we will tackle these questions by presenting a detailed analysis of data from the Spanish social networking platform Tuenti, with a special focus on young users and how homophily varies according to user age.
2 Dataset and methods
2.1 The social network site Tuenti
This study is based on a complete anonymized snapshot of the Spanish social networking service Tuenti,2 extracted on December 11, 2010. At the time of data collection, Tuenti (the name comes from “tu [id]enti[dad]”, Spanish for “your identity”) was one of the largest Spanish social networking platforms and was sometimes referred to as the “Spanish Facebook”. It provided many features common to other popular social networking platforms: it allowed users to set up a profile, connect with friends, share web links and media items and write on each other’s walls. In particular, the terms of agreement specified by Tuenti did not allow kids younger than 14 to join the service and obliged users to specify a place of residence located in Spain. From 2006, the founding year, until November 2011, one year after our data was collected, Tuenti was an invitation-only social networking service.
2.2 Demographic composition
Number of users in the Tuenti dataset broken down by gender
≥1 reciprocal interactions
2.3 Friendship and interaction networks
The Tuenti dataset contains a complete lists of all online friendship connections and for every user also the order by which the user was adding her/his friends. Since Tuenti was an invitation-only social network service by the time the dataset was collected, we assume that the first friend of a user is the one who successfully invited her/him to join the service. Although this assumption is not necessarily 100% correct (e.g., a user might have removed the first friend, or the first friend might have quit the service before the data were collected), we believe that such exceptions happen only in a very reduced number of cases and do not affect our results.
Furthermore, the dataset contains all interactions in the form of the number of messages posted by a user on another user’s page (wall) during a period of three months between September 11 and December 11 2010.
The friendship network is based on all friendship connections between users. This network is undirected since friendship connections in Tuenti are reciprocal.
The interaction network is a sub-network of the friendship network in which we only keep links between two users if they have sent to each other at least one wall message during the three months period of observation. We note that the interaction network is as well undirected, as we only take into account reciprocal interactions.
Number of connections in the friendship network and in the network of reciprocal interactions, broken down by gender
Hereinafter, when analyzing gender homophily by age we focus especially on younger users, i.e. until their twenties. We only show results for users younger than 50 and omit older users for whom data, in general, is very sparse and not very representative.
When showing results averaged by user, we furthermore omit users having few connections to avoid the possible biases (towards very large or very small fractions) these users could introduce in the results. For the friendship network, we omit users having less than 10 friends, while for the interaction network we omit users who had reciprocal interactions with less than two friends during our three-months observation period.
2.4 Null model for assessing gender homophily
The numbers of men and women in the networks are not exactly equal, and more importantly, their degree distributions are not equal. Women have more connections, especially in the interaction network, and this leads to a higher number of dyadic and triadic relationships involving women. As the networks are unbalanced it is difficult to assess the impact of homophily just by observing in absolute terms the results obtained.
To compensate for this inequality, we assess how the results we observe differ from the results one should expect given the user composition of the networks. To do so, first we produce randomized equivalents of our networks by re-shuffling users’ attributes, i.e. age and gender. To maintain the same gender and age proportions, and the same degree distribution for each gender and age, we randomly re-shuffle the attributes of all users having the same degree (keeping the attributes of the same user together). Therefore, the resulting networks have the same identical link structure as the original network, and have the same number of connections involving men and women, as well as the same number of connections for users of each age. The demographic composition of the network, i.e. the proportion of men and women for each age, is also respected. In the following we will refer to such networks as shuffled networks. It should be noted that our method is based on shuffling user attributes, and not reshuffling connections as frequently done in the analysis of complex networks [28, 65]. Comparing the results observed in the real networks with the average of the results obtained in 10 of these shuffled networks allows us to assess to what extent gender preferences are affecting user behavior.
3 Results and discussion
In this section we answer the research questions defined above by analyzing the friendship and interaction networks in the case of Tuenti. In all the figures, we use red and blue (or pink and cyan) to depict women and men, respectively. We use continuous lines to show the results observed in the networks, and dashed lines to show the results expected according to the null model.
3.1 RQ1: gender homophily in building the online social environment
Gender has been observed to play a crucial role in defining people’s decisions about adopting and using new technologies. Thus, men are reported to be more driven by instrumental factors (i.e. perceived usefulness) while women to be more motivated by process and social factors .
In this section we study the influence of gender homophily on building users’ online social environment, by examining differences in how men and women start their online social experience and how they organize their personal social network. We compare the order in which they are making friends of the same or opposite gender and inspect how age influences their gender preferences.
3.1.1 The first friend
We observe a similar trend for the second friend of a female user in the case that the first friend was already a woman. However if, on the contrary, the first friend of a woman was a man, the probability of being the second friend as well a man rises to 42%. For male users the dependency on the gender(s) of the first two friends is even stronger: the second friend has in almost 6 of 10 cases the same gender as the first friend.
3.1.2 Friendship order
3.1.3 Age patterns
Although we consider the current age of a user with respect to an action (signing up) that might have happened in the past, the age difference with respect to the moment in which users registered can not exceed 4 years (the dataset was collected in 2010, and Tuenti was created in 2006) and is mainly between 1 and 2 years (Tuenti’s popularity “boom” started in 2008). Therefore, we believe this issue does only slightly affect the results.
The results show that women organize their online social environment differently from men especially in the initial steps, as they are more likely to add other women as their initial friends and to try a new service and enter a new social environment following an invitation by another woman. In particular, among women between 14 and 16, three out of four joined the SNS accepting an invitation from a female friend. The lower homophily observed for male users may reflect the difference in purposes to sign up for social networking services as it was reported in previous literature. Contrarily to women who use SNSs mainly for relationship maintenance , men have been reported to use them to a higher extent for meeting new people and finding potential dates [4, 18]. The dataset does not contain information about rejected invitations, so we do not know to which extent such strong preferences are due to women being more active in inviting other women, or more prone to accept invitations received from other women than from men. In any case, the results show evidence that gender matters at the moment of joining a new SNS. In particular, our analysis suggests that for women, and especially teenagers, the perceived presence of other women is very important in the first stages into a new virtual environment. This finding is particularly relevant for fostering technology adoption among women, and can help understand and address the gender gap issues suffered by some online communities, such as for example Wikipedia .
3.2 RQ2: gender homophily in establishing connections and interacting with friends
While in the previous section we have focused on the first steps of a user in joining the social networking platform and adding the first friends, we now take a wider view on gender preferences in adding friends and interacting with them in the social networking service. We present some general statistics about homophilous behavior and also look in more detail at how this behavior varies according to the degree and the age of the users.
3.2.1 General statistics
Basic friendship statistics by gender, together with 25% and 75% quantiles
avg # male
avg # female
avg % same gender
For female users, reciprocal interactions with other women are prevalent: they talk on average to 20 other women and 12 men. For men we find that they contact women just a little more often than men. However, this is due to the higher activity of female users, as shown by the stronger preference for interacting with women observed in the shuffled networks.
3.2.2 Gender homophily by degree
3.2.3 Age patterns
Our finding of a generally higher homophily for women is consistent with offline studies where men were reported to have 65% and women 70% of same gender friends . Interestingly, the percentages of same gender friends we found in the friendship network are lower than the ones reported for offline studies. This attenuation of the evidence of homophilous behavior might be due to the ease of adding a “friend” in a social networking service compared to considering someone as a friend in real life, and therefore to a presence of casual relationships in which gender is less relevant. The higher homophily of users having smaller circle of friends, and therefore being possibly more selective in adding friends in the SNS, may be interpreted as an element to support this hypothesis. A similar effect of degree, with gender homophily being prevalent especially for users having few friends, and an opposite tendency to heterophily for users having many friends, was also reported for Facebook based on surveys . An exception to this rule are in our results male users with very small circles of friends, who on the contrary exhibit a slight preference to connect and interact with women.
In line with previous literature [5, 18] the results show evidence of higher homophily among young teenagers, decreasing with age. The inverse pattern observed for male users in the interaction network, with homophily increasing until 22, seems to indicate a different behavior for the two genders, with an increase of the interest for the opposite gender having the strongest effect for female users around the age of 17-18.
Gender homophily observed in the Tuenti friendship network is in contrast with the neglectable homophily reported for the Facebook social graph . This might be due to several reasons, including the higher average degree in Facebook (which according to our findings is associated with lower homophily as discussed above) and the different average age, with Tuenti having an over-representation of teenagers.
3.3 RQ3: gender homophily in strongest online interactions
Following [70–72], where authors introduced simple proxies such as communication reciprocity [70, 73] or interaction frequency  to quantify different dimensions of tie strengths , we examine whether strong interaction ties are likely to exhibit greater homophily.
For each user we say the most messaged friend to be the one to whom that user has sent the highest number of wall messages, and from whom she/he has received at least one message. In this way, we select the friend to which each user has devoted most of her/his attention, among the ones who have reciprocated such attention at least once. Therefore, although the interaction network is undirected, this relationship is directed and not necessarily reciprocal. To insure that each user has only one most messaged friend we introduce the following procedure for ties resolution: first we look at the number of messages received by the user from the candidate friends, and choose the friend with the largest number. In cases when there are still more than one candidate, i.e. interaction values are tied again, we pick randomly one of the friends having the maximum values of both messages received from and sent to the user.
The presented approach has the advantage of focusing on a user’s actions to quantify her/his preference, without influence of the higher or lower activity levels of her/his friends. In this way we avoid the effects of the possible tendency of some friends to post more or less wall messages. However, as a drawback, this measure is based on an asymmetric definition of tie strength. To check the impact of this asymmetry, we introduce for comparison an alternative symmetric measure, consisting in the minimum of the numbers of messages exchanged between two users in the two possible directions (and then, in case of tied values, the maximum value of the two as secondary criterion). We found that in 93.4% of the cases this balanced metric leads to select the same friend as in the asymmetric case, and the results obtained do not differ noticeably from the ones shown in what follows.
The most messaged friend has the same gender in the 44.2% of the cases for male users and in the 77.1% of the cases for female users. For women the percentage is higher than the one observed in the interaction network (67.3%), while for men it is lower (47.5%). In the next subsection we inspect how the age of a user influences such preferences.
3.3.1 Age patterns
Overall, users of both genders are more likely to have their strongest interaction with a woman (in 67% of the cases). Therefore, in this context we observe strong gender homophily for women, but not for men. This result is partly different from what was observed in studies of offline behavior  and for mobile phone networks , where the strongest social ties correspond to different-gender romantic relationships. This pattern, which characterizes especially teenagers, can be interpreted in light of the higher importance of stronger-tie activities for girls, as reported in [16, 17] and .
3.4 RQ4: gender homophily in triadic relationships
In the literature gender homophily has been mostly investigated at the level of dyadic, i.e. one-to-one, relationships. In other words the primer interest was to study how people make and communicate with their friends regardless the social group they are in. In this section we deepen the analysis of gender homophily and go beyond dyadic relationships by inspecting how gender affects group creation and community structure. To this end we focus on triadic relationships, the building blocks of any cohesive group structure.
A triadic relationship (or transitive relationship, or simply triangle) is a group of three users all connected to each other. A high presence of triangles (or a high clustering coefficient) is one of the key elements that distinguish social networks from other kinds of networks, such as biological or technological ones . Therefore, it is particularly relevant to assess how gender affects the formation of transitive relationships.
In the following we study the gender compositions of triangle motifs in the friendship and interaction networks at the global level, and then check the impact of gender on the formation of transitive relationships in the ego network of each user.
3.4.1 Global count of triangle motifs
There are four possibilities for the gender composition of the triangles: 3 women, 3 men, 1 man and 2 women, or 2 men and 1 woman. In case of a perfectly gender balanced network, one could expect, using the binomial distribution, to have exactly 12.5% man-only triangles, 12.5% woman-only triangles, and 37.5% of the triangles in each of the two mixed triangle possibilities. However, as already observed the networks are not gender-balanced, and a remarkable difference in the number of connections involving men and women exists. This is true especially in the interactions network where female users are much more active, which leads to a higher overall number of triangles involving women.
Therefore, to assess how gender influences the formation of transitive relationships we observe to what extent the results deviate from the ones expected according to our null model, by comparing the proportion of triangles observed in the real networks with the average proportion obtained over 10 of the networks in which we have reshuffled user attributes. The results, reported in the following, are all highly significant: the standard deviation of the values observed for the reshuffled networks is smaller than 0.0005.
Proportion of triangle motifs with different gender composition in the friendship and interaction networks
Type of triangle
1 female, 2 males
2 females, 1 male
3.64 × 1010
1.24 × 108
When analyzing the interaction network, i.e. the connections which mutually exchanged messages, we find a striking difference between men and women, as can be observed in the two rightmost columns in Table 4. The number of female only triangles is about 3 times larger than the number of male only triangles. This difference seems high, however reshuffling shows that again we would actually have to expect an even larger disproportionality between male only triangles and female only triangles, given that women are much more active in sending (and receiving) messages. In this case the proportion of male only triangles exceeds by 60% the expected value, while the proportion of female only triangles is only 28.5% higher than expected. This indicates that male users are in general less active in the SNS, but when they interact they tend to create more gender homogeneous groups.
While the results presented so far are based on the total number of triangles of different composition in the network, and might be affected by users having higher degrees, in the following we focus on individual users, inspecting the presence of triadic relationships in their ego-network, and looking at how this varies according to age and gender.
3.4.2 Clustering coefficient by user age
We now check how the tendency to create tightly knit groups, and specifically gender heterogeneous or gender homogeneous ones, changes with user age. To do this we rely on the notion of local clustering coefficient, which is defined as the proportion between the number of triangles in which a node is involved, and the total number of triangles in which it could be involved given its degree .
Beyond looking at the local clustering coefficient of a user in the overall network (i.e., based on triangles of any gender composition, normalized by all the connections of a user) we also define a gender-restricted clustering coefficient, when we do the same considering only friends that have the same gender as the selected user. This is the local clustering coefficient of the users in the two gender-homogeneous networks obtained by removing respectively all male and all female users. As a result, we count gender-homogeneous triangles involving the user, normalized by the number of connections with users of the same gender. So, while the local clustering coefficient of a user in the overall network indicates the tendency of a user to form transitive relationships in general, the gender-restricted one measures the tendency to form gender homogeneous groups.
We observe that in both networks clustering decreases with age for young users, then it starts to increase again. This general trend is not aligned with the one expected according to the null model indicating a marked age pattern in the data. The strongest deviations are found for young teenagers, whose tendency to form dense groups is much larger than expected, and for users over 20, who on the contrary exhibit sparse relationships.
In comparison with values in the overall network and in the null model, gender-restricted clustering coefficient is especially high in both networks for teenagers. For female users it is very high below an age of 16 and then decreases rapidly with age, while for male users it decreases more slowly, remaining higher than expected until about an age of 23. Furthermore, for female users older than 20 we observe in both networks an opposite tendency to less gender-homogeneous groups, which we find for male users only to a minor extent over the age of 23 in the interaction network.
The above results show that users do not only tend to connect preferentially with others of the same gender, but they also tend to group more by gender, and to create gender-homogeneous groups of friends. As demonstrated in , gender segregation is a widespread characteristic of offline social behavior. Our findings show that, in this sense, online social behavior tends to reproduce this offline phenomenon, and that this happens more markedly for male users. In fact, although we find a higher number of triangles involving female users, in apparent agreement with the prevalence of all-female triangle motifs reported for phone calls by Kovanen et al. , when comparing with the null model we observe a higher deviation from the expected values for male users. The decrease in users’ clustering with age indicates that young teenagers tend to have more cohesive groups of friends, and that they diversify their connections as they grow up. The fact that this trend stops at the age of 21-22 seems to tell that around this age users have already diversified their friends, and created connections out of their main groups of friends. The inversion of the tendency for older users might be attributed to the lower presence of older users in our data, as has been showed in Figure 1. While we may assume that for users 14 to 25 years old, most of their friends have a Tuenti account, for older users only a part of their friends are in the SNS. Therefore, the higher clustering for older users can be interpreted as only specific groups among their real-life friends are present online and many diversified connections are missing in our data. The higher tendency of young teenagers to form gender-homogeneous groups, more prolonged in time for male users, confirms findings reported for offline behavior [34, 79, 80].
Recent studies on digital inequalities treat gender in very different ways. Some only concentrate on the influence of gender on human behavior , others such as Zillien  consider gender only as one of many variables in the emergence of digital inequalities, and yet others like boyd  completely ignore the gender dimension. This lack of consistence in considering gender and its influence on digital inequalities indicates that there are still many open questions that need to be addressed. In this study we have presented an extensive analysis of a large social network site to shed light on the phenomenon of gender homophily and to explore how it varies with respect to different kinds of online activities and interactions according to age.
Our analysis of the Tuenti social networking service offers a detailed picture of online behavior for a large portion of the Spanish population: the dataset includes about the 70% of Spanish teenagers. The results are therefore robust for this age group while for adult users, due to the sparsity of the data and a prevalence of inter-generational relationships, some conclusions need further confirmations from qualitative studies or from more representative datasets.
Overall, our results show evidence of gender homophily in dyadic relationships for both genders, being higher for women, and decreasing with age for young teenagers. This was mostly expected according to previous literature on offline and online behavior [3, 31–33, 35]. However, the extent of homophilous behavior is surprisingly high in some settings, such as women’s strong preference for signing up for the social network site on invitation of another woman, adding other women as their initial friends and having the strongest interactions with a woman. The high feminine homophily observed in this context suggests a crucial importance of gender for women in the starting phase of their experience in a new virtual environment. These findings may be particularly relevant for understanding dynamics of technology adoption and contagion in social media, and for facing the gender gap issues that are persistently hard to overcome in some online communities . Our combined analysis of age and gender patterns in particular suggests that the role of active women in involving their female fellows may be a fundamental condition for creating a “network effect” especially among female teenagers. As we only have access to information about all accepted friendship requests and accepted invitations to join the service, we cannot answer the question to which extent female users are in the first steps reluctant to accept invitations from men.
Our results contrast with the neglectable overall effect of homophily reported for the Facebook social graph . Beyond the possible effect of cultural specificities of the Spanish context, the stronger effect of gender homophily in Tuenti might be explained in light of the younger age of its users, or by their lower average degree. In fact, in agreement with survey-based studies focused on Facebook users  we have observed a stronger tendency to homophily for users having lower degrees, and on the opposite a tendency to heterophily, i.e. a preference for the other gender, for users having many connections. Therefore, in absence of more detailed results about Facebook or other similar social networking sites, it is difficult to assess to which extent our different results are due to cultural differences. Further studies on large samples of users from other countries would help to shed light on this aspect.
Contrary to findings reported for mobile phone calls  and microblogging  we did not observe a higher tendency of female users to form gender homogeneous triangles, while we observed a stronger deviation from the expected values in a randomized model for the number of male-only triangles. This result, in apparent contradiction with the higher homophily observed for women in dyadic relationships, is consistent with studies of offline behavior  and of online behavior with respect to profile pictures in SNSs  reporting a higher tendency of males to form gender homogeneous groups. Also in line with offline studies , the tendency to form knit groups of same gender users is specially high for young teenagers, and decreases with age, with a sharper pattern for female users. Our findings show evidence that the interplay of age and gender with local clustering is an important element to understand grouping phenomena and the growth of social networks. While in this study we have highlighted triads as the basic building blocks of larger groups, and studied a snapshot of the social network, analysis of the gender composition of larger cliques and cohesive clusters of users, as well as studies of temporal patterns in tie formation and triadic closure might shed further light on the importance of gender homophily with respect to grouping behavior and network evolution dynamics.
report by Rapleaf: http://readwrite.com/2008/07/29/social_networks_women_outnumber_men
Yana Volkovich was supported by the People Programme (Marie Curie Actions, from the FP7/2007-2013) under grant agreement no. 600388 managed by REA and ACCIÓ.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
- Lazarsfeld PF, Merton RK et al. (1954) Friendship as a social process: a substantive and methodological analysis. In: Freedom and control in modern society, vol 18, pp 18-66 Google Scholar
- McPherson M, Smith-Lovin L, Cook JM (2001) Birds of a feather: homophily in social networks. Annu Rev Sociol 27:415-444 View ArticleGoogle Scholar
- David-Barrett T, Rotkirch A, Carney J, Behncke Izquierdo I, Krems JA, Townley D et al. (2015) Women favor dyadic relationships, but men prefer clubs: cross-cultural evidence from social networking. PLoS ONE 10(3):e0118329 View ArticleGoogle Scholar
- Mehta CM, Strough J (2009) Sex segregation in friendships and normative contexts across the life span. Dev Rev 29(3):201-220 View ArticleGoogle Scholar
- Maccoby EE (2002) Gender and group process: a developmental perspective. Curr Dir Psychol Sci 11(2):54-58 View ArticleGoogle Scholar
- Castells M (2011) The rise of the network society: the information age: economy, society, and culture, vol 1. Wiley, New York Google Scholar
- Currarini S, Jackson MO, Pin P (2009) An economic model of friendship: homophily, minorities, and segregation. Econometrica 77(4):1003-1045 MathSciNetView ArticleMATHGoogle Scholar
- Moody J (2001) Race, school integration, and friendship segregation in America. Am J Sociol 107(3):679-716 MathSciNetView ArticleGoogle Scholar
- Dow PA, Adamic LA, Friggeri A (2013) The anatomy of large Facebook cascades. In: ICWSM Google Scholar
- Volkovich Y, Laniado D, Kappler K, Kaltenbrunner A (2014) Gender patterns in a large online social network. In: The 6th international conference on social informatics (SocInfo’14). Springer, Berlin Google Scholar
- Duggan M, Brenner J (2013) The demographics of social media users – 2012. Pew Research Center Google Scholar
- Madden M, Lenhart A, Cortesi S, Gasser U, Duggan M, Smith A et al (2013) Teens, Social Media, and Privacy. Pew Internet Research Google Scholar
- Ottoni R, Pesce JP, Las Casas D, Franciscani G Jr, Meira W Jr, Kumaraguru P et al. (2013) Ladies first: analyzing gender roles and behaviors in Pinterest. In: Proc. ICWSM Google Scholar
- Cook SG (2012) Women lead in adopting new technologies. Women Higher Educ 21(2):24-25 View ArticleGoogle Scholar
- Madden M, Lenhart A, Duggan M, Cortesi S, Gasser U (2013) Teens and technology 2013. Pew Internet & American Life Project, Washington Google Scholar
- Espinoza G, Juvonen J (2011) The pervasiveness, connectedness, and intrusiveness of social network site use among young adolescents. Cyberpsychol Behav Soc Netw 14(12):705-709 View ArticleGoogle Scholar
- Hargittai E, Hsieh YP (2010) Predictors and consequences of differentiated practices on social network sites. Inf Commun Soc 13(4):515-536 View ArticleGoogle Scholar
- Boyd D (2007) Why youth (heart) social network sites: the role of networked publics in teenage social life. In: MacArthur foundation series on digital learning – youth, identity, and digital media volume, pp 119-142 Google Scholar
- Lewis K, Kaufman J, Gonzalez M, Wimmer A, Christakis N (2008) Tastes, ties, and time: a new social network dataset using facebook.com. Soc Netw 30(4):330-342 View ArticleGoogle Scholar
- Thelwall M (2008) Social networks, gender, and friending: an analysis of MySpace member profiles. J Am Soc Inf Sci Technol 59(8):1321-1330 View ArticleGoogle Scholar
- Szell M, Thurner S (2013) How women organize social networks different from men. Sci Rep 3:1214 View ArticleGoogle Scholar
- Kivran-Swaine F, Brody S, Naaman M (2013) Effects of gender and tie strength on Twitter interactions. First Monday 18:9 View ArticleGoogle Scholar
- Schwartz HA, Eichstaedt JC, Kern ML, Dziurzynski L, Ramones SM, Agrawal M et al. (2013) Personality, gender, and age in the language of social media: the open-vocabulary approach. PLoS ONE 8(9):e73791 View ArticleGoogle Scholar
- Iosub D, Laniado D, Castillo C, Fuster Morell M, Kaltenbrunner A (2014) Emotions under discussion: gender, status and communication in online collaboration. PLoS ONE 9(8):e104880. doi:10.1371/journal.pone.0104880 View ArticleGoogle Scholar
- Ahn J (2011) Teenagers and social network sites: do off-line inequalities predict their online social networks? First Monday 17:1 Google Scholar
- Drabowicz T (2014) Gender and digital usage inequality among adolescents: a comparative study of 39 countries. Comput Educ 74:98-111 View ArticleGoogle Scholar
- Magno G, Weber I (2014) International gender differences and gaps in online social networks. In: Social informatics. Springer, Berlin, pp 121-138 Google Scholar
- Goodreau SM, Kitts JA, Morris M (2009) Birds of a feather, or friend of a friend? Using exponential random graph models to investigate adolescent social networks. Demography 46(1):103-125 View ArticleGoogle Scholar
- E-Communications Household Survey Summary 2010. Public Opinion Analysis, European Commission Google Scholar
- Hernando Á, Oliva A, Ángel Pertegal M (2013) Diferencias de género en los estilos de vida de los adolescentes. Psicosoc Interv 22(1):15-23 View ArticleGoogle Scholar
- Shrum W, Cheek NH Jr, MacD S (1988) Friendship in school: gender and racial homophily. Sociol Educ 61(4):227-239 View ArticleGoogle Scholar
- Baerveldt C, Van Duijn MA, Vermeij L, Van Hemert DA (2004) Ethnic boundaries and personal choice. Assessing the influence of individual inclinations to choose intra-ethnic relationships on pupils’ networks. Soc Netw 26(1):55-74 View ArticleGoogle Scholar
- Aukett R, Ritchie J, Mill K (1988) Gender differences in friendship patterns. Sex Roles 19(1-2):57-66 View ArticleGoogle Scholar
- Rose AJ, Rudolph KD (2006) A review of sex differences in peer relationship processes: potential trade-offs for the emotional and behavioral development of girls and boys. Psychol Bull 132(1):98-131 View ArticleGoogle Scholar
- Stehlé J, Charbonnier F, Picard T, Cattuto C, Barrat A (2013) Gender homophily from spatial behavior in a primary school: a sociometric study. Soc Netw 35(4):604-613 View ArticleGoogle Scholar
- Vigil JM (2007) Asymmetries in the friendship preferences and social styles of men and women. Hum Nat 18(2):143-161 View ArticleGoogle Scholar
- Apicella CL, Marlowe FW, Fowler JH, Christakis NA (2012) Social networks and cooperation in hunter-gatherers. Nature 481(7382):497-501 View ArticleGoogle Scholar
- Bruckner E, Knaup K (1993) Women’s and men’s friendships in comparative perspective. Eur Sociol Rev 9(3):249-266 Google Scholar
- Sheets VL, Lugar R (2005) Friendship and gender in Russia and the United States. Sex Roles 52(1-2):131-140 View ArticleGoogle Scholar
- Roberts SG, Dunbar RI, Pollet TV, Kuppens T (2009) Exploring variation in active network size: constraints and ego characteristics. Soc Netw 31(2):138-146 View ArticleGoogle Scholar
- Burleson BR (1997) A different voice on different cultures: illusion and reality in the study of sex differences in personal relationships. Pers Relatsh 4(3):229-241 View ArticleGoogle Scholar
- Oxley NL, Dzindolet MT, Miller JL (2002) Sex differences in communication with close friends: testing Tannen’s claims. Psychol Rep 91(2):537-544 View ArticleGoogle Scholar
- Rotkirch A, Lyons M, David-Barrett T, Jokela M (2014) Gratitude for help among adult friends and siblings. Evol Psychol 12(4):673-686 View ArticleGoogle Scholar
- Tiger L (1974) Sex-specific friendship. In: The compact: selected dimensions of friendship: St John’s Memorial University of New Foundland, pp 42-48 Google Scholar
- Baron-Cohen S, Wheelwright S (2003) The friendship questionnaire: an investigation of adults with Asperger syndrome or high-functioning autism, and normal sex differences. J Autism Dev Disord 33(5):509-517 View ArticleGoogle Scholar
- Benenson JF, Quinn A, Stella S (2012) Boys affiliate more than girls with a familiar same-sex peer. J Exp Child Psychol 113(4):587-593 View ArticleGoogle Scholar
- Belle D (1989) Children’s social networks and social supports, vol 136. Wiley, New York Google Scholar
- Caldwell MA, Peplau LA (1982) Sex differences in same-sex friendship. Sex Roles 8(7):721-732 View ArticleGoogle Scholar
- Duck S, Wright PH (1993) Reexamining gender differences in same-gender friendships: a close look at two kinds of data. Sex Roles 28(11-12):709-727 View ArticleGoogle Scholar
- Laniado D, Kaltenbrunner A, Castillo C, Morell MF (2012) Emotions and dialogue in a peer-production community: the case of Wikipedia. In: Proc. WikiSym Google Scholar
- Thelwall M (2009) Homophily in MySpace. J Am Soc Inf Sci Technol 60(2):219-231 View ArticleGoogle Scholar
- Ugander J, Karrer B, Backstrom L, Marlow C (2011) The anatomy of the Facebook social graph. arXiv preprint arXiv:1111.4503
- Simmel G (1910) How is society possible? Am J Sociol 16(3):372-391 View ArticleGoogle Scholar
- Bedorf T (2003) Dimensionen des Dritten. W. Fink, München Google Scholar
- Kirke DM (2009) Gender clustering in friendship networks: some sociological implications. Methodol Innov 4(1):23-36 Google Scholar
- Kossinets G, Watts DJ (2006) Empirical analysis of an evolving social network. Science 311(5757):88-90 MathSciNetView ArticleMATHGoogle Scholar
- Huang H, Tang J, Wu S, Liu L et al. (2014) Mining triadic closure patterns in social networks. In: Proceedings of the companion publication of the 23rd international conference on world wide web companion. International World Wide Web Conferences Steering Committee, pp 499-504 Google Scholar
- Kovanen L, Kaski K, Kertész J, Saramäki J (2013) Temporal motifs reveal homophily, gender-specific patterns, and group talk in call sequences. Proc Natl Acad Sci 110(45):18070-18075 View ArticleGoogle Scholar
- Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on world wide web. ACM, New York, pp 591-600 Google Scholar
- Kovanen L, Karsai M, Kaski K, Kertész J, Saramäki J (2011) Temporal motifs in time-dependent networks. J Stat Mech Theory Exp 2011(11):P11005 View ArticleGoogle Scholar
- Telefónica F (2012) La Sociedad de la Información en España 2011. Fundación Telefónica. Available from: http://www.fundaciontelefonica.com/arte_cultura/publicaciones-listado/pagina-item-publicaciones/?itempubli=126
- Mujeres y hombres en España (2013). Instituto Nacional de Estadistica Google Scholar
- Volkovich Y, Scellato S, Laniado D, Mascolo C, Kaltenbrunner A (2012) The length of bridge ties: structural and geographic properties of online social interactions. In: The international AAAI conference on weblogs and social media (ICWSM’12) Google Scholar
- Kaltenbrunner A, Scellato S, Volkovich Y, Laniado D, Currie D, Jutemar EJ et al. (2012) Far from the eyes, close on the web: impact of geographic distance on online social interactions. In: Proceedings of ACM SIGCOMM workshop on online social networks (WOSN ’12). ACM, New York Google Scholar
- Albert R, Jeong H, Internet BAL (1999) Diameter of the world-wide web. Nature 401(6749):130-131 View ArticleGoogle Scholar
- Venkatesh V, Morris MG (2000) Why don’t men ever stop to ask for directions? Gender, social influence, and their role in technology acceptance and usage behavior. MIS Q 24(1):115-139 View ArticleGoogle Scholar
- Muscanell NL, Guadagno RE (2012) Make new friends or keep the old: gender and personality differences in social networking use. Comput Hum Behav 28(1):107-112 View ArticleGoogle Scholar
- Hill BM, Shaw A (2013) The Wikipedia gender gap revisited: characterizing survey response bias with propensity score estimation. PLoS ONE 8(6):e65782 View ArticleGoogle Scholar
- Reeder HM (2003) The effect of gender role orientation on same-and cross-sex friendship formation. Sex Roles 49(3-4):143-152 View ArticleGoogle Scholar
- Friedkin N (1980) A test of structural features of Granovetter’s strength of weak ties theory. Soc Netw 2(4):411-422 View ArticleGoogle Scholar
- Gilbert E, Karahalios K (2009) Predicting tie strength with social media. In: Proc. CHI Google Scholar
- Gilbert E, Karahalios K, Sandvig C (2008) The network in the garden: an empirical analysis of social media in rural life. In: Proc. CHI Google Scholar
- Krackhardt D, Kilduff M (1999) Whether close or far: social distance effects on perceived balance in friendship networks. J Pers Soc Psychol 76(5):770-782 View ArticleGoogle Scholar
- Granovetter MS (1973) The strength of weak ties. Am J Sociol 78(6):1360-1380 View ArticleGoogle Scholar
- Palchykov V, Kaski K, Kertész J, Barabási AL, Dunbar RI (2012) Sex differences in intimate relationships. Sci Rep 2:370 View ArticleGoogle Scholar
- Newman ME, Park J (2003) Why social networks are different from other types of networks. Phys Rev E 68(3):036122 View ArticleGoogle Scholar
- Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’ networks. Nature 393(6684):440-442 View ArticleGoogle Scholar
- Soffer SN, Vazquez A (2005) Network clustering coefficient without degree-correlation biases. Phys Rev E 71(5):057101 View ArticleGoogle Scholar
- Benenson JF (1993) Greater preference among females than males for dyadic interaction in early childhood. Child Dev 64(2):544-555 View ArticleGoogle Scholar
- Parker JG, Seal J (1996) Forming, losing, renewing, and replacing friendships: applying temporal parameters to the assessment of children’s friendship experiences. Child Dev 67(5):2248-2268 View ArticleGoogle Scholar
- Zillien N (2008) Digitale Ungleichheit. Springer, Berlin Google Scholar
- Boyd D (2014) It’s complicated: the social lives of networked teens. Yale University Press, New Haven Google Scholar