2.1 Reciprocity and career impact
Figure 1 shows the frequency of both the number of directed citations (i.e., the number of times \(c_{ij}\) an author i has cited another author j) and the number of reciprocated citations (i.e., \(c_{ij}^{\leftrightarrow }= \min (c_{ij},c_{ji})\)) across all pairs of authors active in the APS dataset from 1950 to 2010. As it can be seen, both distributions show a markedly heavy-tailed behaviour. In particular, there are more than fifty thousand pairs of authors with 10 reciprocated citations, and more than two thousand pairs with 50 or more reciprocated citations. Overall, more than 21% of the citations in the dataset are reciprocated (roughly 8.5 millions out of 40.4 millions).
The sheer scale of the aforementioned phenomenon suggests that reciprocated citations might play a key role in shaping an author’s scientific impact. This, in turn, poses the question whether authors with a history of systematic reciprocity might, on average, outperform their peers. In this section we investigate citation patterns at the level of individual authors in order to answer such a question. We do so by proposing a null network model ensemble (see Fig. 2 and Methods) to estimate the average baseline level of reciprocity \(\langle \rho _{\mathrm{null}}^{(i)} \rangle \) one should expect in the author citation networks under partially random interactions, which we then use to measure excess reciprocity.
The rationale of the definition in (1) is to discount density-related effects. Indeed, simply measuring reciprocity as the fraction of reciprocated weight \(\rho _{0}^{(i)}\) typically leads to seemingly high (low) values in dense (sparse) networks. The measure in Eq. (1) takes care of such potential spurious effects by discounting the average reciprocity observed in a null model ensemble, so that positive (negative) values of \(\rho ^{(i)}\) indicate authors whose citations have been received through an over-representation (under-representation) of reciprocated relationships, whereas values \(\rho ^{(i)} \simeq 0\) indicate levels of reciprocity compatible with the null assumption encoded in the null network model being used. In conclusion, excess reciprocity indirectly quantifies how much the academic impact of an author, as measured by her number of citations and h-index (which are both preserved by the null model, see Methods), relies on the citations from authors she cited as well.
We investigated the relationship between excess reciprocity and impact by following the career paths of authors with a traceable publication history of at least 20 years in the APS dataset. We first employed a variety of methodologies to predict a scientist’s future impact (in terms of citations) based on her previous history of excess reciprocity. In all cases we found very weak to no evidence of any predictive power (see Additional file 1, Supplementary Note 1), which strongly suggests that citation strategies based on the mere exchange of citations do not contribute to attracting higher numbers of citations in the future.
We then applied the k-means clustering algorithm [30] to categorize authors in terms of career impact. Following [12], we performed this analysis considering the career trajectories of all authors with 10 or more papers published over the course of at least 20 years (with at least one paper published every 5 years) who published their first paper either in 1950–1970 or in 1970–1990. We chose to group authors whose careers started over two decades in order to assemble two large enough samples. The downside of this is that we pool together authors whose careers started and developed during rather different historical periods in terms of scientific publication standards and practices. However, we verified that the results presented in the following do not change qualitatively when pooling authors based on shorter time spans.
We used these two samples to perform a k-means clustering analysis based on the cumulative number of citations received over time. Since several authors did not receive citations early in their career, we performed our analysis starting from the 4th career year. In Fig. 3 we present the results for 1970–1990 (see Additional file 1, Supplementary Note 2 for the results obtained for 1950–1970), which were obtained on a pool of 5070 scientists. We identified 4 distinct groups with very different levels of career impact, ranging from a small minority of authors (1.2% of the sample) who managed to attract several thousand citations over the time period considered, to the relative majority of authors (67.4% of the sample) who only enjoyed moderate to low impact (see Fig. 3 caption for more details).
We find the above groups to be associated with markedly different behaviours. Namely, we find long-term career impact to be associated with progressively lower levels of excess reciprocity. Indeed, the two most impactful groups are associated with the lowest long-run excess reciprocity, with the small cluster of elite scientists (group 1 in Fig. 3) displaying an average excess reciprocity around 0.1 towards year 20 of their career. Conversely, the two least impactful groups are associated with consistently higher levels of long-term excess reciprocity, higher than 0.2 in the case of the single least impactful group. We further corroborated the progressive development of differences between the four groups by running two-sided Kolmogorov-Smirnov tests between the distributions of excess reciprocity in each group at career years 4 and 20. The results are reported in Additional file 1, Supplementary Note 3, and show that at year 4 the null hypothesis of excess reciprocities being drawn from the same distributions can be rejected only when comparing the least impactful group of authors (group 4) with the other ones. In contrast, at year 20 the null hypothesis can be rejected for all pairs of groups.
In addition, we checked whether authors belonging to a certain group tend to publish more frequently in some APS journals rather than others. The results of this analysis are presented in Additional file 1, Supplementary Note 3, and show that authors in the most impactful groups (group 1 and 2) have a higher publication rate in Physical Review Letters (PRL), which is somewhat unsurprising since PRL is by far the most impactful venue among those considered here. Yet, a more nuanced picture emerges when looking at the remaining journals, as the most impactful clusters do not necessarily account for the relative majority of publications in the most impactful journals and vice versa. Moreover, while the ranking and behaviours in terms of excess reciprocity are similar across the two time periods we consider, it is interesting to notice that publication rates of the different groups across journals are rather different (see Additional file 1, Supplementary Note 2).
2.2 Shifts in citation patterns
In the previous section we analyzed the relationship between excess reciprocity and long-term impact from a cross-sectional point of view by “collapsing” together the career trajectories of several authors whose actual careers developed asynchronously over the span of a few decades. We now seek to further unpack this relationship by investigating temporal snapshots of the APS citation network, testing how an author’s propensity to reciprocate citations correlated with her impact during different historical periods.
We do so by performing analyses at the decade level. For each decade from the 1950s to the 1990s, we consider all authors whose career started before the end of such decade and did not end before the first year of that decade. We then pool all the papers published by such authors before the end of the decade, and their corresponding citations, to build the author citation network for the decade of interest.
Figure 4(a) shows, for three different decades, the average excess reciprocity of authors as a function of their accrued citations (see Additional file 1, Supplementary Note 7 for all six decades). As it can be seen, over time we observe the emergence of a clear negative correlation between an author’s impact and her excess reciprocity. In the 1950s the entire APS scientific community was essentially compatible with the null model, with average excess reciprocity lower than 0.05 for all groups of authors. This changes considerably from the 1970s onwards, and it becomes quite pronounced in the 2000s, with a very apparent negative relationship between an author’s impact and her tendency to reciprocate.
One might intuitively expect high impact authors to display, as a group, the lowest tendency to reciprocate. Indeed, in network terms, highly impactful academics simply do not have enough outgoing links to reciprocate their incoming links, i.e., they cannot provide enough citations to match the high number of citations they receive. While this is certainly true, as shown consistently for all decades in Fig. 4(a), there are subtler aspects to this observation.
First, let us recall that the definition in Eq. (1) measures the excess of reciprocity with respect to an expected baseline, which in our case is computed from a null model which preserves the heterogeneity (in terms of number of publications) and level of impact (both in terms of accrued citations and h-index) of each author. In this respect, the above result shows that high impact authors simply do not reciprocate much more than one could reasonably expect. Yet, a deeper analysis of the citations received by high impact authors reveals more substantial differences with respect to our null model. Indeed, while our null model naturally incorporates the low levels of excess reciprocity of high impact authors, it does not prescribe who the recipients of citations from them should be.
To investigate who the recipients are, we examine the level of interconnectedness among the leading authors with the highest citation counts in each decade by measuring the rich-club coefficient [31, 32] in the author citation networks (see Methods). The rich-club coefficient quantifies the tendency to preferentially establish relationships within a group with respect to the expected tendency based on a null hypothesis. In the present case, we measure the rich club coefficient as \(\phi (c) = \phi _{0}(c) / \langle \phi _{\mathrm{null}}(c) \rangle \), where \(\phi _{0}(c)\) is the fraction of the total number of citations flowing between authors that have received at least c citations in the empirical network (i.e., authors with an incoming weight equal to or larger than c), and \(\langle \phi _{\mathrm{null}}(c) \rangle \) is the corresponding quantity computed over our null network model ensemble.
We observe an increasingly pronounced rich-club effect among leading academics, with the effect being up to twice as strong with respect to the null model for authors with an incoming weight around 104 in the 2000s (Fig. 4(b)). Conversely, in earlier decades we find the effect to be much less strong, although still present (see also Additional file 1, Supplementary Note 7). This result indicates that, although the overall tendency of high impact authors to reciprocate is close to the one predicted by our null model, they overwhelmingly tend to cite their peers. The presence of such an interconnected rich core of successful scientists suggests that homophily with respect to career excellence has increasingly become one of the driving forces behind the attribution of citations.
2.3 Reciprocity, coauthorship and self-citations
We now shift our attention to the evolution of reciprocity at the aggregate level of the entire APS community. We straightforwardly generalize Eq. (1) to define a measure of network-wide excess reciprocity as \(\rho = (\rho _{0} - \langle \rho _{\mathrm{null}} \rangle ) / (1 - \langle \rho _{\mathrm{null}} \rangle )\), where \(\rho _{0}\) denotes the overall fraction of reciprocated weight in the empirical networks, whereas \(\langle \rho _{\mathrm{null}} \rangle \) denotes the corresponding average quantity in the null network ensemble. We track such quantity over time by considering annual networks constructed by including all papers published by active authors up to the year under analysis. We consider an author to be active whenever the year under consideration is between the first and last of her career.
During the entire period of study we systematically observe positive values of network-wide excess reciprocity, indicating a stronger propensity of the APS community to reciprocate citations than the one expected in our null model ensemble. Furthermore, we find reciprocity to increase steadily (and roughly linearly) up to the early 1990s, after which it stabilizes around 0.15 (Fig. 5(a) and Additional file 1, Supplementary Note 8).
A closer look reveals that, over the entire period of observation, a substantial proportion of the overall reciprocity \(\rho _{0}\) is accounted for by citations between coauthors. Such proportion grows from about 40% in the 1950s to about 50% in the 1990s. This is in contrast with the expected proportion computed in the null model, which instead shows a steady decline over time (Fig. 5(b)).
In order to better understand the impact of citations from coauthors on a scientist’s career, we pool all authors over the entire period of observation and compare the tendency to reciprocate between coauthors and non-coauthors. Namely, we define the reciprocity between a pair of authors i and j as the number of reciprocated citations between them, divided by the total number of citations received by both authors. In Fig. 5(c) we show such quantity as a function of the distance between research interests, quantified by the Jaccard similarity index between the sets of papers cited by a pair of authors over their career [33]. Higher Jaccard indices indicate higher proportions of common references used by both authors, which we interpret as a proxy for a substantial overlap of research interests. As one would intuitively expect, we observe an overall positive correlation between research interests and the tendency to reciprocate citations. However, on average we find this relationship to be stronger in the case of coauthors, regardless of the specific level of proximity between research interests.
Let us conclude this section with a short digression devoted to the investigation of self-citations through the lens of our null model. Figure 5(d) compares the observed fraction of self-citations with the corresponding expected proportion in our null model ensemble over time. As it can be seen, the empirical rate of self-citation has remained around a fairly stable level of around 20% (which decreases to roughly 18% when including authors no longer active in the time frame under consideration). Yet, the null model predicts a sharp downward trend, which, as in the case of reciprocity between coauthors, highlights a growing gap between empirical citation patterns and those expected under our null hypothesis. Interestingly, the aforementioned rate of self-citation is much larger than those observed in citation datasets from other disciplines (e.g., Law, Political Science, Mathematics), which in most cases are between 5% and 12% [34].
2.4 Robustness checks
We tested the robustness of our main findings in a number of ways, in order to rule out spurious effects due to possible confounding factors. First, following [35] we modified our null model in order to account for modularity-related effects in the author network, i.e., that authors belonging to the same scientific sub-communities can be naturally expected to cite each other at an above average rate. To this end, we used two popular community detection algorithms (InfoMap [36] and the modularity-based algorithm published in [37]) in order to extract the community structure of the author network at different granularity levels (see Additional file 1, Supplementary Note 4), and constrained our null model to partially preserve it (see Methods).
Second, we proceeded to discount self-citation as a potential confounding factor in our analyses. Indeed, pairs of past coauthors both citing their own work naturally give rise to reciprocated citations. We therefore ran our analyses after removing all self-citations in the paper network (which, as shown in the bottom right panel of Fig. 5, roughly amount to 20% of all citations in the dataset). The main results are shown in Fig. 6, and, as it can seen, are very much in line with those obtained from the full dataset. Indeed, we still observe a clear negative relationship between excess reciprocity and impact (left panel), whose strength becomes clearer in more recent decades (middle panel), in analogy with the results reported in the right panel of Fig. 3 and the left panel of Fig. 4. Also, in the right panel we see a pattern in the evolution of network-wide excess reciprocity similar to the one observed in the full dataset (top-left panel of Fig. 5), albeit with systematically lower annual values. Based on these results, we can safely conclude that self-citations do produce some extra excess reciprocity, but do not represent its main driver.
Third, we controlled for the effect of productivity in a number of ways. We did so by repeating our analysis after clustering the authors based on the number of citations received per paper (rather than on absolute citation counts), and after removing authors with low productivity and impact (i.e., those with a total of less than 10 citations over their first 20 career years, see Methods). In the same spirit, we also controlled for the presence of large collaborations by repeating our analyses after further restricting our sample to papers published by three or less authors.
Fourth, we restricted our analyses to US-based authors only in order control for potential geographical biases, and to assess the robustness of our results with respect to the name disambiguation procedure we used to identify authors (see Additional file 1, Supplementary Note 4).
In all the above cases we still detected the same negative relationship between excess reciprocity and impact based on the clusters of authors identified with k-means. As a further robustness check, we tested such relationship when separating authors based on different clustering criteria, i.e., we also grouped authors based on quartiles and with the Expectation-Maximization clustering algorithm [38]. In both cases, we still detected the same negative relationship (see Additional file 1, Supplementary Note 4).
Lastly, we tested such relationship from the opposite perspective, i.e., by grouping authors based on excess reciprocity and then measuring the impact of different groups. We resorted to matched pair analysis, and divided the authors whose careers started in 1970-1990 into “high reciprocators” (treatment) and “low reciprocators” (control) groups based on their excess reciprocity pattern over the first 10 career years, and performed a t-test on the average number of citations attracted by authors in the two groups over the following 10 career years after pairing them based on productivity (i.e., on the number of papers published in the first 10 years). Consistently with our results based on clustering, we found the treatment group to attract substantially less citations per author (272.2) than the control group (331.6), with \(p < 0.001\) (see Additional file 1, Supplementary Note 6).