

Table 5 List of features used in the prediction of a directed link between generic users u and v, along with their description

From: Link creation and information spreading over social and communication ties in an interest-based online social network

| Feature | Description | Rank |
| --- | --- | --- |
| Location | Binary attribute: whether u and v are in the same city | 14 |
| Gender | Binary attribute: whether u and v have the same gender | 15 |
| Age | Absolute difference of ages | 12 |
| Library | Cosine similarity between library vectors | 5 |
| Groups | Cosine similarity between group membership vectors | 7 |
| Group size | Size of the smallest group the two users have in common | 6 |
| Vocabulary | Cosine similarity between sets of tags used | 16 |
| Contact list | Cosine similarity of the vectors of social contacts | 2 |
| Outdegree | Sum of the out-degrees, $k_{out}(u) + k_{out}(v)$ | 11 |
| Preferential attachment | Product of the out-degrees, $k_{out}(u)\,k_{out}(v)$ | 13 |
| Common neighbors | Number of common neighbors, directed case: $CN(u,v) = \lvert \Gamma_{out}(u) \cap \Gamma_{in}(v) \rvert$ | 4 |
| Triangle overlap | $\frac{CN(u,v)}{\lvert \Gamma_{out}(u) \rvert}$ | 1 |
| Reciprocation | Binary attribute: whether the inverse link (v,u) is already present | 9 |
| Resource allocation | $\sum_{z \in \Gamma_{out}(u) \cap \Gamma_{in}(v)} \frac{1}{k_{out}(z)}$ [37] | 3 |
| Local path | Linear combination of common neighbors and common distance-2 neighbors, $CN + \epsilon\, CN_2$ [37] | 10 |
| Weighted flow | $wf(u,v) = CN(u,v) + \frac{\sum_{x \in CN(u,v)} \min(w(u,x),\, w(x,v))}{CN(u,v)}$ | 8 |
  1. $\Gamma_{in/out}(u)$ denotes the set of u's in/out neighbors, $k_{out}(u) = \lvert \Gamma_{out}(u) \rvert$, and $w(x,y)$ is the weight of the tie between x and y. The rank reported is the result of the chi-squared attribute selection method applied to our test set; the bold font of the rank indicates that the corresponding feature has been selected for the restricted feature set.
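
As an illustration of how the topological features in the table could be computed for a candidate pair (u, v), here is a minimal Python sketch. It is not the authors' implementation. The toy graph, the function names, and the set-based cosine similarity are assumptions made for this example. The contact-list feature is approximated by treating each user's out-neighbor set as a binary vector, which the table does not specify. The resource allocation sum runs over the directed common neighbors $\Gamma_{out}(u) \cap \Gamma_{in}(v)$.

```python
from math import sqrt

# Hypothetical toy graph: out_nbrs[x] is the set of users that x links to.
out_nbrs = {
    "u": {"a", "b", "c"},
    "a": {"v", "b"},
    "b": {"v"},
    "c": {"u"},
    "v": {"u"},
}

def in_neighbors(graph, node):
    """Gamma_in(node): users that have a link pointing to `node`."""
    return {x for x, targets in graph.items() if node in targets}

def k_out(graph, node):
    """Out-degree k_out(node) = |Gamma_out(node)|."""
    return len(graph.get(node, set()))

def common_neighbors(graph, u, v):
    """Directed common neighbors: Gamma_out(u) intersected with Gamma_in(v)."""
    return graph.get(u, set()) & in_neighbors(graph, v)

def triangle_overlap(graph, u, v):
    """CN(u, v) divided by |Gamma_out(u)|; 0.0 if u has no out-links."""
    degree = k_out(graph, u)
    return len(common_neighbors(graph, u, v)) / degree if degree else 0.0

def resource_allocation(graph, u, v):
    """Sum of 1 / k_out(z) over the directed common neighbors z."""
    # Each z links to v, so k_out(z) >= 1 and the division is safe.
    return sum(1.0 / k_out(graph, z) for z in common_neighbors(graph, u, v))

def cosine_sets(a, b):
    """Cosine similarity of two sets viewed as binary vectors (assumption)."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))

def pair_features(graph, u, v):
    """Collect a subset of the table's features for the ordered pair (u, v)."""
    return {
        "outdegree": k_out(graph, u) + k_out(graph, v),
        "preferential_attachment": k_out(graph, u) * k_out(graph, v),
        "common_neighbors": len(common_neighbors(graph, u, v)),
        "triangle_overlap": triangle_overlap(graph, u, v),
        "reciprocation": u in graph.get(v, set()),
        "resource_allocation": resource_allocation(graph, u, v),
        "contact_list": cosine_sets(graph.get(u, set()), graph.get(v, set())),
    }

if __name__ == "__main__":
    for name, value in pair_features(out_nbrs, "u", "v").items():
        print(f"{name}: {value}")
```

On this toy graph, u and v share the two common neighbors a and b, so the triangle overlap is 2/3 and the resource allocation score is 1/2 + 1 = 1.5. The weighted flow and local path features are left out because they need tie weights and distance-2 neighborhoods that the toy graph does not model.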