Skip to main content

Table 1 Summary of the datasets used in this paper

From: Multilayer networks for text analysis with multiple data types

  Wikipedia Dataset in Manuscript Wikipedia Dataset in SI E-mail Dataset in SI Citation dataset in SI
Nodes:     
Documents 120 316 4894 2542
Word Types 11,545 16,344 66,088 7677
Metadata Tags Physics, Maths, Biology Statistics, Maths, Electrical Engineering 0 52 Categories
Edges:     
Hyperlinks 309 1530 18,005 4590
Word Tokens 155,093 321,147 761,179 116,889
Tag Labels 120 316 0 2542