Table 1 Summary of the datasets used in this paper

From: Multilayer networks for text analysis with multiple data types

 

| | Wikipedia Dataset in Manuscript | Wikipedia Dataset in SI | E-mail Dataset in SI | Citation Dataset in SI |
|---|---|---|---|---|
| Nodes: | | | | |
| Documents | 120 | 316 | 4894 | 2542 |
| Word Types | 11,545 | 16,344 | 66,088 | 7677 |
| Metadata Tags | Physics, Maths, Biology | Statistics, Maths, Electrical Engineering | 0 | 52 Categories |
| Edges: | | | | |
| Hyperlinks | 309 | 1530 | 18,005 | 4590 |
| Word Tokens | 155,093 | 321,147 | 761,179 | 116,889 |
| Tag Labels | 120 | 316 | 0 | 2542 |