Skip to main content

Table 5 Most relevant words for each class in dataset SENS2

From: Analysis and classification of privacy-sensitive content in social media posts

Sensitive

Non-sensitive

Overall rank

Word

Overall count

Relative frequency

Overall rank

Word

Overall count

Relative frequency

22

home

33.40 ± 5.74

88.05 ± 5.62

8

love

45.00 ± 7.77

59.86 ± 5.92

26

tomorrow

31.40 ± 4.58

80.36 ± 6.81

10

one

44.50 ± 7.46

56.72 ± 8.67

29

tonight

30.20 ± 5.49

75.78 ± 5.95

19

need

33.70 ± 3.71

56.52 ± 4.91

27

week

30.90 ± 3.00

75.17 ± 5.40

6

like

55.50 ± 7.79

55.10 ± 5.52

9

back

44.70 ± 7.01

74.95 ± 7.04

13

new

40.50 ± 5.32

52.94 ± 6.83

5

work

56.80 ± 6.03

74.46 ± 4.95

20

make

33.70 ± 7.45

52.60 ± 8.10

15

night

37.30 ± 6.86

71.56 ± 7.04

16

think

36.30 ± 5.87

52.57 ± 8.28

1

go

97.40 ± 9.75

67.61 ± 5.45

25

cant

31.60 ± 6.33

48.57 ± 6.53

12

today

42.30 ± 4.79

66.66 ± 5.14

7

time

51.40 ± 8.41

46.59 ± 7.41

0

propnam

123.10 ± 11.53

65.36 ± 4.62

14

good

40.00 ± 7.82

46.02 ± 7.43

4

im

58.20 ± 6.53

62.80 ± 6.88

28

happi

30.30 ± 5.68

45.41 ± 10.64

3

day

72.80 ± 10.27

62.52 ± 6.28

11

want

44.40 ± 4.53

45.02 ± 9.49

17

feel

34.00 ± 6.65

62.38 ± 5.86

24

come

32.40 ± 5.44

42.50 ± 6.18

18

see

33.90 ± 6.30

60.74 ± 5.49

23

know

33.40 ± 7.44

41.55 ± 10.81

2

get

80.50 ± 6.60

58.63 ± 2.28

21

got

33.60 ± 8.10

41.46 ± 8.29

21

got

33.60 ± 8.10

58.54 ± 8.29

2

get

80.50 ± 6.60

41.37 ± 2.28

23

know

33.40 ± 7.44

58.45 ± 10.81

18

see

33.90 ± 6.30

39.26 ± 5.49

24

come

32.40 ± 5.44

57.50 ± 6.18

17

feel

34.00 ± 6.65

37.63 ± 5.86

11

want

44.40 ± 4.53

54.98 ± 9.49

3

day

72.80 ± 10.27

37.48 ± 6.28

28

happi

30.30 ± 5.68

54.60 ± 10.64

4

im

58.20 ± 6.53

37.20 ± 6.88