Skip to main content

Table 7 Most relevant words for each class in dataset OMC

From: Analysis and classification of privacy-sensitive content in social media posts

Sensitive

Non-sensitive

Overall rank

Word

Overall count

Relative frequency

Overall rank

Word

Overall count

Relative frequency

1

im

79.70 ± 7.07

71.51 ± 3.59

2

dont

75.30 ± 8.76

54.77 ± 7.02

29

year

30.10 ± 2.77

70.39 ± 12.45

17

your

39.20 ± 5.75

54.17 ± 7.17

20

much

34.30 ± 6.52

68.82 ± 6.39

25

way

31.40 ± 5.25

53.71 ± 7.79

26

friend

31.20 ± 7.15

67.46 ± 8.38

18

good

38.20 ± 4.92

53.52 ± 9.68

14

realli

43.50 ± 6.38

63.46 ± 7.29

27

that

30.50 ± 6.26

52.89 ± 6.82

23

work

33.10 ± 5.34

61.98 ± 13.41

24

tri

31.90 ± 6.81

50.95 ± 8.36

21

even

34.20 ± 4.59

61.95 ± 10.70

8

peopl

55.60 ± 5.93

49.88 ± 5.09

16

life

42.00 ± 6.94

61.80 ± 5.75

10

think

48.50 ± 4.09

48.99 ± 9.29

7

go

56.80 ± 6.61

59.78 ± 5.72

28

person

30.20 ± 6.32

48.53 ± 11.36

15

would

42.20 ± 7.96

59.32 ± 4.06

22

need

33.60 ± 4.81

47.45 ± 8.00

5

know

60.00 ± 4.74

58.69 ± 4.51

9

thing

55.40 ± 7.31

47.25 ± 5.91

4

feel

63.20 ± 7.15

57.82 ± 5.32

12

make

46.80 ± 5.73

46.65 ± 10.65

11

want

48.00 ± 8.10

57.72 ± 8.62

13

one

45.00 ± 9.49

44.14 ± 6.84

0

like

91.70 ± 8.15

57.22 ± 4.25

6

time

57.10 ± 6.87

43.91 ± 7.14

19

love

37.50 ± 6.02

56.39 ± 7.99

3

get

74.10 ± 7.32

43.85 ± 7.06

3

get

74.10 ± 7.32

56.16 ± 7.06

19

love

37.50 ± 6.02

43.61 ± 7.99

6

time

57.10 ± 6.87

56.09 ± 7.14

0

like

91.70 ± 8.15

42.78 ± 4.25

13

one

45.00 ± 9.49

55.86 ± 6.84

11

want

48.00 ± 8.10

42.28 ± 8.62

12

make

46.80 ± 5.73

53.35 ± 10.65

4

feel

63.20 ± 7.15

42.18 ± 5.32

9

thing

55.40 ± 7.31

52.75 ± 5.91

5

know

60.00 ± 4.74

41.31 ± 4.51