Skip to main content
Figure 4 | EPJ Data Science

Figure 4

From: Uncovering the size of the illegal corporate service provider industry in the Netherlands: a network approach

Figure 4

Number of directors flagged and validated in our approach. (A) The nearest neighbor approach (red) identifies 3830 directors as potential CSPs, while the logistic regression (blue) identifies 6690 (3691 new ones). (B) Amongst the directors flagged by the nearest neighbors approach, 886 correspond to licensed CSPs, 330 to illegal CSPs (TP), 2056 to non-CSPs (FP), and we were not able to determine the status of 558 of them (Unk). Amongst the new directors flagged by the logistic regression approach, 12 correspond to licensed CSPs, 61 to illegal CSPs (TP), 3241 to non-CSPs (FP), and we were not able to determine the status of 389 of them (Unk). The estimates of false positives, true positives and unknowns were obtained using the Bayes rule with a uniform prior and a binomial likelihood. The median of the posterior distribution is displayed. (C) Confusion matrix with the overlap between the directors flagged by both algorithms

Back to article page