Table 1 Datasets used. The aim is to select datasets that resemble those used in contemporary cost-sensitive prediction tasks and that have corresponding external datasets

From: Quantifying decision making for data science: from data acquisition to modeling

| Dataset | % Minority | Instances | Ext. data instances | Timestamps | Costs |
|---|---|---|---|---|---|
| Pendigits | 8.3 | 13,821 | simulated | simulated | simulated |
| Medicare | 12.9 | 611,785 | 853,360 | simulated | simulated |
| Open city data | 33.2 | 250,000 | 77 | actual | actual |