Skip to main content

Table A2 Summary statistics for our unlabeled dataset

From: CORAL: COde RepresentAtion learning with weakly-supervised transformers for analyzing data analysis

Number of Notebooks

118,762

Mean Number of Cells per Notebook

19.12

Mean Number of Lines of Code per Cell

3.81

Mean Number of Functions Used per Cell

2.08