GROTOAP: GROund Truth for Open Access Publications is a test set for training and performance evaluation of page segmentation and zone classification algorithms. Contains input articles in a digital form and corresponding ground truth files. The test set is based on articles obtained from DOAJ database published under CC-BY license. The whole test set is available under the same license.
For details, please see http://dx.doi.org/10.1145/2232817.2232901 (OA preprint: https://depot.ceon.pl/handle/123456789/1956).
URL of resource:
Domain:
Computer Science
Resource Type:
Sub-categories:
Software or tool development phase:
Software Access:
Parent organization:
CeON, ICM, University of Warsaw