GROTOAP: Ground Truth for Open Access Publications

GROTOAP (GROund Truth for Open Access Publications) is a test set for training and evaluating page segmentation and zone classification algorithms. It contains input articles in digital form and the corresponding ground truth files. The test set is based on articles obtained from the DOAJ database and published under the CC-BY license. The whole test set is available under the same license.

For details, please see (OA preprint:

Computer Science
Parent organization: CeON, ICM, University of Warsaw