Low resolution scanned text dataset for optical character recognition

From MaRDI portal



DOI: 10.5281/zenodo.3945525; Zenodo: 3945525; MaRDI QID: Q6717338

Dataset published at Zenodo repository.

Author name not available

Publication date: 15 July 2020

Copyright license: No records found.



A collection of scanned pages of English text designed for testing low-resolution OCR systems. The collection comprises 11 different texts of 5 pages each. Each of these 55 pages is typeset in 18 different fonts and then scanned at 300 dpi, producing a total of 990 pages of scanned text. Versions downsampled to 60 dpi and 75 dpi are also included.
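
The page counts and resolution ratios in the description can be checked with a few lines of Python. This is only a minimal sketch. The constant and function names are hypothetical, and the block-averaging routine is an assumed illustration of downsampling by an integer factor; the record does not state which resampling method was used to produce the 60 dpi and 75 dpi versions.

```python
# Figures taken from the dataset description.
TEXTS = 11           # distinct pieces of text
PAGES_PER_TEXT = 5   # pages in each piece
FONTS = 18           # fonts each page is typeset in
SCAN_DPI = 300       # resolution of the original scans


def page_counts():
    """Return (pages before font variation, total scanned pages)."""
    source_pages = TEXTS * PAGES_PER_TEXT
    scanned_pages = source_pages * FONTS
    return source_pages, scanned_pages


def scale_factor(target_dpi, scan_dpi=SCAN_DPI):
    """Integer linear reduction factor from scan_dpi down to target_dpi."""
    if scan_dpi % target_dpi != 0:
        raise ValueError("target_dpi does not divide scan_dpi evenly")
    return scan_dpi // target_dpi


def block_average(image, factor):
    """Downsample a grayscale image by averaging factor x factor blocks.

    `image` is a non-empty list of equal-length rows of integer pixel
    values. Assumption: this is one common downsampling approach, not
    necessarily the one used to build the dataset. Rows and columns that
    do not fill a complete block are dropped.
    """
    height = len(image) - len(image) % factor
    width = len(image[0]) - len(image[0]) % factor
    result = []
    for r in range(0, height, factor):
        row = []
        for c in range(0, width, factor):
            block = [image[r + i][c + j]
                     for i in range(factor)
                     for j in range(factor)]
            row.append(sum(block) // len(block))
        result.append(row)
    return result


if __name__ == "__main__":
    print(page_counts())
    print(scale_factor(60), scale_factor(75))
```

Both target resolutions divide 300 dpi evenly, so each low-resolution pixel corresponds to a whole block of scanned pixels: 5 x 5 at 60 dpi and 4 x 4 at 75 dpi.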





