Project Details


WSIS Prizes Contest 2019 Nominee

Evaluation and post-correction of OCR of digitised historical newspapers


Description

The goal of this research is to bring the digitized text closer to the original newspaper articles by applying post-correction. Post-correction involves improving digitized text quality by manipulating the textual output of the OCR process directly. The idea is that better quality data boosts eHumantities research. Although the quality of the KB newspaper data would definitely benefit from improving the OCR process itself (improved image recognition), post-correction will still be necessary, because the quality of historical newspapers is suboptimal for OCR (for example, due to poor paper and print quality).

Project website

https://www.esciencecenter.nl/project/deep-learning-ocr-post-correction


Images

Action lines related to this project
  • AL C3. Access to information and knowledge 2019
  • AL C7. E-science
Sustainable development goals related to this project
  • Goal 4: Quality education

Coverage
  • Netherlands

Status

Ongoing

Start date

2018

End date

Not set


WSIS values promotion

The idea is that better quality data boosts eHumantities research.


Entity name

The Netherlands eScience Center

Entity country—type

Netherlands Academia

Entity website

https://www.esciencecenter.nl

Partners

National Library of the Netherlands