OCR-D: Robust methods for layout analysis

Project objective

The project aims to improve the quality and robustness of layout analysis methods for historical documents in OCR-D so that they are suitable for practical use in mass digitisation. To this end, existing approaches will be optimised and extended, and promising new methods will be integrated. The main focus is the further development of complementary, AI-based methods for layout analysis. The developments are accompanied by an evaluation based on scientifically established metrics. In addition, all methods will be provided with appropriate interfaces and integrated into the overall OCR-D framework. This makes it possible to combine the methods flexibly to achieve the best possible results, and it ensures that they remain adaptable and future-proof with regard to new developments.

Project duration

2023 - 2025

Project participants

Third-party funding

Deutsche Forschungsgemeinschaft (DFG)

Contact

Clemens Neudecker
Generaldirektion
Tel.: +49 30 266 434 081
clemens.neudecker@sbb.spk-berlin.de

Read more about the project

The project is part of OCR-D: DFG-funded Initiative for Optical Character Recognition Development