Project Details
OCCAM - OCR, ClassificAtion & Machine Translation
Project Period: 1. 10. 2019 - 30. 9. 2021
Project Type: grant
Agency: European Comission EU
Program: Connecting Europe Facility (CEF)
OCR, Classification, Machine Translation
OCCAM (OCR, ClassificAtion & Machine Translation) responds to action line "Integration projects" on the integration (and extension) of CEF (Connecting Europe Facility) Automated Translation into multilingual digital cross-border services. The Action proposes the integration of image classification, Translation Memories (TMs), Optical Character Recognition (OCR), and Machine Translation (MT) to support the automated translation of scanned documents - a document type that currently cannot be processed by the CEF eTranslation service.
OCCAM will develop two use cases: (i) the Business Registers Interconnection System (BRIS) use case and (ii) the Digital Humanities use case.
Homoliak Ivan, Ing., Ph.D. (UITS FIT VUT)
Michal Bohumil, Ing. (CVT FIT VUT)
Mrazíková Libuše, Mgr. (Děkanát FIT VUT)
Musil Petr, Ing., Ph.D. (UPGM FIT VUT)
Najman Pavel, Ing., Ph.D. (UPGM FIT VUT)
Otáhalová Sylva (UPGM FIT VUT)
Pirová Zuzana, Ing. (Děkanát FIT VUT)
2021
- KIŠŠ Martin, BENEŠ Karel and HRADIŠ Michal. AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions. In: Lladós J., Lopresti D., Uchida S. (eds) Document Analysis and Recognition - ICDAR 2021. Lecture Notes in Computer Science, vol. 12824. Lausanne: Springer Nature Switzerland AG, 2021, pp. 463-477. ISBN 978-3-030-86336-4. Detail