Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases

OTRUSINA, L.; SMRŽ, P. Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases. Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017). Kolkata: 2017. pp. 523–532.

Type

conference paper

Language

English

Authors

Otrusina Lubomír, Ing., DCGM (FIT)
Smrž Pavel, doc. RNDr., Ph.D., DCGM (FIT)

Abstract

This paper deals with the semantic enrichment of textual resources by means of automatically generated named entity recognizers and linkers and advanced indexing and search mechanisms that can be integrated into various information retrieval and information extraction systems. It introduces a new system that transforms Wikipedia and other available sources into task-specific knowledge bases and employs contextual information to build state-of-the-art entity disambiguation components. Although some components are language-dependent (for example, those responsible for morphological analysis or semantic role identification), they can be easily replaced by existing tools that provide the corresponding functions. As a case study, we demonstrate an instantiation of the system for the task of semantic annotation of Czech bibliographic databases in the context of the CPK project. We particularly stress the role of problem-specific knowledge sources, which can be easily integrated into our system and play a key role in the success of the tool in real applications.

URL

http://www.aclweb.org/anthology/W/W17/W17-7563

Published

2017

Pages

523–532

Proceedings

Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017)

Conference

14th International Conference on Natural Language Processing

Place

Kolkata

BibTeX

@inproceedings{BUT168464,
  author="Lubomír {Otrusina} and Pavel {Smrž}",
  title="Semantic Enrichment Across Language: A Case Study of Czech Bibliographic Databases",
  booktitle="Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017)",
  year="2017",
  pages="523--532",
  address="Kolkata",
  url="http://www.aclweb.org/anthology/W/W17/W17-7563"
}

Departments

Department of Computer Graphics and Multimedia (DCGM)