Publication Details
Automatic Acquisition of Semantics-Extraction Patterns
parallel corpus, lexico-semantic patterns
This paper examines the use of parallel and comparable corpora for automatic acquisition of semantics-extraction patterns. It presents a new method of the pattern extraction which takes advantage of parallel texts to "port" text mining solutions from a source language to a target language. It is shown that the technique can help in situations when the extraction procedure is to be applied in a language (languages) with a limited set of available resources, e.g. domain-specific thesauri. The primary motivation of our work lies in a particular multilingual e-learning system. For testing purposes, other applications of the given approach were implemented. They include pattern extraction from general texts (tested on wordnet relations), acquisition of domain-specific patterns from large parallel corpus of legal EU documents, and mining of subjectivity expressions for multilingual opinion extraction system.
@INPROCEEDINGS{FITPUB8120, author = "Pavel Smr\v{z}", title = "Automatic Acquisition of Semantics-Extraction Patterns", pages = "1--4", booktitle = "Proceedings of the 5th International Conference on Language Resources and Evaluation", year = 2006, location = "Paris, FR", publisher = "European Language Resources Association", ISBN = "2-9517408-2-4", language = "english", url = "https://www.fit.vut.cz/research/publication/8120" }