Product Details
Corpora Processing Software
Created: 2015
Czech title
Programy pro zpracování korpusů
Type
software
License
required - free
Authors
Doležal Jan, Ing. (FIT BUT)
Dytrych Jaroslav, Ing., Ph.D. (DCGM FIT BUT)
Karásek Miroslav, Ing. (FIT BUT)
Kouřil Jan, Ing. (DCGM FIT BUT)
Otrusina Lubomír, Ing. (DCGM FIT BUT)
Smrž Pavel, doc. RNDr., Ph.D. (DCGM FIT BUT)
Dytrych Jaroslav, Ing., Ph.D. (DCGM FIT BUT)
Karásek Miroslav, Ing. (FIT BUT)
Kouřil Jan, Ing. (DCGM FIT BUT)
Otrusina Lubomír, Ing. (DCGM FIT BUT)
Smrž Pavel, doc. RNDr., Ph.D. (DCGM FIT BUT)
Keywords
corpora, processing, indexing
Description
Set of programs for processing large text corpora. The programs transform data from the HTML format to a vertical text, its annotation at different levels and indexing in MG4J and Elastic.
Licence
Distributed under The Apache License Version 2.0 http://www.apache.org/licenses/LICENSE-2.0.txt
Projects
Research groups
Knowledge Technology Research Group (VZ KNOT)