Publication Details

Interactive Mining on Hierarchical Data

CHMELAŘ Petr and STRYKA Lukáš. Interactive Mining on Hierarchical Data. In: Proceedings of the 13th Conference STUDENT EEICT 2007 Volume 4. Brno: Brno University of Technology, 2007, pp. 410-414. ISBN 978-80-214-3410-3.
Czech title
Interaktivní dolování dat nad hierarchickými daty
Type
conference paper
Language
English
Authors
Chmelař Petr, Ing. (DIFS FIT BUT)
Stryka Lukáš, Ing. (DIFS FIT BUT)
Keywords

interactive, intuitive, on-line, data mining, OLAP, data warehouse, association, characterization, classification, non-naïve Bayesian classification, UML-notation-based presentation

Abstract

In this paper, we propose a framework for interactive, iterative, and intuitive mining of multilevel association, characterization, and classification rules on data organized in multi-level conceptual hierarchies. The framework is called OLAM SE (Self Explaining On-Line Analytical Mining) and is proposed as an extension of OLAP or as an alternative to Han's OLAM. OLAM processes data stored in data cubes whose structure is based on a given conceptual hierarchy. OLAM SE determines the minimum support value from a user-defined cover value of the data, using the entropy coding principle. It also automatically determines a maximum threshold, to avoid explaining knowledge that is obvious and therefore potentially uninteresting. The major part of the data is thus described by frequent patterns. The presentation of results is inspired by UML diagram notation. It consists of a graph whose nodes are frequent data sets, represented as packages that include sub-packages (data classes or items), and whose edges represent relations or patterns between packages. This representation could also be applicable to characterization and to non-naïve Bayesian classification. The user can explore the patterns interactively, obtaining a detailed view of the attractive ones, and can intuitively steer the process of obtaining more detailed knowledge.
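The idea of keeping only patterns between a minimum support and a maximum threshold, at more than one level of a conceptual hierarchy, can be illustrated with a minimal sketch. It is not the OLAM SE implementation: the hierarchy, the transactions, the function names, and the threshold values are all hypothetical, and both thresholds are passed in as plain parameters rather than derived from a cover value by entropy coding, since the abstract does not give that derivation.

```python
from collections import Counter

# Hypothetical two-level concept hierarchy: item -> parent category.
HIERARCHY = {
    "milk": "dairy",
    "cheese": "dairy",
    "bread": "bakery",
    "bagel": "bakery",
}

# Hypothetical transactions, each a set of low-level items.
TRANSACTIONS = [
    {"milk", "bread"},
    {"milk", "cheese"},
    {"bread", "cheese"},
    {"milk", "bagel"},
]


def generalize(transaction, hierarchy):
    """Map each item to its parent category (one level up the hierarchy)."""
    return {hierarchy[item] for item in transaction}


def support(transactions):
    """Relative support: fraction of transactions that contain each element."""
    counts = Counter()
    for transaction in transactions:
        counts.update(set(transaction))
    total = len(transactions)
    return {element: count / total for element, count in counts.items()}


def interesting(supports, min_support, max_support):
    """Keep elements that are frequent enough but not so frequent as to be obvious."""
    return {
        element: value
        for element, value in supports.items()
        if min_support <= value <= max_support
    }


# Supports at the item level and at the category level.
item_support = support(TRANSACTIONS)
category_support = support([generalize(t, HIERARCHY) for t in TRANSACTIONS])

# Assumed thresholds, chosen only for this illustration.
frequent_items = interesting(item_support, min_support=0.5, max_support=0.9)
frequent_categories = interesting(category_support, min_support=0.5, max_support=0.9)
```

In this toy data, "bagel" falls below the minimum support at the item level, while "dairy" occurs in every transaction and is filtered out by the maximum threshold at the category level, which mirrors the notion of dropping knowledge that is too obvious to be interesting.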

Published
2007
Pages
410-414
Proceedings
Proceedings of the 13th Conference STUDENT EEICT 2007 Volume 4
Conference
Student EEICT 2007, Brno, CZ
ISBN
978-80-214-3410-3
Publisher
Brno University of Technology
Place
Brno, CZ
BibTeX
@INPROCEEDINGS{FITPUB8319,
   author = "Petr Chmela\v{r} and Luk\'{a}\v{s} Stryka",
   title = "Interactive Mining on Hierarchical Data",
   pages = "410--414",
   booktitle = "Proceedings of the 13th Conference STUDENT EEICT 2007 Volume 4",
   year = 2007,
   location = "Brno, CZ",
   publisher = "Brno University of Technology",
   ISBN = "978-80-214-3410-3",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/8319"
}