Publication Details
Brno University of Technology at MediaEval 2011 Genre Tagging Task
Řezníček Ivo, Ing. (DCGM FIT BUT)
Behúň Kamil, Ing. (FIT BUT)
genre recognition, bag of words, SIFT, local features, SVM, classification, classifier fusion
This paper briefly describes our approach to the video genre tagging task which was a part of MediaEval 2011. We focused mainly on visual and audio information, and we exploited metadata and automatic speech transcripts
only in a very basic way. Our approach relied on classification and on classifier fusion to combine
different sources of information. We did not use any additional training data except the very small
exemplary set provided by MediaEval (only 246 videos). The best performance was achieved by metadata alone.
Combination with the other sources of information did not improve results in the submitted runs. This was achieved
later by choosing more suitable weights in fusion. Excluding the metadata,
audio and video gave better results than speech transcripts. Using classifiers for 345 semantic classes
from TRECVID 2011 semantic indexing (SIN) task to project the data worked better than classifying directly from video and audio features.
@INPROCEEDINGS{FITPUB9851, author = "Michal Hradi\v{s} and Ivo \v{R}ezn\'{i}\v{c}ek and Kamil Beh\'{u}\v{n}", title = "Brno University of Technology at MediaEval 2011 Genre Tagging Task", pages = "1--2", booktitle = "Working Notes Proceedings of the MediaEval 2011 Workshop", journal = "CEUR Workshop Proceedings", number = 9, year = 2011, location = "Pisa, Italy, IT", publisher = "CEUR-WS.org", ISSN = "1613-0073", language = "english", url = "https://www.fit.vut.cz/research/publication/9851" }