Publication details
Content-based Copy Detection
Hradiš Michal, Ing., Ph.D. (UPGM FIT VUT)
Otrusina Lubomír, Ing. (UPGM FIT VUT)
Řezníček Ivo, Ing. (UPGM FIT VUT)
This paper describes our approach to semantic indexing and content-based copy detection, which was used in the TRECVID 2010 evaluation.
Semantic indexing
1. The runs differ in the types of features used. Every run uses several bag-of-words representations, each fed to a separate linear SVM, and the SVM outputs were fused by logistic regression. Besides visual features, we used audio features and metadata. We also added contextual features extracted from the whole video from which a shot originated.
- F_A_brno.run1 (run1) - Only visual information: dense sampling and the Harris-Laplace detector with SIFT and RGB-SIFT descriptors.
- F_A_brno.run2 (run2) - The same as run1, with added audio and metadata features.
- F_A_brno.run3 (run3) - The same as run2, with added contextual features extracted from the whole video.
2. Audio and metadata significantly improve the results. An even greater improvement was achieved by using the contextual features.
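The late-fusion step described above (separate per-feature SVMs whose outputs are combined by logistic regression) can be illustrated with a minimal sketch. It is not the authors' implementation: the SVM decision values are made-up numbers, the three channels (visual, audio, metadata) are only an example, and the names fit_fusion and fuse, the learning rate, and the number of epochs are assumptions chosen for illustration.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_fusion(scores, labels, lr=0.1, epochs=500):
    """Fit logistic-regression weights over per-channel SVM scores.

    scores: list of rows, one SVM decision value per feature channel
    labels: list of 0/1 ground-truth labels (concept absent/present)
    Plain batch gradient descent; lr and epochs are illustrative choices.
    """
    n = len(scores)
    n_channels = len(scores[0])
    w = [0.0] * n_channels
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_channels
        grad_b = 0.0
        for x, y in zip(scores, labels):
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
            err = p - y  # gradient of the log-loss w.r.t. the logit
            for j in range(n_channels):
                grad_w[j] += err * x[j]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def fuse(w, b, x):
    """Fused probability that the concept is present in a shot."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


# Hypothetical decision values from three per-feature linear SVMs
# (columns: visual, audio, metadata) for six training shots.
train_scores = [
    [1.2, 0.8, 0.1],
    [0.9, 1.1, -0.2],
    [0.7, 0.2, 0.4],
    [-1.0, -0.6, 0.3],
    [-0.8, -1.2, -0.1],
    [-0.5, -0.3, -0.4],
]
train_labels = [1, 1, 1, 0, 0, 0]

w, b = fit_fusion(train_scores, train_labels)
p_pos = fuse(w, b, [1.0, 0.9, 0.0])    # shot the SVMs mostly agree is positive
p_neg = fuse(w, b, [-1.0, -0.9, 0.0])  # shot the SVMs mostly agree is negative
```

In this kind of fusion, adding a feature type (as run2 and run3 do) amounts to adding a column to the score matrix, and the learned weights indicate how much the fused decision relies on each channel.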
Content-based Copy Detection
1. One run was submitted in two versions (they differ only in the relevance threshold setting).
- brnoccd: SIFT and SURF combination, bag-of-words (visual codebook: 100k size, 4 nearest neighbors used in soft-assignment), inverted file index, geometry (homography) based image similarity metric
2. What if any significant differences (in terms of what measures) did you find among the runs?
- Only one setting was used, so there are no differences.
3. Based on the results, can you estimate the relative contribution of each component of your system/approach to its effectiveness?
- Search in the reference dataset is slow because of poor indexing effectiveness.
4. Overall, what did you learn about runs/approaches and the research question(s) that motivated them?
- The way the video content is described should change: a frame-based (or key-frame-based) approach is not sufficient.
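The indexing part of the brnoccd run (bag-of-words with soft assignment to the 4 nearest codewords, stored in an inverted file) can be sketched as follows. This is a toy illustration under stated assumptions, not the submitted system: descriptors are 2-D points instead of SIFT/SURF vectors, the codebook has 6 words instead of 100k, the Gaussian weighting kernel and its sigma are assumptions, all function names are hypothetical, and the homography-based geometric similarity step is left out.

```python
import math
from collections import defaultdict


def soft_assign(descriptor, codebook, k=4, sigma=1.0):
    """Spread one descriptor over its k nearest codewords.

    Returns {codeword_id: weight} with weights summing to 1.
    The Gaussian kernel is an assumed weighting scheme.
    """
    nearest = sorted(
        (math.dist(descriptor, word), idx) for idx, word in enumerate(codebook)
    )[:k]
    d_min = nearest[0][0]
    # Weights are taken relative to the nearest codeword to avoid underflow.
    raw = {
        idx: math.exp(-(d * d - d_min * d_min) / (2.0 * sigma * sigma))
        for d, idx in nearest
    }
    total = sum(raw.values())
    return {idx: wgt / total for idx, wgt in raw.items()}


def bow_histogram(descriptors, codebook, k=4):
    """Accumulate soft assignments of all descriptors of one frame."""
    hist = defaultdict(float)
    for desc in descriptors:
        for idx, wgt in soft_assign(desc, codebook, k).items():
            hist[idx] += wgt
    return dict(hist)


def build_inverted_index(frames, codebook, k=4):
    """Map each codeword id to the (frame_id, weight) pairs that contain it."""
    index = defaultdict(list)
    for frame_id, descriptors in frames.items():
        for idx, wgt in bow_histogram(descriptors, codebook, k).items():
            index[idx].append((frame_id, wgt))
    return index


def query(index, descriptors, codebook, k=4):
    """Rank reference frames by the dot product of soft BoW histograms.

    Only posting lists of codewords present in the query are visited.
    """
    scores = defaultdict(float)
    for idx, q_wgt in bow_histogram(descriptors, codebook, k).items():
        for frame_id, r_wgt in index.get(idx, []):
            scores[frame_id] += q_wgt * r_wgt
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy codebook and reference key-frames (all values are made up).
codebook = [(0, 0), (1, 0), (0, 1), (1, 1), (10, 10), (9, 10)]
reference_frames = {
    "ref_a": [(0.1, 0.0), (0.9, 0.1)],
    "ref_b": [(9.9, 10.0), (9.1, 9.9)],
}
index = build_inverted_index(reference_frames, codebook)

weights = soft_assign((0.1, 0.1), codebook)
ranked = query(index, [(0.1, 0.1), (1.0, 0.2)], codebook)
```

In a full system, the top-ranked candidates from such an index would then be re-scored by a geometric check, which is where the homography-based similarity mentioned above would come in.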
@INPROCEEDINGS{FITPUB9841,
   author = "V\'{i}t\v{e}zslav Beran and Michal Hradi\v{s} and Lubom\'{i}r Otrusina and Ivo \v{R}ezn\'{i}\v{c}ek",
   title = "Content-based Copy Detection",
   pages = "1--10",
   booktitle = "2011 TREC Video Retrieval Evaluation Notebook Papers",
   year = 2011,
   location = "Gaithersburg, MD, US",
   publisher = "National Institute of Standards and Technology",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9841"
}