Publication Details
Cheap Rendering vs. Costly Annotation: Rendered Omnidirectional Dataset of Vehicles
Šlosár Peter (DCGM FIT BUT)
Juránek Roman, Ing., Ph.D. (DCGM FIT BUT)
Herout Adam, prof. Ing., Ph.D. (DCGM FIT BUT)
Realistic Rendering, Dataset of Vehicles, Omnidirectional Views, Computer Vision, Object Detection
Detection of vehicles in traffic surveillance requires large, high-quality training datasets in order to achieve competitive detection rates. We present an approach to the automatic synthesis of custom datasets that simulates the major influencing factors: viewpoint, camera parameters, sunlight, surrounding environment, etc. Our goal is to create a competitive vehicle detector which "has not seen a real car before." We use Blender as the modeling and rendering engine. We created a suitable scene graph, accompanied by a set of scripts, that allows simple configuration of the synthesized dataset. The generator is also capable of storing a rich set of metadata that serves as annotations of the synthesized images. We synthesized several experimental datasets and evaluated their statistical properties in comparison to real-life datasets. Most importantly, we trained a detector on the synthetic data; its detection performance is comparable to that of a detector trained on a state-of-the-art real-life dataset. Synthesis of a 10,000-image dataset takes only several hours, which is far more efficient than manual annotation, and it avoids the possibility of human error in annotation.
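The paper's generator is not reproduced here; the following Blender Python sketch only illustrates the general idea of sweeping a camera over viewpoints around a vehicle and storing the rendering parameters as machine-generated annotations. It is not the authors' tool: the object name "Camera", the output path, the viewpoint grid, and the assumption that the camera is aimed at the vehicle (e.g. via a Track To constraint) are all placeholders.

# Minimal sketch, assuming a Blender scene with a vehicle at the origin and a
# camera object named "Camera" constrained to point at it (hypothetical setup).
import bpy, json, math, os

scene = bpy.context.scene
cam = bpy.data.objects["Camera"]
out_dir = "/tmp/rendered_vehicles"        # hypothetical output directory
os.makedirs(out_dir, exist_ok=True)

annotations = []
distance = 8.0                            # camera distance in Blender units
for azimuth in range(0, 360, 30):         # viewpoints around the vehicle
    for elevation in (10, 25, 40):        # degrees above the ground plane
        az, el = math.radians(azimuth), math.radians(elevation)
        # place the camera on a sphere around the vehicle
        cam.location = (distance * math.cos(az) * math.cos(el),
                        distance * math.sin(az) * math.cos(el),
                        distance * math.sin(el))
        name = "car_az%03d_el%02d.png" % (azimuth, elevation)
        scene.render.filepath = os.path.join(out_dir, name)
        bpy.ops.render.render(write_still=True)
        # record the viewpoint and camera parameters as annotation metadata
        annotations.append({"image": name,
                            "azimuth_deg": azimuth,
                            "elevation_deg": elevation,
                            "distance": distance,
                            "focal_length_mm": cam.data.lens})

with open(os.path.join(out_dir, "annotations.json"), "w") as f:
    json.dump(annotations, f, indent=2)

Such a script would typically be run headlessly, e.g. blender --background vehicle_scene.blend --python render_views.py, so that thousands of annotated images can be produced without manual interaction.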
@INPROCEEDINGS{FITPUB10646,
   author = "Peter \v{S}los\'{a}r and Roman Jur\'{a}nek and Adam Herout",
   title = "Cheap Rendering vs. Costly Annotation: Rendered Omnidirectional Dataset of Vehicles",
   pages = "105--112",
   booktitle = "Proceedings of Spring Conference on Computer Graphics",
   year = 2014,
   location = "Smolenice, SK",
   publisher = "Comenius University in Bratislava",
   ISBN = "978-80-223-3601-7",
   doi = "10.1145/2643188.2643191",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/10646"
}