Publication Details
A Game for Crowdsourcing Adversarial Examples for False Information Detection
adversarial data generation, false information detection, game with a purpose, human interaction task, machine learning
False information detection models are susceptible to adversarial attacks. Such susceptibility is a critical weakness of detection models. Automated creation of adversarial samples can ultimately help to augment training sets and create more robust detection models. However, automatically generated adversarial samples often do not preserve the meaning of the original text, leading to information loss. There is a need for adversarial sample generators that preserve the original meaning. To explore the properties such generators should have and to inform their future design, we conducted a study to collect adversarial samples from human agents using a Game with a Purpose (GWAP). The players' goal is to modify a given tweet until a detection model is tricked, thus creating an adversarial sample. We qualitatively analysed the collected adversarial samples and identified desired properties and strategies that a meaning-preserving adversarial generator should exhibit. These strategies were validated on transformer-based and LSTM-based detection models to confirm their applicability across different model architectures. Based on these findings, we propose a novel generator approach that exhibits the desired properties in order to generate high-quality, meaning-preserving adversarial samples.
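The abstract implies a success condition for each round of the game: the player's edited tweet must flip the detection model's decision while staying close in meaning to the original. The sketch below illustrates one way such a check could look; the function names, thresholds, and similarity measure are illustrative assumptions, not the paper's actual implementation.

```python
# Minimal sketch of the game's acceptance check (hypothetical names and
# thresholds; the paper's detection models and criteria are not given here).

from typing import Callable

def is_adversarial(original: str,
                   edited: str,
                   detector: Callable[[str], float],       # returns P(false information)
                   similarity: Callable[[str, str], float],  # returns semantic similarity in [0, 1]
                   detection_threshold: float = 0.5,
                   similarity_threshold: float = 0.8) -> bool:
    """Accept the player's edit as an adversarial sample if the detector's
    prediction flips below the decision threshold while the edit remains
    semantically close to the original tweet."""
    flipped = (detector(original) >= detection_threshold
               and detector(edited) < detection_threshold)
    meaning_kept = similarity(original, edited) >= similarity_threshold
    return flipped and meaning_kept
```

In a game loop, the player would keep editing the tweet until this check passes, at which point the edited text is stored as a meaning-preserving adversarial sample.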
@INPROCEEDINGS{FITPUB12734,
  author    = "J\'{a}n \v{C}egi\v{n}",
  title     = "A Game for Crowdsourcing Adversarial Examples for False Information Detection",
  pages     = "13--25",
  booktitle = "CEUR Workshop Proceedings",
  journal   = "CEUR Workshop Proceedings",
  volume    = 2022,
  number    = 2022,
  year      = 2022,
  location  = "Vienna, AT",
  publisher = "CEUR-WS.org",
  ISSN      = "1613-0073",
  language  = "english",
  url       = "https://www.fit.vut.cz/research/publication/12734"
}