Publication Details
Deep Learning from Web-Scale Corpora for Better Dictionary Interfaces
Word2Vec, neural networks, ClueWeb, UKWaC, tip-of-the-tongue phenomenon
This paper explores advanced learning mechanisms - neural networks trained by the Word2Vec method - for predicting word associations. We discuss how the approach can be built into dictionary interfaces to help tip-of-the-tongue searches. We also describe our contribution to the CogALex 2014 shared task. We argue that the reverse response-stimulus word associations chosen for the shared task are only mildly related to the motivation idea of the lexical access support system. The methods employed in our contribution are briefly introduced. We present results of experiments with various parameter settings and show what improvement can be expected if more than one answer is allowed. The paper concludes with a proposal for a new collective effort to assemble real tip-of-the-tongue situation records for future, more-realistic evaluations.
@INPROCEEDINGS{FITPUB10709, author = "Lubom\'{i}r Otrusina and Pavel Smr\v{z}", title = "Deep Learning from Web-Scale Corpora for Better Dictionary Interfaces", pages = "22--30", booktitle = "Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex)", year = 2014, location = "Dublin, IE", publisher = "Association for Computational Linguistics", ISBN = "978-1-63439-217-4", language = "english", url = "https://www.fit.vut.cz/research/publication/10709" }