Skip to content

Latest commit

 

History

History
27 lines (19 loc) · 3.17 KB

README.md

File metadata and controls

27 lines (19 loc) · 3.17 KB

Open-domain Chatbot Augmented with Commonsense Knowledge

Master Thesis work published on the Conference "Lithuanian MSc Research in Informatics and ICT".

Abstract

Building an open-domain dialog system is a challenging task in current research. In order to successfully maintain a conversation with human, a dialog system must develop many qualities: being engaging, empathetic, show a unique personality and having general knowledge about the world. Prior research has shown that it is possible to develop such chat-bot system that combines these features, but this work explores this problem further. Most state-of-the-art dialogue systems are guided by unstructured knowledge such as Wikipedia articles, but there is a lack of research on how structured knowledge bases can be used for open-domain dialogue generation. This work proposes usage of structured knowledge base ConceptNet for knowledge-grounded dialogue generation. Novel knowledge extraction algorithm is developed which is then used to incorporate knowledge into existing dialogue datasets. Current state-of-the-art model BlenderBot is finetuned on newly created datasets and it is shown that knowledge augmentation of the dataset improved BlenderBot in terms of various automated metrics and according to human evaluation.

Small technical description

Baseline model, BlenderBot 1, was fine-tuned on a knowledge-augmented datasets. Each original dataset (BST, ConvAI2, WoW, ED) was preprocessed by knowledge extraction algorithm. Developed algorithm extracts knowledge triples (assertions) from ConceptNet and adds the most relevant ones to the inputted utterance. Relevance is described by cosine similarity between the utterance sentence embedding and the knowledge triple embedding (treated as a small sentence). Extracted knowledge were appended to dataset messages, each ConceptNet relation was treated as a special token. The latest version of the algorithm also extracts knowledge from the whole context of the dialogue and not only the last utterance.

Automated metrics

image

Human evaluation

There was an attempt to evaluate the developed model in a fashion similar to ACUTE-EVAL. Although there were not enough resources to perform a full-scale crowdsourced survey, a small amount (~30) of friends and relatives were able to take a survey. One can still take the survey if interested.

image