Toward Self-learning End-to-End Task-Oriented Dialog Systems

Zhang, Xiaoying; Peng, Baolin; Gao, Jianfeng; Meng, Helen

Computer Science > Computation and Language

arXiv:2201.06849v2 (cs)

[Submitted on 18 Jan 2022 (v1), last revised 28 Dec 2022 (this version, v2)]

Title:Toward Self-learning End-to-End Task-Oriented Dialog Systems

Authors:Xiaoying Zhang, Baolin Peng, Jianfeng Gao, Helen Meng

View PDF

Abstract:End-to-end task bots are typically learned over a static and usually limited-size corpus. However, when deployed in dynamic, changing, and open environments to interact with users, task bots tend to fail when confronted with data that deviate from the training corpus, i.e., out-of-distribution samples. In this paper, we study the problem of automatically adapting task bots to changing environments by learning from human-bot interactions with minimum or zero human annotations. We propose SL-AGENT, a novel self-learning framework for building end-to-end task bots. SL-AGENT consists of a dialog model and a pre-trained reward model to predict the quality of an agent response. It enables task bots to automatically adapt to changing environments by learning from the unlabeled human-bot dialog logs accumulated after deployment via reinforcement learning with the incorporated reward model. Experimental results on four well-studied dialog tasks show the effectiveness of SL-AGENT to automatically adapt to changing environments, using both automatic and human evaluations. We will release code and data for further research.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2201.06849 [cs.CL]
	(or arXiv:2201.06849v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2201.06849

Submission history

From: Xiaoying Zhang [view email]
[v1] Tue, 18 Jan 2022 09:56:35 UTC (1,296 KB)
[v2] Wed, 28 Dec 2022 12:59:30 UTC (5,937 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xiaoying Zhang
Baolin Peng
Jianfeng Gao
Helen Meng

export BibTeX citation

Computer Science > Computation and Language

Title:Toward Self-learning End-to-End Task-Oriented Dialog Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Toward Self-learning End-to-End Task-Oriented Dialog Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators