Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

Kojima, Noriyuki; Suhr, Alane; Artzi, Yoav

Computer Science > Computation and Language

arXiv:2108.04812 (cs)

[Submitted on 10 Aug 2021]

Title:Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

Authors:Noriyuki Kojima, Alane Suhr, Yoav Artzi

View PDF

Abstract:We study continual learning for natural language instruction generation, by observing human users' instruction execution. We focus on a collaborative scenario, where the system both acts and delegates tasks to human users using natural language. We compare user execution of generated instructions to the original system intent as an indication to the system's success communicating its intent. We show how to use this signal to improve the system's ability to generate instructions via contextual bandit learning. In interaction with real users, our system demonstrates dramatic improvements in its ability to generate language over time.

Comments:	To appear in TACL 2021. The arXiv version is a pre-MIT Press publication version
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2108.04812 [cs.CL]
	(or arXiv:2108.04812v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2108.04812

Submission history

From: Noriyuki Kojima [view email]
[v1] Tue, 10 Aug 2021 17:53:44 UTC (1,530 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-08

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Noriyuki Kojima
Alane Suhr
Yoav Artzi

export BibTeX citation

Computer Science > Computation and Language

Title:Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators