Span-Selective Linear Attention Transformers for Effective and Robust Schema-Guided Dialogue State Tracking

Bebensee, Björn; Lee, Haejun

arXiv:2306.09340 (cs)

[Submitted on 15 Jun 2023]

Authors:Björn Bebensee, Haejun Lee

Abstract: In schema-guided dialogue state tracking, models estimate the current state of a conversation using natural language descriptions of the service schema for generalization to unseen services. Prior generative approaches, which decode slot values sequentially, do not generalize well to variations in schema, while discriminative approaches encode history and schema separately and fail to account for inter-slot and intent-slot dependencies. We introduce SPLAT, a novel architecture which achieves better generalization and efficiency than prior approaches by constraining outputs to a limited prediction space. At the same time, our model allows for rich attention among descriptions and history while keeping computation costs constrained by incorporating linear-time attention. We demonstrate the effectiveness of our model on the Schema-Guided Dialogue (SGD) and MultiWOZ datasets. Our approach significantly improves upon existing models, achieving 85.3 JGA on the SGD dataset. Further, we show increased robustness on the SGD-X benchmark: our model outperforms the more than 30$\times$ larger D3ST-XXL model by 5.0 points.
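
The headline figures above are reported in JGA (joint goal accuracy), which is commonly defined as the fraction of dialogue turns whose predicted state matches the reference state on every slot. The following is a minimal sketch of that definition under simplifying assumptions: states are plain slot-to-value dictionaries, matching is exact string equality, and the function name `joint_goal_accuracy` and the example slot names are hypothetical. It is not the paper's evaluation code, and official benchmark scripts may normalize or match values differently.

```python
from typing import Dict, List

# A dialogue state for one turn: slot name -> slot value (illustrative format).
State = Dict[str, str]


def joint_goal_accuracy(predicted: List[State], gold: List[State]) -> float:
    """Return the fraction of turns whose predicted state equals the gold state."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same number of turns")
    if not gold:
        return 0.0
    # A turn counts as correct only if every slot and value matches exactly.
    correct = sum(1 for p, g in zip(predicted, gold) if p == g)
    return correct / len(gold)


# Hypothetical two-turn dialogue: the second turn has one wrong slot value.
gold = [
    {"restaurant-city": "san jose"},
    {"restaurant-city": "san jose", "restaurant-time": "7 pm"},
]
predicted = [
    {"restaurant-city": "san jose"},
    {"restaurant-city": "san jose", "restaurant-time": "8 pm"},
]
print(joint_goal_accuracy(predicted, gold))  # 0.5
```

Because a single wrong slot makes the whole turn count as incorrect, the metric is strict, which is why robustness to schema variation shows up clearly in it.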

Comments:	Accepted to ACL 2023
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2306.09340 [cs.CL]
	(or arXiv:2306.09340v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.09340

Submission history

From: Björn Bebensee
[v1] Thu, 15 Jun 2023 17:59:31 UTC (7,393 KB)
