Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems

Ohashi, Atsumoto; Higashinaka, Ryuichiro

Computer Science > Computation and Language

arXiv:2502.00747 (cs)

[Submitted on 2 Feb 2025]

Title:Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems

Authors:Atsumoto Ohashi, Ryuichiro Higashinaka

View PDF HTML (experimental)

Abstract:Post-processing networks (PPNs) are components that modify the outputs of arbitrary modules in task-oriented dialogue systems and are optimized using reinforcement learning (RL) to improve the overall task completion capability of the system. However, previous PPN-based approaches have been limited to handling only a subset of modules within a system, which poses a significant limitation in improving the system performance. In this study, we propose a joint optimization method for post-processing the outputs of all modules using universal post-processing networks (UniPPNs), which are language-model-based networks that can modify the outputs of arbitrary modules in a system as a sequence-transformation task. Moreover, our RL algorithm, which employs a module-level Markov decision process, enables fine-grained value and advantage estimation for each module, thereby stabilizing joint learning for post-processing the outputs of all modules. Through both simulation-based and human evaluation experiments using the MultiWOZ dataset, we demonstrated that UniPPN outperforms conventional PPNs in the task completion capability of task-oriented dialogue systems.

Comments:	Accepted by AAAI 2025 Main Technical Track
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.00747 [cs.CL]
	(or arXiv:2502.00747v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.00747

Submission history

From: Atsumoto Ohashi [view email]
[v1] Sun, 2 Feb 2025 10:46:37 UTC (319 KB)

Computer Science > Computation and Language

Title:Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators