CoDi: Conversational Distillation for Grounded Question Answering

Huber, Patrick; Einolghozati, Arash; Conway, Rylan; Narang, Kanika; Smith, Matt; Nayyar, Waqar; Sagar, Adithya; Aly, Ahmed; Shrivastava, Akshat

arXiv:2408.11219 (cs)

[Submitted on 20 Aug 2024]

Abstract: Distilling conversational skills into Small Language Models (SLMs) with approximately 1 billion parameters presents significant challenges. Firstly, SLMs have limited capacity in their model parameters to learn extensive knowledge compared to larger models. Secondly, high-quality conversational datasets are often scarce, small, and domain-specific. Addressing these challenges, we introduce a novel data distillation framework named CoDi (short for Conversational Distillation, pronounced "Cody"), allowing us to synthesize large-scale, assistant-style datasets in a steerable and diverse manner. Specifically, while our framework is task agnostic at its core, we explore and evaluate the potential of CoDi on the task of conversational grounded reasoning for question answering. This is a typical on-device scenario for specialist SLMs, allowing for open-domain model responses, without requiring the model to "memorize" world knowledge in its limited weights. Our evaluations show that SLMs trained with CoDi-synthesized data achieve performance comparable to models trained on human-annotated data in standard metrics. Additionally, when using our framework to generate larger datasets from web data, our models surpass larger, instruction-tuned models in zero-shot conversational grounded reasoning tasks.

Comments:	13 pages
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.11219 [cs.CL]
	(or arXiv:2408.11219v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.11219

Submission history

From: Patrick Huber
[v1] Tue, 20 Aug 2024 22:35:47 UTC (9,958 KB)
