Explanations from Large Language Models Make Small Reasoners Better

Li, Shiyang; Chen, Jianshu; Shen, Yelong; Chen, Zhiyu; Zhang, Xinlu; Li, Zekun; Wang, Hong; Qian, Jing; Peng, Baolin; Mao, Yi; Chen, Wenhu; Yan, Xifeng

Computer Science > Computation and Language

arXiv:2210.06726 (cs)

[Submitted on 13 Oct 2022]

Title:Explanations from Large Language Models Make Small Reasoners Better

Authors:Shiyang Li, Jianshu Chen, Yelong Shen, Zhiyu Chen, Xinlu Zhang, Zekun Li, Hong Wang, Jing Qian, Baolin Peng, Yi Mao, Wenhu Chen, Xifeng Yan

View PDF

Abstract:Integrating free-text explanations to in-context learning of large language models (LLM) is shown to elicit strong reasoning capabilities along with reasonable explanations. In this paper, we consider the problem of leveraging the explanations generated by LLM to improve the training of small reasoners, which are more favorable in real-production deployment due to their low cost. We systematically explore three explanation generation approaches from LLM and utilize a multi-task learning framework to facilitate small models to acquire strong reasoning power together with explanation generation capabilities. Experiments on multiple reasoning tasks show that our method can consistently and significantly outperform finetuning baselines across different settings, and even perform better than finetuning/prompting a 60x larger GPT-3 (175B) model by up to 9.5% in accuracy. As a side benefit, human evaluation further shows that our method can generate high-quality explanations to justify its predictions, moving towards the goal of explainable AI.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.06726 [cs.CL]
	(or arXiv:2210.06726v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.06726

Submission history

From: Shiyang Li [view email]
[v1] Thu, 13 Oct 2022 04:50:02 UTC (11,147 KB)

Computer Science > Computation and Language

Title:Explanations from Large Language Models Make Small Reasoners Better

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Explanations from Large Language Models Make Small Reasoners Better

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators