MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

Lou, Renze; Zhang, Kai; Xie, Jian; Sun, Yuxuan; Ahn, Janice; Xu, Hanzi; Su, Yu; Yin, Wenpeng

Computer Science > Computation and Language

arXiv:2312.02436 (cs)

[Submitted on 5 Dec 2023 (v1), last revised 15 Mar 2024 (this version, v3)]

Title:MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

Authors:Renze Lou, Kai Zhang, Jian Xie, Yuxuan Sun, Janice Ahn, Hanzi Xu, Yu Su, Wenpeng Yin

View PDF HTML (experimental)

Abstract:In the realm of large language models (LLMs), enhancing instruction-following capability often involves curating expansive training data. This is achieved through two primary schemes: i) Scaling-Inputs: Amplifying (input, output) pairs per task instruction, aiming for better instruction adherence. ii) Scaling Input-Free Tasks: Enlarging tasks, each composed of an (instruction, output) pair (without requiring a separate input anymore). However, LLMs under Scaling-Inputs tend to be overly sensitive to inputs, leading to misinterpretation or non-compliance with instructions. Conversely, Scaling Input-Free Tasks demands a substantial number of tasks but is less effective in instruction following when dealing with instances in Scaling-Inputs. This work introduces MUFFIN, a new scheme of instruction-following dataset curation. Specifically, we automatically Scale Tasks per Input by diversifying these tasks with various input facets. Experimental results across four zero-shot benchmarks, spanning both Scaling-Inputs and Scaling Input-Free Tasks schemes, reveal that LLMs, at various scales, trained on MUFFIN generally demonstrate superior instruction-following capabilities compared to those trained on the two aforementioned schemes.

Comments:	ICLR 2024. Data, model, and code are available at: this https URL
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2312.02436 [cs.CL]
	(or arXiv:2312.02436v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2312.02436

Submission history

From: Renze Lou [view email]
[v1] Tue, 5 Dec 2023 02:32:08 UTC (2,298 KB)
[v2] Mon, 4 Mar 2024 04:12:02 UTC (2,374 KB)
[v3] Fri, 15 Mar 2024 03:11:44 UTC (2,374 KB)

Computer Science > Computation and Language

Title:MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators