Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs

Zhu, Shaojie; Wang, Zhaobin; Zhuo, Chengxiang; Lu, Hui; Hu, Bo; Li, Zang

Computer Science > Artificial Intelligence

arXiv:2312.17535 (cs)

[Submitted on 29 Dec 2023]

Title:Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs

Authors:Shaojie Zhu, Zhaobin Wang, Chengxiang Zhuo, Hui Lu, Bo Hu, Zang Li

View PDF HTML (experimental)

Abstract:CoT (Chain-of-Thought) is a way to solve reasoning problems for LLMs . Recently, many researches appear for improving the CoT capability of LLMs. In this work, we also proposed Olapa-MCoT, which is a LLMs based on llama2-13B PLM for finetuning and alignment learning. During the alignment training, we proposed the SimRRHF algorithm and Incorrect Data Relearning and mainly focused on optimizing the Chinese mathematical reasoning ability of Olapa-MCoT. The experiment achieved significant results, with the accuracy of Chinese mathematical reasoning up to 50%, 36% rise compared to llama2-13B. In addition, the accuracy of English reasoning ability also increased by nearly 4%.

Comments:	10 pages, 1 figures
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2312.17535 [cs.AI]
	(or arXiv:2312.17535v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2312.17535

Submission history

From: Shaojie Zhu [view email]
[v1] Fri, 29 Dec 2023 09:33:35 UTC (155 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2023-12

Change to browse by:

cs
cs.AI
cs.HC

References & Citations

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators