"Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students

Agarwal, Vibhor; Garg, Madhav Krishan; Dharmavaram, Sahiti; Kumar, Dhruv

Computer Science > Computers and Society

arXiv:2402.01687 (cs)

[Submitted on 22 Jan 2024 (v1), last revised 3 Apr 2024 (this version, v2)]

Title:"Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students

Authors:Vibhor Agarwal, Madhav Krishan Garg, Sahiti Dharmavaram, Dhruv Kumar

View PDF HTML (experimental)

Abstract:This study evaluates the effectiveness of various large language models (LLMs) in performing tasks common among undergraduate computer science students. Although a number of research studies in the computing education community have explored the possibility of using LLMs for a variety of tasks, there is a lack of comprehensive research comparing different LLMs and evaluating which LLMs are most effective for different tasks. Our research systematically assesses some of the publicly available LLMs such as Google Bard, ChatGPT(3.5), GitHub Copilot Chat, and Microsoft Copilot across diverse tasks commonly encountered by undergraduate computer science students in India. These tasks include code explanation and documentation, solving class assignments, technical interview preparation, learning new concepts and frameworks, and email writing. Evaluation for these tasks was carried out by pre-final year and final year undergraduate computer science students and provides insights into the models' strengths and limitations. This study aims to guide students as well as instructors in selecting suitable LLMs for any specific task and offers valuable insights on how LLMs can be used constructively by students and instructors.

Comments:	Under review
Subjects:	Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2402.01687 [cs.CY]
	(or arXiv:2402.01687v2 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2402.01687

Submission history

From: Dhruv Kumar [view email]
[v1] Mon, 22 Jan 2024 15:11:36 UTC (528 KB)
[v2] Wed, 3 Apr 2024 14:19:44 UTC (521 KB)

Computer Science > Computers and Society

Title:"Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:"Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators