Enhancing Large Language Models-Based Code Generation by Leveraging Genetic Improvement

Pinna, Giovanni; Ravalico, Damiano; Rovito, Luigi; Manzoni, Luca; De Lorenzo, Andrea

doi:10.1007/978-3-031-56957-9_7

In recent years, rapid advances in neural networks for Natural Language Processing (NLP) have led to the development of Large Language Models (LLMs), which have substantially improved the state of the art in many NLP tasks, such as question answering and text summarization. One particularly interesting application is automatic code generation from the problem description alone. However, it has been shown that even the most effective LLMs available often fail to produce correct code. To address this issue, we propose an evolutionary approach that uses Genetic Improvement (GI) to improve the code generated by an LLM, guided by a collection of user-provided test cases. Specifically, we employ Grammatical Evolution (GE) with a grammar that we automatically specialize, starting from a general one, for the output of the LLM. We test 25 different problems and 5 different LLMs, showing that the proposed method improves the code generated by LLMs in a statistically significant way. This is a first step in showing that the combination of LLMs and evolutionary techniques can be a fruitful avenue of research.
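As a rough illustration of the two ingredients named in the abstract, the sketch below shows a generic Grammatical Evolution genotype-to-phenotype mapping and a fitness score computed from test cases. It is not the authors' implementation: the toy grammar, the names `GRAMMAR`, `ge_map`, and `fitness`, the choice to consume one codon per expansion, and the "fraction of failed tests" score are all assumptions made for this example. The paper specializes its grammar to the LLM's output, which this sketch does not attempt.

```python
from typing import Any, Dict, List, Optional, Sequence, Tuple

# Hypothetical toy grammar: each non-terminal maps to its list of productions.
GRAMMAR: Dict[str, List[List[str]]] = {
    "<expr>": [["<expr>", "<op>", "<expr>"], ["<var>"], ["<const>"]],
    "<op>": [["+"], ["-"], ["*"]],
    "<var>": [["x"]],
    "<const>": [["1"], ["2"]],
}


def ge_map(genome: Sequence[int], start: str = "<expr>",
           max_wraps: int = 2) -> Optional[str]:
    """Map an integer genome to an expression string.

    The leftmost non-terminal is expanded with the production selected by
    `codon % number_of_productions`. This sketch consumes one codon per
    expansion and reuses the genome (wrapping) at most `max_wraps` times.
    """
    symbols = [start]
    output: List[str] = []
    used = 0
    limit = len(genome) * (max_wraps + 1)
    while symbols:
        symbol = symbols.pop(0)
        if symbol not in GRAMMAR:      # terminal: emit it as-is
            output.append(symbol)
            continue
        if used >= limit:              # ran out of codons: invalid individual
            return None
        productions = GRAMMAR[symbol]
        codon = genome[used % len(genome)]
        used += 1
        symbols = list(productions[codon % len(productions)]) + symbols
    return " ".join(output)


def fitness(expr: Optional[str], tests: Sequence[Tuple[Any, Any]]) -> float:
    """Fraction of failed test cases (lower is better); invalid individuals get 1.0."""
    if expr is None:
        return 1.0
    failed = 0
    for x, expected in tests:
        try:
            # eval is acceptable here only because the grammar is a fixed toy one.
            if eval(expr, {"__builtins__": {}}, {"x": x}) != expected:
                failed += 1
        except Exception:
            failed += 1
    return failed / len(tests)


candidate = ge_map([0, 1, 2, 2, 1])
score = fitness(candidate, [(2, 4), (3, 9)])
```

In a full evolutionary loop, a population of such genomes would be varied by mutation and crossover and selected by this score; seeding that population from LLM-generated code is the part that the paper contributes and that this sketch leaves out.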

2024-01-01

Year: 2024
Series title: LECTURE NOTES IN COMPUTER SCIENCE
ISBN: 978-3-031-56956-2; 978-3-031-56957-9
URL: https://link.springer.com/chapter/10.1007/978-3-031-56957-9_7
Appears in types: 4.1 Conference Proceedings Contribution (Proceeding)

Files in this record:

File	Access	Type	License	Size	Format
versione_editoriale.pdf	Closed access	Publisher's version of the article	Publisher copyright	249.01 kB	Adobe PDF
versione_editoriale-Post_print.pdf	Open access from 29/03/2025	Final post-refereeing draft (post-print)	Digital Rights Management not defined	805.81 kB	Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/11368/3071899

ArTS Archivio della ricerca di Trieste
