Large Language Model-based Test Case Generation for GP Agents

Genetic programming (GP) is a popular problem-solving and optimization technique. However, generating effective test cases for training and evaluating GP programs requires strong domain knowledge. Furthermore, GP programs often prematurely converge on local optima when given excessively difficult problems early in their training. Curriculum learning (CL) has been effective in addressing similar issues across different reinforcement learning (RL) domains, but it requires the manual generation of progressively difficult test cases as well as their careful scheduling. In this work, we leverage the domain knowledge and the strong generative abilities of large language models (LLMs) to generate effective test cases of increasing difficulties and schedule them according to various curricula. We show that by integrating a curriculum scheduler with LLM-generated test cases we can effectively train a GP agent player with environments-based curricula for a single-player game and opponent-based curricula for a multi-player game. Finally, we discuss the benefits and challenges of implementing this method for other problem domains.

Large Language Model-based Test Case Generation for GP Agents

Jorgensen, Steven;Nadizar, Giorgia;Pietropolli, Gloria;Manzoni, Luca;Medvet, Eric;O'Reilly, Una-May;Hemberg, Erik

2024-01-01

Abstract

Genetic programming (GP) is a popular problem-solving and optimization technique. However, generating effective test cases for training and evaluating GP programs requires strong domain knowledge. Furthermore, GP programs often prematurely converge on local optima when given excessively difficult problems early in their training. Curriculum learning (CL) has been effective in addressing similar issues across different reinforcement learning (RL) domains, but it requires the manual generation of progressively difficult test cases as well as their careful scheduling. In this work, we leverage the domain knowledge and the strong generative abilities of large language models (LLMs) to generate effective test cases of increasing difficulties and schedule them according to various curricula. We show that by integrating a curriculum scheduler with LLM-generated test cases we can effectively train a GP agent player with environments-based curricula for a single-player game and opponent-based curricula for a multi-player game. Finally, we discuss the benefits and challenges of implementing this method for other problem domains.

Scheda breve

Scheda completa

	Anno
	
				2024
			
	ISBN
	
				979-8-4007-0494-9
			
	URL
	
				https://dl.acm.org/doi/10.1145/3638529.3654056
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti Convegno (Proceeding)

File in questo prodotto:

File	Dimensione	Formato
3638529.3654056.pdf Accesso chiuso Tipologia: Documento in Versione Editoriale Licenza: Creative commons Dimensione 711.75 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	711.75 kB	Adobe PDF	Visualizza/Apri Richiedi una copia
2024-GECCO-LLMTestCaseGenerationForGPAgents.pdf Accesso chiuso Descrizione: Versione definitiva ma senza numero di pagine e con un'appendice aggiuntiva in fondo Tipologia: Bozza finale post-referaggio (post-print) Licenza: Copyright Editore Dimensione 1.32 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.32 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/3084242

Citazioni

ND

7

3

social impact