We introduce Limited Rollout Beam Search (LRBS), a beam search strategy for deep reinforcement learning (DRL) based combinatorial optimization improvement heuristics. Utilizing pre-trained models on the Euclidean Traveling Salesperson Problem, LRBS significantly enhances both in-distribution performance and generalization to larger problem instances, achieving optimality gaps that outperform existing improvement heuristics and narrowing the gap with state-of-the-art constructive methods. We also extend our analysis to two pickup and delivery TSP variants to validate our results. Finally, we employ our search strategy for offline and online adaptation of the pre-trained improvement policy, leading to improved search performance and surpassing recent adaptive methods for constructive heuristics.

Scaling Combinatorial Optimization Neural Improvement Heuristics with Online Search and Adaptation / Camerota Verdu, F.J., Castelli, L., Bortolussi, L.. - ELETTRONICO. - 39:25(2025), pp. 27135-27143. (39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 Philadelphia, Pennsylvania February 25–March 4, 2025) [10.1609/aaai.v39i25.34921].

Scaling Combinatorial Optimization Neural Improvement Heuristics with Online Search and Adaptation

Camerota Verdu F. J.
Primo
;
Castelli L.
Secondo
;
Bortolussi L.
Ultimo
2025-01-01

Abstract

We introduce Limited Rollout Beam Search (LRBS), a beam search strategy for deep reinforcement learning (DRL) based combinatorial optimization improvement heuristics. Utilizing pre-trained models on the Euclidean Traveling Salesperson Problem, LRBS significantly enhances both in-distribution performance and generalization to larger problem instances, achieving optimality gaps that outperform existing improvement heuristics and narrowing the gap with state-of-the-art constructive methods. We also extend our analysis to two pickup and delivery TSP variants to validate our results. Finally, we employ our search strategy for offline and online adaptation of the pre-trained improvement policy, leading to improved search performance and surpassing recent adaptive methods for constructive heuristics.
File in questo prodotto:
File Dimensione Formato  
34921-Article Text-38988-1-2-20250410.pdf

Accesso chiuso

Descrizione: Contributo scaricabile gratuitamente alla pagina https://ojs.aaai.org/index.php/AAAI/article/view/34921
Tipologia: Documento in Versione Editoriale
Licenza: Copyright Editore
Dimensione 1.84 MB
Formato Adobe PDF
1.84 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/3113689
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
social impact