Data-driven models can accurately describe and predict the dynamical properties of glass-forming liquids from structural data. Accurate predictions, however, do not guarantee an understanding of the underlying physical phenomena and the key factors that control them. In this paper, we illustrate the merits and limitations of linear regression models of glassy dynamics built on high-dimensional structural descriptors. By analyzing data for a two-dimensional glass model, we show that several descriptors commonly used in glass-transition studies display multicollinearity, which hinders the interpretability of linear models. Ridge regression suppresses some of the shortcomings of multicollinearity, but its solutions are not concise enough to be physically interpretable. Only by using dimensional reduction techniques we do eventually obtain linear models that strike a balance between prediction accuracy and interpretability. Our analysis points to a key role of local packing and composition fluctuations in the glass model under study.

Interpretability of linear regression models of glassy dynamics / Sharma, Anand; Liu, Chen; Ozawa, Misaki; Coslovich, Daniele. - In: PHYSICAL REVIEW MATERIALS. - ISSN 2475-9953. - 10:3(2026), pp. 035602--. [10.1103/q6pd-7trs]

Interpretability of linear regression models of glassy dynamics

Liu, Chen;Coslovich, Daniele
Ultimo
2026-01-01

Abstract

Data-driven models can accurately describe and predict the dynamical properties of glass-forming liquids from structural data. Accurate predictions, however, do not guarantee an understanding of the underlying physical phenomena and the key factors that control them. In this paper, we illustrate the merits and limitations of linear regression models of glassy dynamics built on high-dimensional structural descriptors. By analyzing data for a two-dimensional glass model, we show that several descriptors commonly used in glass-transition studies display multicollinearity, which hinders the interpretability of linear models. Ridge regression suppresses some of the shortcomings of multicollinearity, but its solutions are not concise enough to be physically interpretable. Only by using dimensional reduction techniques we do eventually obtain linear models that strike a balance between prediction accuracy and interpretability. Our analysis points to a key role of local packing and composition fluctuations in the glass model under study.
2026
Pubblicato
File in questo prodotto:
File Dimensione Formato  
q6pd-7trs-1.pdf

accesso aperto

Tipologia: Documento in Versione Editoriale
Licenza: Creative commons
Dimensione 4.61 MB
Formato Adobe PDF
4.61 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/3127558
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact