Community detection has become a fundamental tool for the analysis of complex networks, yet many real-world systems challenge the assumptions underlying traditional approaches. Strong degree heterogeneity, the presence of influential actors, and the availability of node attributes often limit or question the effectiveness of purely topology-based methods, which tend to overlook the mechanisms driving group formation. Addressing these limitations requires frameworks that explicitly account for distinct structural roles and aggregation driven by attribute similarity. Community detection in heterogeneous and attributed networks can be approached from different perspectives. Methods focused on standard definitions of communities emphasise connectivity patterns but struggle in the presence of hubs and high inter-community connectivity, while attribute-based approaches capture similarity but may underweight structural organisation. This thesis adopts an integrated perspective and treats degree heterogeneity and leadership as informative features rather than distortions. By doing so, the proposed framework advances an interpretable view of network organisation that extends beyond individual nodes to explicitly capture community-level structure and hierarchy, highlighting how leadership and connectivity patterns shape the formation, interaction, and positioning of communities within the broader network. This thesis draws on a diverse set of large-scale relational datasets spanning labour flow, housing, and scientific collaboration networks. Empirical analyses are conducted using regional administrative data on worker mobility, spatially explicit housing market data, European research collaboration data derived from the Horizon 2020 and Horizon Europe programmes, and bibliometric co-authorship data on Italian academic scholars constructed from the Italian Ministry of University and Research (MUR) registry and Scopus publication records. Three main contributions are presented. First, empirical analyses demonstrate how structurally prominent actors shape group formation across different applied contexts. Second, a density-based community detection framework is extended to integrate structural information and node attributes, allowing homophily to stabilise community boundaries and redefine leadership in terms of attribute-based community representativeness. Finally, a community-level approach to core-periphery detection is introduced, highlighting the role of peripheral community leaders in maintaining connectivity within the overall system.

Community detection has become a fundamental tool for the analysis of complex networks, yet many real-world systems challenge the assumptions underlying traditional approaches. Strong degree heterogeneity, the presence of influential actors, and the availability of node attributes often limit or question the effectiveness of purely topology-based methods, which tend to overlook the mechanisms driving group formation. Addressing these limitations requires frameworks that explicitly account for distinct structural roles and aggregation driven by attribute similarity. Community detection in heterogeneous and attributed networks can be approached from different perspectives. Methods focused on standard definitions of communities emphasise connectivity patterns but struggle in the presence of hubs and high inter-community connectivity, while attribute-based approaches capture similarity but may underweight structural organisation. This thesis adopts an integrated perspective and treats degree heterogeneity and leadership as informative features rather than distortions. By doing so, the proposed framework advances an interpretable view of network organisation that extends beyond individual nodes to explicitly capture community-level structure and hierarchy, highlighting how leadership and connectivity patterns shape the formation, interaction, and positioning of communities within the broader network. This thesis draws on a diverse set of large-scale relational datasets spanning labour flow, housing, and scientific collaboration networks. Empirical analyses are conducted using regional administrative data on worker mobility, spatially explicit housing market data, European research collaboration data derived from the Horizon 2020 and Horizon Europe programmes, and bibliometric co-authorship data on Italian academic scholars constructed from the Italian Ministry of University and Research (MUR) registry and Scopus publication records. Three main contributions are presented. First, empirical analyses demonstrate how structurally prominent actors shape group formation across different applied contexts. Second, a density-based community detection framework is extended to integrate structural information and node attributes, allowing homophily to stabilise community boundaries and redefine leadership in terms of attribute-based community representativeness. Finally, a community-level approach to core-periphery detection is introduced, highlighting the role of peripheral community leaders in maintaining connectivity within the overall system.

Methods for the analysis of complex group formation mechanisms in attributed networks / Geremia, Sara. - (2026 Mar 25).

Methods for the analysis of complex group formation mechanisms in attributed networks

GEREMIA, SARA
2026-03-25

Abstract

Community detection has become a fundamental tool for the analysis of complex networks, yet many real-world systems challenge the assumptions underlying traditional approaches. Strong degree heterogeneity, the presence of influential actors, and the availability of node attributes often limit or question the effectiveness of purely topology-based methods, which tend to overlook the mechanisms driving group formation. Addressing these limitations requires frameworks that explicitly account for distinct structural roles and aggregation driven by attribute similarity. Community detection in heterogeneous and attributed networks can be approached from different perspectives. Methods focused on standard definitions of communities emphasise connectivity patterns but struggle in the presence of hubs and high inter-community connectivity, while attribute-based approaches capture similarity but may underweight structural organisation. This thesis adopts an integrated perspective and treats degree heterogeneity and leadership as informative features rather than distortions. By doing so, the proposed framework advances an interpretable view of network organisation that extends beyond individual nodes to explicitly capture community-level structure and hierarchy, highlighting how leadership and connectivity patterns shape the formation, interaction, and positioning of communities within the broader network. This thesis draws on a diverse set of large-scale relational datasets spanning labour flow, housing, and scientific collaboration networks. Empirical analyses are conducted using regional administrative data on worker mobility, spatially explicit housing market data, European research collaboration data derived from the Horizon 2020 and Horizon Europe programmes, and bibliometric co-authorship data on Italian academic scholars constructed from the Italian Ministry of University and Research (MUR) registry and Scopus publication records. Three main contributions are presented. First, empirical analyses demonstrate how structurally prominent actors shape group formation across different applied contexts. Second, a density-based community detection framework is extended to integrate structural information and node attributes, allowing homophily to stabilise community boundaries and redefine leadership in terms of attribute-based community representativeness. Finally, a community-level approach to core-periphery detection is introduced, highlighting the role of peripheral community leaders in maintaining connectivity within the overall system.
25-mar-2026
DE STEFANO, DOMENICO
38
2024/2025
Settore SECS-S/05 - Statistica Sociale
Università degli Studi di Trieste
File in questo prodotto:
File Dimensione Formato  
DoctoralThesis.pdf

accesso aperto

Descrizione: Tesi di dottorato
Tipologia: Tesi di dottorato
Dimensione 7.53 MB
Formato Adobe PDF
7.53 MB Adobe PDF Visualizza/Apri
DoctoralThesis_1.pdf

accesso aperto

Descrizione: Tesi di dottorato
Tipologia: Tesi di dottorato
Dimensione 7.53 MB
Formato Adobe PDF
7.53 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/3129602
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact