The analysis of ego networks has attracted a great attention recently and found application in many areas of the social sciences. In particular, the identification of network typologies has become a crucial task and a powerful tool to capture aspects of the social space or personal community in which people are embedded. In this work, we propose a distance-based clustering procedure to identify homogeneous groups of ego networks that are only described by a small number of compositional variables. The proposed approach is motivated by the empirical study of ego networks of contacts extracted from the “Family and Social Subjects” (FSS) Survey conducted by the Italian National Statistical Institute in 2016, which is not specifically oriented to network analysis. We focus on elderly respondents living alone, which can be regarded as a vulnerable category, with the aim to describe their network of contacts. First, mining relational information in FSS data, we derive the ego networks of respondents. Then, we develop a methodology for coping with the presence of heterogeneous data and small amount of information from a network perspective. To this aim, we introduce a dissimilarity measure for mixed-type data, and exploit hierarchical clustering for grouping ego networks according to their composition. In doing so, we intend to make our approach applicable to various surveys.

A clustering procedure for mixed-type data to explore ego network typologies: an application to elderly people living alone in Italy

Pappadà Roberta
2021-01-01

Abstract

The analysis of ego networks has attracted a great attention recently and found application in many areas of the social sciences. In particular, the identification of network typologies has become a crucial task and a powerful tool to capture aspects of the social space or personal community in which people are embedded. In this work, we propose a distance-based clustering procedure to identify homogeneous groups of ego networks that are only described by a small number of compositional variables. The proposed approach is motivated by the empirical study of ego networks of contacts extracted from the “Family and Social Subjects” (FSS) Survey conducted by the Italian National Statistical Institute in 2016, which is not specifically oriented to network analysis. We focus on elderly respondents living alone, which can be regarded as a vulnerable category, with the aim to describe their network of contacts. First, mining relational information in FSS data, we derive the ego networks of respondents. Then, we develop a methodology for coping with the presence of heterogeneous data and small amount of information from a network perspective. To this aim, we introduce a dissimilarity measure for mixed-type data, and exploit hierarchical clustering for grouping ego networks according to their composition. In doing so, we intend to make our approach applicable to various surveys.
File in questo prodotto:
File Dimensione Formato  
Pappadà2021_Article_AClusteringProcedureForMixed-t.pdf

Accesso chiuso

Descrizione: articolo
Tipologia: Documento in Versione Editoriale
Licenza: Copyright Editore
Dimensione 562.82 kB
Formato Adobe PDF
562.82 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
3007291_Pappad…2021_Article_AClusteringProcedureForMixed-t-Post_print.pdf

Open Access dal 01/10/2022

Tipologia: Bozza finale post-referaggio (post-print)
Licenza: Digital Rights Management non definito
Dimensione 1.1 MB
Formato Adobe PDF
1.1 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/3007291
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact