In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i.e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.

Clustering Algorithms for Spatial Big Data

SCHOIER, GABRIELLA;GREGORIO, CATERINA
2017-01-01

Abstract

In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i.e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.
2017
978-3-319-62400-6
978-3-319-62401-3
File in questo prodotto:
File Dimensione Formato  
Computational+Science+and+Its+Applications IV.pdf

Accesso chiuso

Descrizione: libro completo
Tipologia: Documento in Versione Editoriale
Licenza: Digital Rights Management non definito
Dimensione 241.5 MB
Formato Adobe PDF
241.5 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Schoier_Clustering Algorithms for Spatial Big Data.pdf

Accesso chiuso

Descrizione: capitolo
Tipologia: Documento in Versione Editoriale
Licenza: Copyright Editore
Dimensione 6.19 MB
Formato Adobe PDF
6.19 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
2914566_Schoier_Clustering Algorithms for Spatial Big Data-PostPrint.pdf

accesso aperto

Descrizione: PostPrint VQR3
Tipologia: Bozza finale post-referaggio (post-print)
Licenza: Digital Rights Management non definito
Dimensione 5.72 MB
Formato Adobe PDF
5.72 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/2914566
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 5
social impact