The study aims at comparing two methods for tracing the temporal evolution of topics and keywords in corpora of scientific literature: the well-known Latent Dirichelet Allocation and a new knowledge-based system that has been developed in a functional data analysis unsupervised perspective. Object of the study is a corpus of abstracts of articles published by the American Journal of Sociology over a century (1921-2018). Our study advocates that the two methods might not be seen as alternative but rather as integrable means to improve the interpretation of findings.

Knowledge discovery for dynamic textual data: temporal patterns of topics and word clusters in corpora of scientific literature

Matilde Trevisani
;
2019-01-01

Abstract

The study aims at comparing two methods for tracing the temporal evolution of topics and keywords in corpora of scientific literature: the well-known Latent Dirichelet Allocation and a new knowledge-based system that has been developed in a functional data analysis unsupervised perspective. Object of the study is a corpus of abstracts of articles published by the American Journal of Sociology over a century (1921-2018). Our study advocates that the two methods might not be seen as alternative but rather as integrable means to improve the interpretation of findings.
File in questo prodotto:
File Dimensione Formato  
Trevisani_Knowledge discovery for dynamic textual data.pdf

Accesso chiuso

Descrizione: contributo con frontespizio e indice del libro
Tipologia: Documento in Versione Editoriale
Licenza: Copyright Editore
Dimensione 25.15 MB
Formato Adobe PDF
25.15 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/2951150
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact