In the present study, the challenge is whether a distant reading of the history of a discipline can be achieved by analysing the temporal evolution of keywords retrieved from papers in the discipline’s mainstream journals. This calls for a so-called knowledge-based system (KBS), i.e., a computer-based system that supports human learning not only by acquiring and manipulating large volumes of data and information, but also by integrating knowledge from different sources. In this chapter, we introduce a KBS that, starting from a large database of texts retrieved from scientific articles published over a lengthy period by a selection of the discipline’s premier journals, leads to the construction of a well-founded corpus of scientific literature and from this to a possible outline of the discipline’s history. Our work is based on the idea that the temporal course of a word occurrence is a proxy of the word’s life cycle. We then adopt a functional data analysis (FDA) approach under which we first reconstruct words' life cycles. Second, by clustering words with similar life cycles, we detect any prototypical or exemplary temporal patterns representing the latent dynamics of word micro-histories. The major dynamics uncovered at this stage are then submitted to subject matter experts for interpretation and guidance in decision-making, thus making it possible to trace a history of the discipline. Moreover, we propose several kinds of data normalisation which involve different concepts of life cycle similarity and hence a different reading of the history of the discipline under examination.

Functional Data Analysis and Knowledge-Based Systems

Trevisani, Matilde
2018-01-01

Abstract

In the present study, the challenge is whether a distant reading of the history of a discipline can be achieved by analysing the temporal evolution of keywords retrieved from papers in the discipline’s mainstream journals. This calls for a so-called knowledge-based system (KBS), i.e., a computer-based system that supports human learning not only by acquiring and manipulating large volumes of data and information, but also by integrating knowledge from different sources. In this chapter, we introduce a KBS that, starting from a large database of texts retrieved from scientific articles published over a lengthy period by a selection of the discipline’s premier journals, leads to the construction of a well-founded corpus of scientific literature and from this to a possible outline of the discipline’s history. Our work is based on the idea that the temporal course of a word occurrence is a proxy of the word’s life cycle. We then adopt a functional data analysis (FDA) approach under which we first reconstruct words' life cycles. Second, by clustering words with similar life cycles, we detect any prototypical or exemplary temporal patterns representing the latent dynamics of word micro-histories. The major dynamics uncovered at this stage are then submitted to subject matter experts for interpretation and guidance in decision-making, thus making it possible to trace a history of the discipline. Moreover, we propose several kinds of data normalisation which involve different concepts of life cycle similarity and hence a different reading of the history of the discipline under examination.
2018
978-3-319-97063-9
978-3-319-97064-6
File in questo prodotto:
File Dimensione Formato  
Trevisani_Functional Data Analysis and Knowledge-Based Systems.pdf

Accesso chiuso

Descrizione: capitolo con frontespizio e indice
Tipologia: Documento in Versione Editoriale
Licenza: Copyright Editore
Dimensione 2.51 MB
Formato Adobe PDF
2.51 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/2931202
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact