The aim of the present contribution is to merge bibliographic data for members of a bounded scientific community in order to derive a complete unified archive, with top-international and nationally oriented production, as a new basis to carry out network analysis on a unified co-authorship network. A two-step procedure is used to deal with the identification of duplicate records and the author name disambiguation. Specifically, for the second step we strongly drew inspiration from a well-established unsupervised disambiguation method proposed in the literature following a network-based approach and requiring a restricted set of record attributes. Evidences from Italian academic statisticians were provided by merging data from three bibliographic archives. Non-negligible differences were observed in network results in the comparison of disambiguated and not disambiguated data sets, especially in network measures at individual level.
Titolo: | Improving co-authorship network structures by combining multiple data sources: evidence from Italian academic statisticians | |
Autori: | ||
Data di pubblicazione: | 2016 | |
Stato di pubblicazione: | Pubblicato | |
Rivista: | ||
Abstract: | The aim of the present contribution is to merge bibliographic data for members of a bounded scientific community in order to derive a complete unified archive, with top-international and nationally oriented production, as a new basis to carry out network analysis on a unified co-authorship network. A two-step procedure is used to deal with the identification of duplicate records and the author name disambiguation. Specifically, for the second step we strongly drew inspiration from a well-established unsupervised disambiguation method proposed in the literature following a network-based approach and requiring a restricted set of record attributes. Evidences from Italian academic statisticians were provided by merging data from three bibliographic archives. Non-negligible differences were observed in network results in the comparison of disambiguated and not disambiguated data sets, especially in network measures at individual level. | |
Handle: | http://hdl.handle.net/11368/2866726 | |
Digital Object Identifier (DOI): | http://dx.doi.org/10.1007/s11192-016-1872-y | |
URL: | https://link.springer.com/article/10.1007/s11192-016-1872-y | |
Appare nelle tipologie: | 1.1 Articolo in Rivista |
File in questo prodotto:
File | Descrizione | Tipologia | Licenza | |
---|---|---|---|---|
Improving co-authorship.pdf | Articolo principale | Documento in Versione Editoriale | Copyright Editore | Administrator Richiedi una copia |
2866726_Improving co-authorship-PostPrint.pdf | Post Print VQR3 | Bozza finale post-referaggio (post-print) | Digital Rights Management non definito | Open Access Visualizza/Apri |