Learning to Quantify

Esuli, A.; Fabris, A.; Moreo, A.; Sebastiani, F.

doi:10.1007/978-3-031-20467-8

In a number of applications involving classiﬁcation, the ﬁnal goal is not determining which class (or classes) individual unlabelled instances belong to, but estimating the prevalence (or “relative frequency”, or “prior probability”) of each class in the unlabelled data. In recent years it has been pointed out that, in these cases, it would make sense to directly optimise machine learning algorithms for this goal, rather than (somehow indirectly) just optimising the classiﬁer’s ability to label individual instances. The task of training estimators of class prevalence via supervised learning is known as learning to quantify, or, more simply, quantiﬁcation. It is by now well known that performing quantiﬁcation by classifying each unlabelled instance via a standard classiﬁer and then counting the instances that have been assigned to the class (the Classify and Count method) usually leads to biased estimators of class prevalence, i.e., to poor quantiﬁcation accuracy; as a result, methods (and evaluation measures) that address quantiﬁcation as a task in its own right have been developed. This book covers the main applications of quantiﬁcation, the main methods that have been developed for learning to quantify, the measures that have been adopted for evaluating it, and the challenges that still need to be addressed by future research. The book is divided in seven chapters. Chapter 1 sets the stage for the rest of the book by introducing fundamental notions such as class distributions, their estimation, and dataset shift, by arguing for the suboptimality of using classiﬁcation techniques for performing this estimation, and by discussing why learning to quantify has evolved as a task of its own, rather than remaining a by-product of classiﬁcation. Chapter 2 provides the motivation for what is to come by describing the applications that quantiﬁcation has been put at, ranging from improving classiﬁcation accuracy in domain adaptation, to measuring and improving the fairness of classiﬁcation systems with respect to a sensitive attribute, to supporting research and development in the social sciences, in political science, epidemiology, market research, and others. In Chapter 3 we move on to discuss the experimental evaluation of quantiﬁcation systems; we look at evaluation measures for the various types of quantiﬁcation systems (binary, single-label multiclass, multi-label multiclass, ordinal), but also at evaluation protocols for quantiﬁcation, that essentially consist in ways to extract multiple testing samples for use in quantiﬁcation evaluation from a single classification test set. Chapter 4 is possibly the central chapter of the book, and looks at the various supervised learning methods for learning to quantify that have been proposed over the years, be they of an aggregative nature (i.e., methods that require the classification of all individual unlabelled items as an intermediate step) or of a non-aggregative nature (i.e., methods in which no classification of individual items is performed). In Chapter 5 we look at a number of “advanced” (or niche) topics in quantification, including quantification for ordinal data, cross-lingual quantification of textual items, quantification for networked data, and quantification for streaming data. Chapter 6 looks at other aspects of the “quantification landscape” that have not been covered in the previous chapters, and discusses the evolution of quantification research, from its beginnings to the most recent quantification-based “shared tasks”, the landscape of quantification-based, publicly available software libraries, and other tasks in data science that present important similarities with quantification. Chapter 6 also presents the results of experiments, that we have carried out ourselves, in which we compare many of the methods discussed in Chapter 4 on a common testing infrastructure. Chapter 7 concludes the book, pointing to potential future developments in the quantification arena.The book is mostly addressed to researchers in data science that might want to come up to speed with the state of the art in learning to quantify, but it can be useful also to researchers and scientists that operate in other disciplines and that apply techniques from data science to their own application domains. Indeed, it is our experience that many potential users of quantification techniques (who operate in the fields touched upon in Chapter 2, and possibly in others too) do not use them, thus settling for suboptimal “classify and count” techniques, for the simple fact that they are not aware of their existence, and of the existence of quantification as a task of its own; it is also those potential users that we hope will be inspired by this book. We thus hope that the availability of a book that surveys all aspects of the quantification workflow and presents them in a hopefully accessible form, will increase the interest in this subject on the part of researchers and practitioners alike, and will contribute to making quantification better known to potential users of this technology and to researchers interested in advancing the field.

ArTS Archivio della ricerca di Trieste