On the use of cluster analysis for individuating variable influence on spread variation in large datasets