Probabilistic methods of analysis for the time series Moran scatterplot quadrant signature
Author(s)
Rohde, D
Corcoran, J
McGee, TR
Wickes, R
Townsley, M
Year published
2015
Metadata
Show full item recordAbstract
Recently, time series Moran scatterplot quadrant signatures (MSQS) have been introduced as a means of spatio-temporal analysis. Moran scatter plots summarise a set of local Moran statistics that identify how a quantity of interest relates to its neighbours. By reducing these scatter plots to just one of four quadrant locations at fixed time intervals, a time series MSQS is obtained. Clustering on the time series MSQS allows regions that show similar behaviour relative to their neigh- bours to be identified. It has recently been shown that the Levenshtein metric, a distance metric originally used for string comparison, can ...
View more >Recently, time series Moran scatterplot quadrant signatures (MSQS) have been introduced as a means of spatio-temporal analysis. Moran scatter plots summarise a set of local Moran statistics that identify how a quantity of interest relates to its neighbours. By reducing these scatter plots to just one of four quadrant locations at fixed time intervals, a time series MSQS is obtained. Clustering on the time series MSQS allows regions that show similar behaviour relative to their neigh- bours to be identified. It has recently been shown that the Levenshtein metric, a distance metric originally used for string comparison, can be used to construct a kernel that allows standard Ward hierarchical clustering to be applied. The result is that regions with similar spatio-temporal behaviour can be identified. The purpose of this paper is to demonstrate the use of Dirichlet Process mixture models as a fully probabilistic alternative to applying Ward hierarchical clustering with the Levenshtein metric. This approach offers an advance to the existing literature as it proposes a fully generative model that articulates the underlying assumptions and allows prediction of new test points. Additionally, it provides a principled method that avoids using heuristics to select the number of clusters. An efficient Gibbs sampling Markov chain Monte Carlo algorithm is presented, and it is demonstrated how the output of this can be mapped and analysed. Difficulties inher- ent in mapping and plotting high-dimensional mathematical objects are discussed, and practical solutions are proposed.
View less >
View more >Recently, time series Moran scatterplot quadrant signatures (MSQS) have been introduced as a means of spatio-temporal analysis. Moran scatter plots summarise a set of local Moran statistics that identify how a quantity of interest relates to its neighbours. By reducing these scatter plots to just one of four quadrant locations at fixed time intervals, a time series MSQS is obtained. Clustering on the time series MSQS allows regions that show similar behaviour relative to their neigh- bours to be identified. It has recently been shown that the Levenshtein metric, a distance metric originally used for string comparison, can be used to construct a kernel that allows standard Ward hierarchical clustering to be applied. The result is that regions with similar spatio-temporal behaviour can be identified. The purpose of this paper is to demonstrate the use of Dirichlet Process mixture models as a fully probabilistic alternative to applying Ward hierarchical clustering with the Levenshtein metric. This approach offers an advance to the existing literature as it proposes a fully generative model that articulates the underlying assumptions and allows prediction of new test points. Additionally, it provides a principled method that avoids using heuristics to select the number of clusters. An efficient Gibbs sampling Markov chain Monte Carlo algorithm is presented, and it is demonstrated how the output of this can be mapped and analysed. Difficulties inher- ent in mapping and plotting high-dimensional mathematical objects are discussed, and practical solutions are proposed.
View less >
Journal Title
Environmetrics
Subject
Mathematical sciences
Environmental sciences
Criminology not elsewhere classified