RT Journal Article SR Electronic T1 Null Models for Community Detection in Spatially-Embedded, Temporal Networks JF bioRxiv FD Cold Spring Harbor Laboratory SP 007435 DO 10.1101/007435 A1 Marta Sarzynska A1 Elizabeth A. Leicht A1 Gerardo Chowell A1 Mason A. Porter YR 2014 UL http://biorxiv.org/content/early/2014/07/24/007435.abstract AB In the study of networks, it is often insightful to use algorithms to determine mesoscale features such as “community structure”, in which densely connected sets of nodes constitute “communities” that have sparse connections to other communities. The most popular way of detecting communities algorithmically is to optimize the quality function known as modularity. When optimizing modularity, one compares the actual connections in a (static or time-dependent) network to the connections obtained from a random-graph ensemble that acts as a null model. The communities are then the sets of nodes that are connected to each other densely relative to what is expected from the null model. Clearly, the process of community detection depends fundamentally on the choice of null model, so it is important to develop and analyze novel null models that take into account appropriate features of the system under study. In this paper, we investigate the effects of using null models that take incorporate spatial information, and we propose a novel null model based on the radiation model of population spread. We also develop novel synthetic spatial benchmark networks in which the connections between entities are based on distance or flux between nodes, and we compare the performance of both static and time-dependent radiation null models to the standard (“Newman-Girvan”) null model for modularity optimization and a recently-proposed gravity null model. In our comparisons, we use both the above synthetic benchmarks and time-dependent correlation networks that we construct using countrywide dengue fever incidence data for Peru. We also evaluate a recently-proposed correlation null model, which was developed specifically for correlation networks that are constructed from time series, on the epidemic-correlation data. Our findings underscore the need to use appropriate generative models for the development of spatial null models for community detection.PACS numbers 87.19.Xx,89.20.-a,89.75.Fb,05.45.Tp