RT Journal Article SR Electronic T1 The Structured Coalescent and its Approximations JF bioRxiv FD Cold Spring Harbor Laboratory SP 091058 DO 10.1101/091058 A1 Nicola F. Müller A1 David A. Rasmussen A1 Tanja Stadler YR 2016 UL http://biorxiv.org/content/early/2016/12/02/091058.abstract AB Phylogenetics can be used to elucidate the movement of pathogens between different host populations when the location of samples are considered alongside of pathogen sequence data. Pathogen phylogenies therefore offer insights into the movement of pathogens not available from classic epidemiological data alone. However, current phylogeographic methods to quantify migration patterns from phylogenies have several known shortcomings. In particular, one of the most widely used method treats migration the same as mutation, and as such does not incorporate information about population demography. This may lead to severe biases in estimated migration rates for datasets where sampling is biased across populations. On the other hand, the structured coalescent allows us to coherently model the migration and transmission process, but current implementations struggle with complex datasets due to the need to additionally infer ancestral migration histories. Thus, approximations to the structured coalescent which integrate over all ancestral migration histories have been developed. However, the validity and robustness of these approximations remain unclear. We here provide an exact numerical solution to the structured coalescent that does not require the inference of migration histories. While this solution is computationally unfeasible for large datasets, it clarifies the assumptions of previously developed approximate methods and allows us to provide an improved approximation to the structured coalescent. We have implemented these methods in BEAST2, and we show how our newly described approach outperforms previously described methods in accuracy at comparable computational cost.