RT Journal Article SR Electronic T1 16S rRNA amplicon sequencing for epidemiological surveys of bacteria in wildlife: the importance of cleaning post-sequencing data before estimating positivity, prevalence and co-infection JF bioRxiv FD Cold Spring Harbor Laboratory SP 039826 DO 10.1101/039826 A1 Maxime Galan A1 Maria Razzauti A1 Emilie Bard A1 Maria Bernard A1 Carine Brouat A1 Nathalie Charbonnel A1 Alexandre Dehne-Garcia A1 Anne Loiseau A1 Caroline Tatard A1 Lucie Tamisier A1 Muriel Vayssier-Taussat A1 Helene Vignes A1 Jean François Cosson YR 2016 UL http://biorxiv.org/content/early/2016/06/19/039826.abstract AB Human impact on natural habitats is increasing the complexity of human-wildlife interfaces and leading to the emergence of infectious diseases worldwide. Highly successful synanthropic wildlife species, such as rodents, will undoubtedly play an increasingly important role in transmitting zoonotic diseases. We investigated the potential for recent developments in 16S rRNA amplicon sequencing to facilitate the multiplexing of large numbers of samples needed to improve our understanding of the risk of zoonotic disease transmission posed by urban rodents in West Africa. In addition to listing pathogenic bacteria in wild populations, as in other high-throughput sequencing (HTS) studies, our approach can estimate essential parameters for studies of zoonotic risk, such as prevalence and patterns of coinfection within individual hosts. However, the estimation of these parameters requires cleaning of the raw data to mitigate the biases generated by HTS methods. We present here an extensive review of these biases and of their consequences, and we propose a comprehensive trimming strategy for managing these biases. We demonstrated the application of this strategy using 711 commensal rodents collected from 24 villages in Senegal, including 208 Mus musculus domesticus, 189 Rattus rattus, 93 Mastomys natalensis and 221 Mastomys erythroleucus. Seven major genera of pathogenic bacteria were detected in their spleens: Borrelia, Bartonella, Mycoplasma, Ehrlichia, Rickettsia, Streptobacillus and Orientia. The last five of these genera have never before been detected in West African rodents. Bacterial prevalence ranged from 0% to 90% of individuals per site, depending on the bacterial taxon, rodent species and site considered, and 26% of rodents displayed coinfection. The 16S rRNA amplicon sequencing strategy presented here has the advantage over other molecular surveillance tools of dealing with a large spectrum of bacterial pathogens without requiring assumptions about their presence in the samples. This approach is therefore particularly suitable for continuous pathogen surveillance in the context of disease monitoring programs.