RT Journal Article SR Electronic T1 Gene expression models based on a reference laboratory strain are bad predictors of Mycobacterium tuberculosis complex transcriptional diversity JF bioRxiv FD Cold Spring Harbor Laboratory SP 091082 DO 10.1101/091082 A1 Alvaro Chiner-Oms A1 Fernando González-Candelas A1 Iñaki Comas YR 2016 UL http://biorxiv.org/content/early/2016/12/02/091082.abstract AB Species of the Mycobacterium tuberculosis complex (MTBC) kill more people every year than any other infectious disease. As a consequence of its global distribution and parallel evolution with the human host the bacteria is not genetically homogeneous. The observed genetic heterogeneity has relevance at different phenotypic levels, from gene expression to epidemiological dynamics. However current systems biology datasets have focused in the laboratory reference strain H37Rv. By using large expression datasets testing the role of almost two hundred transcription factors, we have constructed computational models to grab the expression dynamics of Mycobacterium tuberculosis H37Rv genes. However, we have found that many of those transcription factors are deleted or likely dysfunctional across strains of the MTBC. In accordance, we failed to predict expression changes in strains with a different genetic background when compared with experimental data. The results highlight the importance of designing systems biology approaches that take into account the tubercle bacilli, or any other pathogen, genetic diversity if we want to identify universal targets for vaccines, diagnostics and treatments.