PT - JOURNAL ARTICLE AU - Alvaro Chiner-Oms AU - Fernando González-Candelas AU - Iñaki Comas TI - Gene expression models based on a reference laboratory strain are bad predictors of <em>Mycobacterium tuberculosis</em> complex transcriptional diversity AID - 10.1101/091082 DP - 2016 Jan 01 TA - bioRxiv PG - 091082 4099 - http://biorxiv.org/content/early/2016/12/02/091082.short 4100 - http://biorxiv.org/content/early/2016/12/02/091082.full AB - Species of the Mycobacterium tuberculosis complex (MTBC) kill more people every year than any other infectious disease. As a consequence of its global distribution and parallel evolution with the human host the bacteria is not genetically homogeneous. The observed genetic heterogeneity has relevance at different phenotypic levels, from gene expression to epidemiological dynamics. However current systems biology datasets have focused in the laboratory reference strain H37Rv. By using large expression datasets testing the role of almost two hundred transcription factors, we have constructed computational models to grab the expression dynamics of Mycobacterium tuberculosis H37Rv genes. However, we have found that many of those transcription factors are deleted or likely dysfunctional across strains of the MTBC. In accordance, we failed to predict expression changes in strains with a different genetic background when compared with experimental data. The results highlight the importance of designing systems biology approaches that take into account the tubercle bacilli, or any other pathogen, genetic diversity if we want to identify universal targets for vaccines, diagnostics and treatments.