RT Journal Article SR Electronic T1 Identification of combinatorial and singular genomic signatures of host adaptation in influenza A H1N1 and H3N2 subtypes JF bioRxiv FD Cold Spring Harbor Laboratory SP 044909 DO 10.1101/044909 A1 Zeeshan Khaliq A1 Mikael Leijon A1 Sándor Belák A1 Jan Komorowski YR 2016 UL http://biorxiv.org/content/early/2016/03/20/044909.abstract AB Background The underlying strategies used by influenza A viruses (IAVs) to adapt to new hosts while crossing the species barrier are complex and yet to be understood completely. Several studies have been published identifying singular genomic signatures that indicate such a host switch. The complexity of the problem suggested that in addition to the singular signatures, there might be a combinatorial use of such genomic features, in nature, defining adaptation to hosts..Results We used computational rule-based modeling to identify combinatorial sets of interacting amino acid (aa) residues in 12 proteins of IAVs of H1N1 and H3N2 subtypes. We built highly accurate rule-based models for each protein that could differentiate between viral aa sequences coming from avian and human hosts,. We found 68 combinations of aa residues associated to host adaptation (HAd) on HA, M1, M2, NP, NS1, NEP, PA, PA-X, PB1 and PB2 proteins of the H1N1 subtype and 24 on M1, M2, NEP, PB1 and PB2 proteins of the H3N2 subtypes. In addition to these combinations, we found 132 novel singular aa signatures distributed among all proteins, including the newly discovered PA-X protein, of both subtypes. We showed that HA, NA, NP, NS1, NEP, PA-X and PA proteins of the H1N1 subtype carry H1N1-specific and HA, NA, PA-X, PA, PB1-F2 and PB1 of the H3N2 subtype carry H3N2-specific HAd signatures. M1, M2, PB1-F2, PB1 and PB2 of H1N1 subtype, in addition to H1N1 signatures, also carry H3N2 signatures. Similarly M1, M2, NP, NS1, NEP and PB2 of H3N2 subtype were shown to carry both H3N2 and H1N1 HAd signatures.Conclusions To sum it up, we computationally constructed simple IF-THEN rule-based models that could distinguish between aa sequences of virus particles originating from avian and human hosts. From the rules we identified combinations of aa residues as signatures facilitating the adaptation to specific hosts. The identification of combinatorial aa signatures suggests that the process of adaptation of IAVs to a new host is more complex than previously suggested. The present study provides a basis for further detailed studies with the aim to elucidate the molecular mechanisms providing the foundation for the adaptation process.aaAmino acidsCAGCombinatorial accuracy gainHAHemagglutininIAVsInfluenza A virusesLHSLeft hand sideM1Matrix protein 1M2Matrix protein 2MCCMathew’s correlation coefficientMCFSMonte carlo feature selectionNANeuraminidaseNEPNuclear export proteinNPNucleoproteinNS1Non structural protein 1NS2Non structural protein 2PAPolymerase acidic proteinPB1Polymerase basic protein 1PB2Polymerase basic protein 2SnSensitivitySpSpecificity