PT - JOURNAL ARTICLE AU - Alex Rubinsteyn AU - Timothy O'Donnell AU - Nandita Damaraju AU - Jeff Hammerbacher TI - Predicting Peptide-MHC Binding Affinities With Imputed Training Data AID - 10.1101/054775 DP - 2016 Jan 01 TA - bioRxiv PG - 054775 4099 - http://biorxiv.org/content/early/2016/05/23/054775.short 4100 - http://biorxiv.org/content/early/2016/05/23/054775.full AB - Predicting the binding affinity between MHC proteins and their peptide ligands is a key problem in computational immunology. State of the art performance is currently achieved by the allele-specific predictor NetMHC and the pan-allele predictor NetMHCpan, both of which are ensembles of shallow neural networks. We explore an intermediate between allele-specific and pan-allele prediction: training allele-specific predictors with synthetic samples generated by imputation of the peptide-MHC affinity matrix. We find that the imputation strategy is useful on alleles with very little training data. We have implemented our predictor as an open-source software package called MHCflurry and show that MHCflurry achieves competitive performance to NetMHC and NetMHCpan.