%0 Journal Article %A Alex Rubinsteyn %A Timothy O'Donnell %A Nandita Damaraju %A Jeff Hammerbacher %T Predicting Peptide-MHC Binding Affinities With Imputed Training Data %D 2016 %R 10.1101/054775 %J bioRxiv %P 054775 %X Predicting the binding affinity between MHC proteins and their peptide ligands is a key problem in computational immunology. State of the art performance is currently achieved by the allele-specific predictor NetMHC and the pan-allele predictor NetMHCpan, both of which are ensembles of shallow neural networks. We explore an intermediate between allele-specific and pan-allele prediction: training allele-specific predictors with synthetic samples generated by imputation of the peptide-MHC affinity matrix. We find that the imputation strategy is useful on alleles with very little training data. We have implemented our predictor as an open-source software package called MHCflurry and show that MHCflurry achieves competitive performance to NetMHC and NetMHCpan. %U https://www.biorxiv.org/content/biorxiv/early/2016/05/22/054775.full.pdf